repowise-devrepowise
- Files
- 729 files
CODEBASE INTELLIGENCE FOR AI AGENTS · OPEN SOURCE · HOSTED
repowise indexes any repo into a documented dependency graph in under 30 seconds — feeding Claude Code, Cursor, and your own agents real architecture, ownership, and decisions instead of guesses.
Live on Repowise
We index and keep these repos fresh ourselves — open one to see exactly what Repowise does on a real, large codebase. No mock data, no signup.
Install once. Choose the interface that fits your workflow — or use all three. They share the same data, the same intelligence, the same stores.
For the solo developer
pip install repowise. Run init, update, search, dead-code, and 10 more commands. Works fully offline with Ollama. Your code never leaves your machine.
For AI-native workflows
8 tools that plug into Claude Code, Cursor, or Cline. Your AI agent calls get_context() instead of reading 40 files. Config auto-generated after repowise init.
For the whole team
Browse the wiki, explore the dependency graph in D3, view hotspot tables, track doc freshness, and chat with your codebase. All served from repowise serve.
One pip install. Works offline with Ollama. No account required to get started — run everything on your machine.
Works on Python 3.10+. Supports Anthropic, OpenAI, Gemini, and Ollama (fully offline).
Parses your codebase, builds a dependency graph, mines git history, and generates wiki pages with an LLM.
Run after commits. Detects changed files, cascades updates through the dependency graph.
Full dashboard: wiki explorer, interactive dependency graph, hotspot tables, ownership maps, and codebase chat.
Graph structure, git history, generated documentation, and architectural decisions — four layers that compound into genuine codebase understanding.
repowise parses your codebase into a directed dependency graph using tree-sitter ASTs across 10 languages. PageRank identifies your most critical symbols. Community detection discovers logical modules even when directory structure doesn't reflect them.

GitIndexer mines your commit history to classify files as hotspots or stable, compute ownership, extract significant commit messages, and discover co-change partners. These signals flow into generation prompts, so your wiki explains why code was written — not just what it does.

Each wiki page is generated with 9 layers of context: source code, symbol signatures, graph metrics, git history, import summaries, RAG context, co-change docs, dead code findings, and reverse imports. Confidence scores decay when source changes — stale pages auto-regenerate.

Decisions are extracted from four sources: inline markers (# WHY:, # DECISION:, # TRADEOFF:), git archaeology, README/docs mining, and manual CLI capture. Each decision tracks staleness — when affected files change, the decision is flagged for review.

The MCP server exposes the entire wiki as structured, queryable tools. Instead of reading 40 source files, your agent calls get_overview() and gets a current architecture summary. Config auto-generated for Claude Code, Cursor, and Cline.
get_overview()Architecture summary, module map, entry points, tech stack.
First call when exploring an unfamiliar codebase.
get_context()Docs, ownership, history, decisions, freshness for files, modules, or symbols. Pass multiple targets in one call.
Before reading or modifying specific code.
get_risk()Hotspot score, dependents, co-change partners, risk summary. Also returns top 5 global hotspots.
Before modifying files — assess blast radius.
get_why()Three modes: natural language search over decisions, path-based lookup, or health dashboard.
Before architectural changes — understand existing intent.
search_codebase()Semantic search over the full wiki using LanceDB or pgvector. Natural language queries.
When you don't know where something lives.
get_dependency_path()Connection path between two files or modules in the dependency graph.
Understand how two things are connected.
get_dead_code()Unreachable files, unused exports, zombie packages — sorted by confidence and cleanup impact.
Before cleanup or refactoring tasks.
get_architecture_diagram()Mermaid diagram for the full repo or a specific module scope.
For documentation, presentations, or onboarding.

The chat agent has access to all 8 MCP tools. It searches the wiki, reads the graph, checks git history, and cites architectural decisions — all automatically. Answers are grounded in your actual codebase, not hallucinated from training data.

After every repowise update, the CLAUDE.md is regenerated from real data — graph metrics, git history, dead code findings. No LLM needed. No templates. Just your codebase, distilled into the context your AI agent needs.
Also generates cursor.md for Cursor users. Same data, different format.

repowise mines commit history into actionable signals: which files break most often, who owns what, and which files are secretly coupled. These signals drive generation depth, confidence decay, and risk assessment.
Top 25% churn + complexity files flagged automatically. Hotspot pages get deeper generation and faster confidence decay.
git blame + commit history reveals who owns what. Bus factor calculated per module. Single-owner modules highlighted.
Files that change together without import relationships. These hidden couplings are invisible to static analysis.
Large diffs, refactors, and migration commits are identified and included in generation prompts for richer context.
When source files get new commits, associated wiki pages decay in confidence. Stale pages auto-regenerate on next run.
All git metrics use rolling 90-day windows by default. Reflects current activity, not ancient history.

Most tools solve one slice of the problem. repowise is the only open-source platform that combines auto-generated documentation, git intelligence, decision records, and MCP tools in a single self-hostable package.
| Feature | repowise | Google CodeWiki | DeepWiki | CodeScene | Sourcegraph |
|---|---|---|---|---|---|
| Self-hostable OSS | ✓ | — | — | — | — |
| Works with private repos | ✓ | — | ✓ | ✓ | ✓ |
| Auto-generated wiki (LLM) | ✓ | ✓ | ✓ | — | — |
| Git intelligence (hotspots / ownership / co-changes) | ✓ | — | — | ✓ | — |
| Dead code detection | ✓ | — | — | — | — |
| Architectural decision records | ✓ | — | — | — | — |
| MCP server for AI agents | ✓ | — | — | — | — |
| Semantic search | ✓ | ✓ | ✓ | — | ✓ |
| Doc freshness / confidence scoring | ✓ | — | — | — | — |
| CLAUDE.md auto-generation | ✓ | — | — | — | — |
| Codebase chat (agentic) | ✓ | ✓ | ✓ | — | — |
| Dependency graph visualization | ✓ | ✓ | ✓ | ✓ | ✓ |
| Provider choice (4 LLM providers) | ✓ | — | — | — | — |
| Privacy (code never leaves your infra) | ✓ | — | — | ✓ | ✓ |
Self-assessed against publicly documented features as of May 2026. Vendor capabilities change — please verify before committing to any tool.
Everything in the open-source product, plus team features. No infrastructure to manage. We handle ingestion, storage, and updates.
Everyone on the team queries the same wiki, same graph, same decisions. New hires get onboarded with get_overview() instead of a stale Notion doc.
Extend repowise with community plugins: custom parsers, integration hooks, specialized MCP tools, and export formats.
Control who can trigger regeneration, view sensitive repos, or modify architectural decisions. SSO integration included.
Track doc freshness trends, generation costs, token usage, hotspot evolution over time, and team adoption metrics.
Learn about codebase intelligence, MCP tools for AI agents, and how repowise compares to other tools.

For decades, 'good documentation' was the white whale of software engineering. We all knew it was necessary, but the friction of maintaining it—manually upda...

Technical debt is rarely a single catastrophic event. Instead, it’s a slow accumulation of 'paper cuts'—a complex module here, a high-churn file there, and a...

Every engineering team eventually hits the 'Documentation Wall.' It’s the moment when the codebase grows faster than the human capacity to describe it. You’v...
pip install repowise. Run on your machine, your server, your CI. AGPL-3.0. Full feature set. Your code never leaves your infrastructure.
Managed indexing, team workspaces, and semantic chat. Free tier for public repos up to 250 MB; auto-sync and bigger repos on Pro. Pro at $15/mo with LLM credits included. Sign up free →
On-prem deployment, SSO, role-based access, dedicated support, SLAs. Reach out below and we'll scope it together.