Code search & knowledge engine for the AI era. Semantic + full-text hybrid search, real-time indexing, call graph + code vectors + commit vectors + knowledge vectors β unified into one native MCP server.
Built natively for Claude Code/Codex CLI β zero daemon, zero config overhead.
π δΈζζζ‘£
Goes beyond keyword matching. Dense vector search understands code intent ("login logic" β authenticateUser), while BM25 full-text search locks in exact matches. Results are fused via RRF and re-ranked by Cross-Encoder for precision. When embedding API is unavailable, gracefully falls back to graph search β never breaks.
Call graph + code vectors + commit vectors + knowledge vectors β four dimensions of codebase awareness. Tree-sitter AST parses 7 languages to build complete function/class/method relationships:
- Who calls this function?
- What does this function depend on?
- Find code by describing what it does
Full build on first run, only re-processes changed files thereafter (MD5 diff). Auto-indexes on MCP startup and watches file changes during runtime. Auto-cleans orphaned embeddings β index never bloats.
Built specifically for Claude Code/Codex CLI MCP stdio protocol. Install registers MCP automatically β no manual config, no persistent daemon. Starts and exits with Claude Code, zero residue. All code and data stay local, no SaaS required.
curl -fsSL https://raw.githubusercontent.com/iohub/codexray/main/install.sh | shAuto-detects OS/arch/libc, downloads, installs, and registers MCP. Restart Claude Code after β done.
First run: codexray install auto-launches an interactive setup wizard for the embedding API (graph search works without configuration).
curl -L -o codexray.tar.gz https://github.com/iohub/codexray/releases/latest/download/codexray-linux-x64-musl.tar.gz
tar -xzf codexray.tar.gz
./codexray install && rm codexray.tar.gzOther platforms: replace linux-x64-musl with darwin-arm64, darwin-x64, or linux-x64 from the latest release.
git clone https://github.com/iohub/codexray.git && cd codexray
cargo build --release && ./rust-core/target/release/codexray installSource files
β Tree-sitter AST parse (7 languages)
β Extract functions / classes / methods
β Build call graph (PetCodeGraph)
β Batch embed via API (SQLite cache)
β Store vectors in LanceDB
β Build BM25 index in Tantivy
β Save to ~/.codexray/<project_hash>/
Idempotent: index builds are incremental β the first run is a full build, subsequent runs compare MD5 hashes and only re-process changed files.
βββββββββββββββββββββββ
User query ββββββββββββββ Embedding Model ββββ Query vector
βββββββββββββββββββββββ
β
βββββββββββββββββββββββββΌββββββββββββββββββββββββ
βΌ βΌ βΌ
βββββββββββββββ βββββββββββββββ βββββββββββββββ
β Dense Search β β Sparse Searchβ β Graph Search β
β (LanceDB ANN)β β (Tantivy BM25)β β (PetCodeGraph)β
ββββββββ¬ββββββββ ββββββββ¬βββββββ ββββββββ¬βββββββ
β β β
ββββββββββββββββββββββββΌβββββββββββββββββββββββ
βΌ
βββββββββββββββββββ
β RRF Fusion β β Reciprocal Rank Fusion
β (Top-20 candidates)β
ββββββββββ¬βββββββββ
β
βΌ
βββββββββββββββββββ
β Reranker β β Cross-Encoder fine re-ranking
β (Qwen3-Reranker)β scores each (query, code) pair
ββββββββββ¬βββββββββ
β
βΌ
βββββββββββββββββββ
β Final Results β β Top-5 (or Top-N)
βββββββββββββββββββ
| Stage | Technology | Role |
|---|---|---|
| Dense Search | LanceDB + Embedding Model | Semantic vector similarity |
| Sparse Search | Tantivy BM25 | Keyword & token matching |
| RRF Fusion | Reciprocal Rank Fusion | Merge heterogeneous scores fairly |
| Reranker | Cross-Encoder (Qwen3-Reranker-4B) | Full-interaction precision scoring |
| Fallback | PetCodeGraph | Graph-based name search (no API needed) |
If embedding/reranker are unavailable, the pipeline falls back gracefully to graph-based name search and BM25-only mode.
| Mode | When | Trigger |
|---|---|---|
| MCP server | On startup + file changes | codexray install + restart Claude Code |
The MCP server automatically indexes on startup, watches file changes during runtime, and injects CLAUDE.md for tool discovery.
- Config:
~/.codexray/config.json(global, shared across all projects) - Index:
~/.codexray/<md5(project_root)>/project.jsonβ Project metadatagraph.binβ Serialized call graphembeddings.lance/β LanceDB vector datatantivy_bm25/β BM25 full-text indexfile_hashes.jsonβ MD5 incremental trackingembedding_hashes.jsonβ Embedding incremental tracking
No daemon, no HTTP server. Every CLI command is a standalone process.
| Language | Functions | Structs/Classes | Call Graph |
|---|---|---|---|
| Rust | β | β | β |
| Python | β | β | β |
| JavaScript | β | β | β |
| TypeScript | β | β | β |
| Go | β | β | β |
| C/C++ | β | β | β |
| Java | β | β | β |
~/.codexray/config.json:
{
"embedding": {
"provider": "openai-compatible",
"model": "Qwen/Qwen3-Embedding-4B",
"api_token": "sk-...",
"api_base_url": "https://api.siliconflow.cn/v1",
"dimensions": 2560
},
"index": {
"min_code_block_length": 16,
"enable_reranker": true,
"hybrid": {
"enable_bm25": true,
"bm25_top_k": 100,
"vector_top_k": 100,
"rrf_k": 60,
"rrf_top_k": 20,
"short_code_threshold": 30,
"short_code_penalty": 0.5
},
"reranker": {
"enabled": true,
"model": "Qwen/Qwen3-Reranker-4B",
"api_token": "sk-...",
"api_base_url": "https://api.siliconflow.cn/v1/rerank",
"top_n": 5,
"candidate_multiplier": 5,
"timeout_secs": 60
}
},
"installed_hooks": {}
}| Model | Role | When |
|---|---|---|
Qwen/Qwen3-Embedding-4B |
Converts code β vectors for dense search | Index building |
Qwen/Qwen3-Reranker-4B |
Scores (query, code) pairs for precision | Search time |
Set via the interactive wizard on first run, or create manually. If embedding API is unavailable, graph-based search still works.
cd rust-core
# Build
cargo build
# Build release
cargo build --release
# Run tests
cargo test
# Run specific test
cargo test test_build_graph_functionality -- --nocaptureMIT
Built with: Tree-sitter Β· Petgraph Β· LanceDB Β· Tantivy Β· Tokio Β· Clap Β· Axum