Understand Your Code. At Any Scale.

Semantic search that finds what code does, not just what it says. Works locally. Scales across repos. Free for production use.

Source-available · BSL 1.1 · 18 languages · Works offline

42 repos indexed for $0.70

$ grep 'error handling'
$ codeindex search 'error handling'

Code Search is Broken

grep / ripgrep
Fast but dumb

Keyword matches only, no cross-repo awareness. Searching "error handling" returns every comment and string literal — not actual error handlers.

IDE Search
Local and limited

One project at a time. Can't show patterns across your architecture. No structural awareness beyond the current workspace.

GitHub Search
Cloud, limited

GitHub-only, no cross-repo intelligence, no semantic understanding. And it costs money at scale.

Your teams search code every day. They shouldn't need three tools.

How codeindex Works

Semantic Search

Finds code intent, not strings.

Search "authentication middleware": grep returns noise; codeindex returns validators, handlers, and schemas.

1.4x better ranking vs expert grep
$ grep "authentication middleware"
config.ts:12 auth_middleware: true
README.md:34 # Authentication Middleware
test/mock.ts:8 // stub auth middleware
$ codeindex search "authentication middleware"
src/middleware/auth.ts — JWT validation and session refresh
src/api/guards/permissions.ts — Role-based access control middleware
src/auth/schemas/token.ts — Token validation schemas
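
Ranking by meaning rather than by string match is commonly done by comparing embedding vectors. The sketch below is not codeindex's implementation; it only illustrates the general idea with cosine similarity. The 3-dimensional vectors, the `IndexedChunk` type, and the function names are all made up for the example (real embeddings have hundreds of dimensions and come from a model).

```typescript
// Illustrative only: toy vectors stand in for real embeddings.
type IndexedChunk = { path: string; summary: string; vector: number[] };

// Cosine similarity: 1 means same direction, 0 means unrelated.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("vector length mismatch");
  const dot = a.reduce((sum, value, i) => sum + value * (b[i] ?? 0), 0);
  const normA = Math.sqrt(a.reduce((sum, value) => sum + value * value, 0));
  const normB = Math.sqrt(b.reduce((sum, value) => sum + value * value, 0));
  if (normA === 0 || normB === 0) return 0;
  return dot / (normA * normB);
}

// Score every chunk against the query vector and keep the best topK.
function rankBySimilarity(query: number[], chunks: IndexedChunk[], topK: number): IndexedChunk[] {
  return chunks
    .map((chunk) => ({ chunk, score: cosineSimilarity(query, chunk.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, topK)
    .map((entry) => entry.chunk);
}

// Hypothetical index entries; the vectors are invented for the example.
const chunks: IndexedChunk[] = [
  { path: "src/middleware/auth.ts", summary: "JWT validation and session refresh", vector: [0.9, 0.1, 0.0] },
  { path: "README.md", summary: "Authentication Middleware heading", vector: [0.2, 0.9, 0.1] },
  { path: "src/auth/schemas/token.ts", summary: "Token validation schemas", vector: [0.8, 0.2, 0.1] },
];

// Hypothetical embedding of the query "authentication middleware".
const queryVector = [1.0, 0.0, 0.0];
const topMatches = rankBySimilarity(queryVector, chunks, 2);
```

In this toy data the README heading mentions the query words but points in a different direction, so it ranks below the two files that actually implement authentication.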

Cross-Repo Intelligence

See dependencies across your architecture.

`codeindex xref UserDTO` finds every caller across all indexed repos. Map blast radius in seconds, not hours.

Seconds to map blast radius across repos
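
One way to picture a cross-repo lookup is an index from symbol name to every place that references it. This is a minimal sketch under that assumption, not how `codeindex xref` is actually built; `SymbolIndex`, its methods, and the repo and file names are hypothetical.

```typescript
// Illustrative only: a symbol-to-references map spanning several repos.
type Reference = { repo: string; file: string; line: number };

class SymbolIndex {
  private refs = new Map<string, Reference[]>();

  // Record one place where a symbol is used.
  addReference(symbol: string, ref: Reference): void {
    const existing = this.refs.get(symbol) ?? [];
    existing.push(ref);
    this.refs.set(symbol, existing);
  }

  // Every recorded reference to the symbol, across all repos.
  xref(symbol: string): Reference[] {
    return this.refs.get(symbol) ?? [];
  }

  // "Blast radius" here means the distinct repos that reference the symbol.
  affectedRepos(symbol: string): string[] {
    return Array.from(new Set(this.xref(symbol).map((ref) => ref.repo))).sort();
  }
}

// Hypothetical repos and files, invented for the example.
const index = new SymbolIndex();
index.addReference("UserDTO", { repo: "billing-service", file: "src/invoice.ts", line: 42 });
index.addReference("UserDTO", { repo: "web-app", file: "src/profile.tsx", line: 7 });
index.addReference("UserDTO", { repo: "billing-service", file: "src/refund.ts", line: 19 });

const callers = index.xref("UserDTO");
const blastRadius = index.affectedRepos("UserDTO");
```

Here `callers` holds three references and `blastRadius` lists the two repos a change to `UserDTO` would touch.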

Agent-Native (MCP)

Claude, Cursor, and Windsurf integrate instantly.

Session caching, batch queries, multi-agent deduplication. Your AI tools get smarter context at lower cost.

20-30% estimated savings from session caching alone
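
To show why caching and deduplication cut cost, here is a small sketch of a per-session cache that stores the in-flight promise for each query, so repeated or concurrent identical queries share one backend call. It is an assumption about the general technique, not codeindex's MCP server code; `withSessionCache`, `SearchFn`, and `fakeBackend` are hypothetical names.

```typescript
// Illustrative only: memoize searches for the lifetime of one session.
type SearchFn = (query: string) => Promise<string[]>;

function withSessionCache(search: SearchFn): { search: SearchFn; backendCalls: () => number } {
  const cache = new Map<string, Promise<string[]>>();
  let calls = 0;

  const cached: SearchFn = (query) => {
    // Normalize so trivially different spellings share one cache entry.
    const key = query.trim().toLowerCase();
    let pending = cache.get(key);
    if (pending === undefined) {
      calls += 1;
      pending = search(query);
      cache.set(key, pending);
      // Drop failed lookups so a later retry can reach the backend again.
      pending.catch(() => {
        cache.delete(key);
      });
    }
    // Caching the promise itself also deduplicates concurrent callers.
    return pending;
  };

  return { search: cached, backendCalls: () => calls };
}

// Stand-in backend; a real one would run the actual search.
const fakeBackend: SearchFn = async (query) => [`result for ${query}`];
const session = withSessionCache(fakeBackend);

void session.search("Auth Middleware");
void session.search("auth middleware ");
void session.search("payment handling");
```

The three calls above reach the backend only twice, because the first two normalize to the same key.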

Benchmarked Against Real Code

330 queries · 19 repos · 16 languages · vs expert grep patterns

1.4x better ranking quality · 0.718 MRR · 16 languages tested · $0.70 to index 42 repos

Tested on kubernetes, elasticsearch, langchain, discourse, and other public repos
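
For readers unfamiliar with the metric: MRR (mean reciprocal rank) averages 1/rank of the first relevant result over all queries, so 1.0 would mean the right answer is always ranked first. The sketch below shows the calculation on made-up ranks; it is not the benchmark harness, and the sample data is invented.

```typescript
// Illustrative only: mean reciprocal rank over a set of queries.
// Each entry is the 1-based rank of the first relevant result,
// or null when no relevant result was returned for that query.
function meanReciprocalRank(firstRelevantRanks: Array<number | null>): number {
  if (firstRelevantRanks.length === 0) return 0;
  const total = firstRelevantRanks.reduce<number>(
    (sum, rank) => sum + (rank === null ? 0 : 1 / rank),
    0,
  );
  return total / firstRelevantRanks.length;
}

// Made-up ranks for four queries: hit at 1, hit at 2, miss, hit at 4.
const exampleRanks: Array<number | null> = [1, 2, null, 4];

// (1 + 0.5 + 0 + 0.25) / 4 = 0.4375
const mrr = meanReciprocalRank(exampleRanks);
```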

Built for How You Work

Solo Developers
Find code in your first hour, not your first week

Search "payment handling" and find all payment logic in seconds — no grepping through directory trees.

AI Agents
Lower-cost agent runs

Estimated savings: session caching (20-30%), batch queries (15-20%), deduplication (10-25%). 42 repos indexed for $0.70.

Platform Teams
Architecture visibility in seconds

`codeindex xref` traces dependencies across repos. Map blast radius before you ship.

Enterprise
Security by design

Local embeddings, pre-embedding secret scanning, scoped tokens, air-gapped deployment. Your code never leaves your network.
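
Pre-embedding secret scanning means checking source text for credentials before it is handed to an embedding model. A minimal sketch of that idea follows, assuming simple regex rules; the patterns, the `redactSecrets` function, and the sample input are hypothetical and far less thorough than a real scanner.

```typescript
// Illustrative only: redact likely secrets before text is embedded.
const SECRET_PATTERNS: Array<{ name: string; pattern: RegExp }> = [
  // AWS access key IDs start with AKIA followed by 16 uppercase letters or digits.
  { name: "aws-access-key-id", pattern: /AKIA[0-9A-Z]{16}/g },
  // PEM private key headers.
  { name: "private-key-header", pattern: /-----BEGIN [A-Z ]*PRIVATE KEY-----/g },
  // Quoted values assigned to names like apiKey, secret, or token.
  { name: "assigned-secret", pattern: /(?:api[_-]?key|secret|token)\s*[:=]\s*["'][^"']{8,}["']/gi },
];

function redactSecrets(source: string): { text: string; findings: string[] } {
  const findings: string[] = [];
  let text = source;
  for (const { name, pattern } of SECRET_PATTERNS) {
    // Replace each match with a marker and record which rule fired.
    text = text.replace(pattern, () => {
      findings.push(name);
      return `[REDACTED:${name}]`;
    });
  }
  return { text, findings };
}

// Made-up input with fake credentials.
const sample = [
  'const apiKey = "sk_live_1234567890";',
  'const id = "AKIAABCDEFGHIJKLMNOP";',
  "const port = 8080;",
].join("\n");

const scan = redactSecrets(sample);
```

Only `scan.text` would go on to the embedding step; `scan.findings` can be logged so the team knows which files need cleanup.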

Up and Running in 30 Seconds

Minimal config. Bring your own API keys or run locally with Ollama.

Install
git clone https://github.com/smart-knowledge-systems/codeindex.git && cd codeindex && bun install && bun src/index.ts init
Search
bun src/index.ts search "authentication middleware"

How We Compare

Feature · codeindex · grep · GitHub · Sourcegraph
Semantic search · Limited
Cross-repo · Config required
Self-hosted · N/A · Partial
Agent integration (MCP)
Source-available
Cost · Free + infra · Free · $$$$$

When NOT to Use codeindex

We tested this. Grep wins in specific scenarios. We're honest about tradeoffs.

Small codebase (<10K LOC)

grep is fine — the overhead isn't worth it.

Exact filename search

Use `find` or `fd`. We're not a file finder.

Need <500ms cold start

Embedding has a startup cost. Warm queries are fast, but the first run takes time.

Unsupported language

We support 18 languages. Adding more is straightforward via tree-sitter, but we won't pretend we cover everything yet.

Professional Services

Need Expert Help?

codeindex is free and self-serve. But if your team needs hands-on training or a purpose-built context layer, we can help.

Agent Ready Workshop

Teach your team to use coding agents effectively

A 1-2 day hands-on workshop where we train your engineering team on your actual codebase. Your team bought AI coding tools — we make sure they get real value from them.

  • Hands-on agent workflow training with your team's real code
  • CLAUDE.md and AGENTS.md starter files written during the session
  • Before/after measurement on real tasks from your backlog
  • Agent workflow playbook customized to your stack
  • 30-day follow-up check-in
Starting at $5,000
Book a Workshop

Not right for: Teams under 5 engineers, or teams already getting strong results from AI coding agents.

CodeReady Pilot

We build the context layer and prove it works

We build a structured context layer for your codebase using codeindex, integrate it with your AI tools via MCP, and run a before/after eval on your actual tasks. You see the measured improvement before you pay.

  • Structured context layer (AGENTS.md, CLAUDE.md) built with codeindex
  • codeindex deployed and configured for your codebase
  • MCP server integration for Claude Code, Cursor, or Copilot
  • Before/after eval on representative tasks from your backlog
  • Benchmark report with quantified improvement
  • Knowledge transfer: how to maintain and extend the context layer

We define success criteria together before starting. If we don't hit them, no charge. You keep everything we build.

Starting at $15,000
Start a CodeReady Pilot

Not right for: Codebases under 50K lines of code, or teams that don't use AI coding agents. For smaller projects, codeindex self-serve is likely all you need.

CodeReady Retainer

After a successful Pilot, ongoing context maintenance with automated drift detection, quarterly reviews, and priority support. Starting at $3,000/month.

Built by a knowledge management consultant (MBA, PMP) who builds production software with coding agents and structured context.

I built cidx-cloud — auth, billing, managed MCP server, Cloudflare deployment — in 3 days as a solo developer. That's not a client case study. It's a data point about what's possible with the right approach.

Ready to understand your codebase?

Get Started
Install and search in 30 seconds
git clone https://github.com/smart-knowledge-systems/codeindex.git && cd codeindex && bun install && bun src/index.ts init
Try cidx-cloud (Early Access)
Managed search for your team
Learn more →
Train Your Team
Agent Ready Workshop — hands-on training on your codebase
Schedule a call →
Build Your Context Layer
CodeReady Pilot — we build the context layer and prove it works
Schedule a call →
codeindex is licensed under BSL 1.1 — source-available, free for production use, converts to Apache 2.0 in 2030.