Polaris — MCP Semantic Search for Claude Code, Cursor & Coding Agents

Polaris is a local-first RAG server that gives coding agents fast, ranked answers from your project's documentation — over the Model Context Protocol (MCP). Drop it next to any repo and Claude Code, Cursor, or Codex stops grepping blindly: a single semantic search call returns the exact section the agent needs, typically 10–40× cheaper in tokens than the usual grep-and-read loop.

Single static binary. No API keys, no cloud, no runtime dependencies — your code never leaves the machine. Named after the North Star: a fixed reference point so your agent always knows where it is.

curl -fsSL https://raw.githubusercontent.com/girard-g/polaris/main/install.sh | bash
polaris setup            # wire into current project (.mcp.json + agent files)
polaris index ./docs     # index your markdown
polaris serve            # MCP server for Claude Code / Cursor / Codex

Features

Hybrid semantic + lexical search — vector KNN (sqlite-vec) and BM25 (FTS5) fused with Reciprocal Rank Fusion, then MMR-reranked for diversity.
MCP-native — exposes search, index, status tools over stdio. Works with Claude Code, Cursor, Codex, and any MCP-compatible coding agent.
Local-first, zero cloud — embeddings run on-CPU via a local ONNX model. No API keys, no telemetry, your code never leaves the machine.
Single static binary — one curl | bash install. No Python, no Node, no Docker.
Auto-watch — polaris watch ./docs re-indexes on file changes within ~500 ms, so the agent always sees fresh docs.
Token-savings analytics — polaris savings shows the cumulative tokens you've saved vs. the agent grepping and reading raw files.

Why Polaris — Token Savings vs. Grep + Read

For documentation questions, calling the Polaris MCP search tool is dramatically cheaper than the usual agent loop of grepping the docs tree and reading the matching files. Measured on a real query ("how does Polaris embed documents?") against this repo's docs/:

Approach	Tokens returned	vs. grep + read
`grep -rn "embed" docs/` + read `embedding.md` + read `architecture.md`	~12,700	1×
MCP `search`, vague query, `top_k=5`	~925	~14× cheaper
MCP `search`, focused query (2-4 domain nouns), `top_k=2`	~285	~45× cheaper

Estimates use ~4 chars/token. Accuracy actually improves with the focused query: the top-ranked chunk is the canonical Embedding Pipeline section instead of the generic overview.

Run polaris savings to see your own cumulative number once you've made some queries.

The MCP server's tool descriptions and server instructions brief calling agents on these patterns (use specific domain terms, start at top_k=2), so a well-behaved client picks them up automatically.

polaris index ./docs
polaris search "how does chunking work"
polaris watch ./docs        # auto re-index on file changes
polaris serve               # stdio MCP server for Claude Code

How It Works

Polaris uses hybrid search: it indexes markdown as both vector embeddings (via a local ONNX model) and full-text search entries (FTS5). At query time it runs vector KNN and BM25 in parallel, fuses the results with Reciprocal Rank Fusion, applies a heading boost, and reranks with MMR for diversity.

query → vector KNN + BM25 → RRF fusion → heading boost → MMR rerank → top-k results

Library Use

Polaris's retrieval pipeline is also available as a Rust library (polaris-core):

use polaris_core::{Bank, BankConfig, IndexOpts, SearchOpts, SharedEmbedding};

let embed = SharedEmbedding::load("nomic-embed-text-v1.5", 512)?;
let bank = Bank::open(BankConfig {
    repo_root: "./docs".into(),
    index_path: "./docs/.polaris/index.db".into(),
    embedding_dim: 512,
    model_id: "nomic-embed-text-v1.5".into(),
    ..Default::default()
}, embed)?;

bank.index_path(std::path::Path::new("./docs"), IndexOpts::default())?;
let results = bank.search("how does chunking work", SearchOpts { top_k: 5 })?;

For multi-bank searches with score fusion, use BankSet. For incremental
updates after a git pull, use Bank::index_diff(&changed, &removed).

Quick Start

Install (recommended)

curl -fsSL https://raw.githubusercontent.com/girard-g/polaris/main/install.sh | bash

Downloads the latest release binary and installs it to ~/.local/bin (or /usr/local/bin with --system). Run polaris update later to upgrade in place.

Windows: build from source (see below).

Install from source

cargo build --release
# binary at: ./target/release/polaris

For Windows or any platform where the curl | bash install does not have a release asset, build from source.

First search

polaris setup
polaris index ./docs
polaris search "your first query"

The first search downloads the embedding model (~137 MB) to a user-global cache shared across projects (default ~/.cache/polaris/models/ on Linux). See Configuration → Model Caching for the full resolution order and the POLARIS_CACHE_DIR override.

Usage

Setup

polaris setup            # configure current directory
polaris setup ./my-proj  # configure a specific directory

Creates .mcp.json (pointing at the running polaris binary) and adds polaris-related entries to .gitignore (polaris.db, polaris.db-shm, polaris.db-wal, .fastembed_cache/, .mcp.json itself). Idempotent: re-running is safe.

polaris setup also writes a marker-delimited Polaris MCP block into CLAUDE.md, AGENTS.md, and GEMINI.md at the project root, steering compatible coding agents toward polaris.search for documentation queries. Existing user content in those files is preserved (the block is delimited by  …  markers and only that range is rewritten on re-run). Pass --no-agents to skip these three files.

Additionally, polaris setup installs a Claude Code auto-index hook into .claude/settings.json. The hook fires after every Write, Edit, or MultiEdit and re-indexes the touched file if it is a .md file under an already-indexed root. This keeps the search index current without needing polaris watch. Pass --no-hooks to skip hook installation (and the initial index pass). Re-run polaris setup --no-hooks on a project that already has the hook to remove it.

Index

polaris index ./docs                # recursive, incremental
polaris index ./docs --force        # re-embed all files
polaris index ./docs --no-recursive # top-level only

Only .md files are indexed. Unchanged files (same SHA256) are skipped automatically.

Watch

Claude Code users don't need polaris watch — the auto-index hook installed by polaris setup re-indexes .md files on every edit automatically. Use polaris watch for non-Claude-Code workflows (Cursor, Codex, manual edits, git pull, etc.).

polaris watch ./docs                # watch and auto re-index on changes
polaris watch ./docs ./notes        # multiple paths
polaris watch ./docs --no-recursive # top-level only

Runs an initial index on start, then re-indexes affected paths automatically within ~500 ms of a file change. Ctrl+C to stop.

Search

polaris search "authentication flow"
polaris search "database schema" -k 10

Status

polaris status

Database   : polaris.db
Documents  : 13
Chunks     : 145
DB size    : 2.2 KB
Embed dim  : 512
Last index : 2026-02-26T14:41:28Z

Savings

polaris savings              # cumulative summary
polaris savings --history    # per-query log (newest first)
polaris savings --output json

Reports the cumulative tokens you've saved by going through Polaris instead of grep + read. The baseline for each query is the total content of the unique files in the result set — i.e., what an agent without Polaris would have opened after grepping. Tokens are estimated at ~4 chars/token.

MCP Server

polaris serve

Starts a stdio MCP server. Run polaris setup once per project to write .mcp.json automatically, or add it manually:

{
  "mcpServers": {
    "polaris": {
      "command": "/path/to/polaris",
      "args": ["serve"]
    }
  }
}

Tools exposed: search, index, status.

Polaris vs. Alternatives

	Polaris	grep + read (default agent loop)	Hosted RAG (Pinecone, LlamaIndex Cloud)	File-system MCP servers
Setup	one `curl \| bash`	none	account, API keys, infra	one `npx`
Privacy	local, code never leaves machine	local	code uploaded to vendor	local
Token cost per query	~285–925 tokens	~12,700 tokens	varies (network round-trip)	high (returns raw files)
Ranking	hybrid (vector + BM25 + RRF + MMR)	none	vector-only by default	none

Numbers from this repo's docs; your mileage will vary.

Polaris Pro

Polaris Pro extends the open-source server with polaris-ingest — a companion ingester for code, PDF, and .docx — and a web UI for browsing the index without an agent. Same hybrid retrieval pipeline; broader inputs and a human-friendly view on top.

In development. Join the waitlist — early signups get 50% off at launch.

Frequently Asked Questions

What is Polaris?

A local RAG server that lets coding agents semantically search your project docs over MCP.

What is MCP?

The Model Context Protocol, an open standard from Anthropic that lets AI agents talk to external tools. Polaris implements an MCP server so Claude Code, Cursor, Codex, and similar agents can call it directly.

What's the difference between Polaris and Polaris Pro?

The open-source Polaris server indexes Markdown only and is CLI-driven. Polaris Pro adds polaris-ingest for code, PDF, and .docx, plus a web UI. Pro is in development — see the waitlist.

Does Polaris work with Claude Code, Cursor, and Codex?

Yes. Any MCP-compatible client works. polaris setup writes the .mcp.json and updates CLAUDE.md / AGENTS.md / GEMINI.md automatically.

Do I need an API key or cloud account?

No. Embeddings run locally via a bundled ONNX model. Nothing is sent over the network after the initial model download.

How does Polaris compare to LangChain, LlamaIndex, or Haystack?

Those are full RAG frameworks for building applications. Polaris is a thin, zero-config server purpose-built for the coding-agent use case: drop in next to a repo, get MCP search.

What file types does Polaris index?

The open-source server indexes Markdown (.md). For code, PDF, and .docx, see polaris-ingest in Polaris Pro.

Can I use Polaris on private or proprietary code?

Yes. Everything runs locally; there's no telemetry. The MIT license permits commercial use.

How do I update Polaris?

Run polaris update to upgrade in place, or re-run the install one-liner.

Configuration

Create polaris.toml in the working directory (all fields optional):

db_path = "polaris.db"
embedding_dim = 512          # 64–768, Matryoshka truncation
max_chunk_tokens = 450
chunk_overlap_chars = 200
model_id = "nomic-embed-text-v1.5"
mmr_lambda = 0.7             # 0 = diversity, 1 = relevance
mmr_candidate_multiplier = 3
heading_boost = 0.05
rrf_k = 60

Config is resolved in order: --config <path> → ./polaris.toml → ~/.config/polaris/polaris.toml → defaults.

CLI overrides: polaris --dim 384 --db /tmp/test.db search "query"

Known Limitations

Markdown only — only .md files are indexed; .txt, .rst, code files, and PDFs are ignored
Single-user — all tool calls are serialized through a single DB mutex; not designed for concurrent MCP sessions
Approximate chunk offsets — start_byte/end_byte metadata is approximate and not verified post-split

Tech Stack


Embeddings	fastembed 5 (nomic-embed-text-v1.5, ONNX, CPU)
Vector search	sqlite-vec 0.1 (cosine KNN)
Full-text search	SQLite FTS5 (BM25)
Database	rusqlite 0.32 (bundled SQLite)
MCP server	rmcp 0.16 (stdio transport)

Documentation


docs/overview.md	Goals, architecture diagram, tech stack
docs/search.md	Hybrid search pipeline, scoring, MMR
docs/database.md	Schema, FTS5 sync, migrations
docs/configuration.md	All config fields and defaults
docs/cli.md	Full CLI reference
docs/mcp-server.md	MCP tools and integration
docs/architecture.md	Module map and data flow
docs/todos.md	Known limitations and roadmap

License

MIT — see LICENSE.