Save up to 80% tokens when AI reads code — MCP server for token-efficient code navigation, AST-aware structural reading instead of dumping full files into context window
Token-efficient AI coding, enforced. Cuts context consumption in AI coding assistants by up to 90% without changing the way you work.
Three layers, each useful on its own, stronger together:
- **MCP tools** (`smart_read`, `read_symbol`, `read_for_edit`, …) — ask for an outline or load one function by name instead of the whole file.
- **PreToolUse hooks** — intercept heavy native calls (`Read` on large files, recursive `Grep`, unbounded `git diff`) and redirect to token-efficient alternatives.
- **tp-\* subagents** — Claude Code delegates with MCP-first behaviour and tight response budgets.

```
Traditional:  Read("user-service.ts")                → 500 lines       → ~3000 tokens
Token Pilot:  smart_read("user-service.ts")          → 15-line outline → ~200 tokens
              read_symbol("UserService.updateUser")  → 45 lines        → ~350 tokens
After edit:   read_diff("user-service.ts")           → ~20 tokens
```
Files under 200 lines are returned in full — zero overhead for small files.
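The small-file rule above can be sketched in a few lines (illustrative only — this is not Token Pilot's actual implementation, and `smartReadSketch` is a made-up name):

```typescript
// Illustrative sketch: small files pass through whole, large files get an outline.
const SMALL_FILE_LIMIT = 200; // lines; files under this are returned in full

function smartReadSketch(lines: string[]): string[] {
  if (lines.length < SMALL_FILE_LIMIT) {
    return lines; // zero overhead for small files
  }
  // Crude outline: keep only lines that look like top-level declarations.
  return lines.filter((l) =>
    /^\s*(export\s|function\s|class\s|interface\s|const\s)/.test(l)
  );
}
```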
Measured on public open-source repos. Files ≥50 lines only:
| Repo | Files | Raw Tokens | Outline Tokens | Savings |
|---|---|---|---|---|
| token-pilot (TS) | 55 | 102,086 | 8,992 | 91% |
| express (JS) | 6 | 14,421 | 193 | 99% |
| fastify (JS) | 23 | 50,000 | 3,161 | 94% |
| flask (Python) | 20 | 78,236 | 7,418 | 91% |
| Total | 104 | 244,743 | 19,764 | 92% |
`smart_read` outline savings only. Real sessions additionally benefit from the session cache, `read_symbol`, and `read_for_edit`. Reproduce: `npx tsx scripts/benchmark.ts`.
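As a sanity check, the Total row follows directly from the per-repo rows (numbers copied from the table above; the script is plain arithmetic, not part of Token Pilot):

```typescript
// Recompute the Total row of the benchmark table.
const rows = [
  { repo: "token-pilot (TS)", raw: 102_086, outline: 8_992 },
  { repo: "express (JS)", raw: 14_421, outline: 193 },
  { repo: "fastify (JS)", raw: 50_000, outline: 3_161 },
  { repo: "flask (Python)", raw: 78_236, outline: 7_418 },
];

const totalRaw = rows.reduce((s, r) => s + r.raw, 0);            // 244,743
const totalOutline = rows.reduce((s, r) => s + r.outline, 0);    // 19,764
const savings = Math.round((1 - totalOutline / totalRaw) * 100); // 92

console.log(`${totalRaw} → ${totalOutline} (${savings}% saved)`);
```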
```shell
npx -y token-pilot init
```
Creates (or merges into) .mcp.json with token-pilot + context-mode, then prompts to install tp-* subagents. Restart your AI assistant to activate.
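For reference, the merged `.mcp.json` typically ends up looking something like this (a sketch — the exact shape `init` writes may differ, and the `context-mode` entry assumes that package's standard npx invocation):

```json
{
  "mcpServers": {
    "token-pilot": {
      "command": "npx",
      "args": ["-y", "token-pilot"]
    },
    "context-mode": {
      "command": "npx",
      "args": ["-y", "context-mode"]
    }
  }
}
```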
- **PreToolUse hooks** — intercept heavy `Grep`/`Bash`/`Read` calls; redirect to efficient alternatives → hooks & modes
- **tp-\* subagents** (Claude Code only) — MCP-first delegates with haiku/sonnet model tiers and budget enforcement → agents reference
- **`tools/list`** to save ~2 k tokens per session → profiles & config

| Client | MCP tools | PreToolUse hooks | tp-* subagents |
|---|---|---|---|
| Claude Code | ✅ | ✅ | ✅ |
| Cursor | ✅ | ✅ | ❌ |
| Codex CLI | ✅ | ✅ | ❌ |
| Gemini CLI | ✅ | ✅ | ❌ |
| Cline (VS Code) | ✅ | ✅ | ❌ |
| Antigravity | ✅ | ✅ | ❌ |
Manual config snippets for each client → installation guide
`TOKEN_PILOT_MODE` controls how aggressively Token Pilot redirects heavy native tool calls:
| Value | Behaviour |
|---|---|
| `advisory` | Allow all — hooks pass through, advisory notes only |
| `deny` (default) | Block heavy Grep/Bash patterns; intercept large Read calls |
| `strict` | Deny + auto-cap MCP output (`smart_read` ≤ 2 000 tokens, `find_usages` → list mode, `smart_log` → 20 commits) |
```shell
TOKEN_PILOT_MODE=strict npx token-pilot
```
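The mode comes from the environment; a hedged sketch of the resolution logic (values and the `deny` default match the table above, but `resolveMode` itself is illustrative, not Token Pilot's source):

```typescript
// Resolve TOKEN_PILOT_MODE, falling back to the documented default "deny".
const MODES = ["advisory", "deny", "strict"] as const;
type Mode = (typeof MODES)[number];

function resolveMode(
  env: Record<string, string | undefined> = process.env
): Mode {
  const raw = env.TOKEN_PILOT_MODE ?? "deny"; // unset → default
  // Unknown values also fall back to the safe default.
  return (MODES as readonly string[]).includes(raw) ? (raw as Mode) : "deny";
}
```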
Token Pilot owns input tokens — what Claude reads from files, git, and search. The other half of a session (what Claude writes back, how it executes code, how it remembers state across days) is owned by separate tools. They compose cleanly:
| Tool | Owns | Typical savings |
|---|---|---|
| Token Pilot | code reads, git, search | 60-90% input |
| caveman | Claude's response prose (terse-speak skill) | ~75% output |
| ast-index | the structural indexer Token Pilot rides on | foundation |
| context-mode | sandboxed shell / python / js execution | 90%+ on big stdout |
A session that pairs token-pilot + caveman typically hits ~85-90% total reduction — each cuts a different half, no overlap. Install what you need; none of them assume the others are present.
Rules of thumb: read code → smart_read/read_symbol; execute code with big output → context-mode execute; bash-only agent → ast-index CLI. Never copy the whole stack into CLAUDE.md — Token Pilot's doctor warns when CLAUDE.md exceeds 60 lines.
TypeScript, JavaScript, Python, Go, Rust, Java, Kotlin, C#, C/C++, PHP, Ruby. Non-code (JSON/YAML/Markdown/TOML) gets structural summaries. Regex fallback handles most other languages.
Claude Code (plugin — recommended):
```shell
# Install on a new machine:
claude plugin marketplace add https://github.com/Digital-Threads/token-pilot
claude plugin install token-pilot@token-pilot

# Update to latest:
claude plugin update token-pilot
```
Other clients (Cursor, Codex, Cline, …):
```shell
# Install on a new machine:
npx -y token-pilot init

# Update to latest — npx always pulls fresh, just restart your client.
# Or if installed globally:
npm i -g token-pilot@latest
npx token-pilot install-hook
npx token-pilot install-agents --scope=user --force
npx token-pilot doctor   # diagnose: ast-index, config, upstream drift

# "ast-index not found"  → npx token-pilot install-ast-index
# "hooks not firing"     → restart your AI assistant
```
Built on ast-index · @ast-grep/cli · MCP SDK · chokidar
MIT
Add this to `claude_desktop_config.json` and restart Claude Desktop.

```json
{
  "mcpServers": {
    "token-pilot": {
      "command": "npx",
      "args": [
        "-y",
        "token-pilot"
      ]
    }
  }
}
```

Pro tip: just installed Token Pilot? Say to Claude: "remember why I installed Token Pilot and what I want to try" — it'll save into your Vault.
how this works →