loading…
Search for a command to run...
loading…
A token-efficient MCP server for the Heroku Platform API that uses a search-and-execute pattern to minimize context window overhead. It enables secure app manag
A token-efficient MCP server for the Heroku Platform API that uses a search-and-execute pattern to minimize context window overhead. It enables secure app management through just-in-time endpoint discovery and built-in safety guardrails for API mutations.
A token-efficient MCP server for the Heroku Platform API using a Code Mode pattern:
search+execute+auth_status.
Design references:
| Approach | Tool surface | Approx context cost at tool-list time | Fits in a 200k-token context window? |
|---|---|---|---|
| official Heroku MCP | 37 endpoint-oriented tools | ~6,375 tokens | Yes, but consumes meaningful budget up front |
heroku-code-mcp (this repo) |
3 control tools (search, execute, auth_status) |
~368 tokens | Yes, with minimal up-front overhead |
The practical impact is that the agent starts with a much smaller tool schema, then asks the server for just-in-time endpoint discovery. This keeps prompt budget available for user intent, planning, and response quality instead of spending it on static endpoint metadata.
Heroku’s API surface is broad, and an endpoint-per-tool MCP model makes the agent choose between many tools before it has enough task context. That usually increases tool-selection ambiguity, consumes tokens early, and makes multi-step tasks more brittle. The issue is not that many tools are inherently bad, but that model context is scarce and endpoint selection is an agent planning problem, not just a transport problem.
This server applies a Code Mode-style control loop with deterministic inputs:
search maps natural language intent to ranked operation_id candidates.execute validates and performs the selected Heroku API operation.auth_status provides explicit auth state so agents can branch cleanly.The server holds schema intelligence and safety policy centrally. The agent gets a small control surface and a stable execution contract.
| Tool | What it does | Why it exists |
|---|---|---|
search |
Ranks Heroku operations from schema + docs context | Reduces endpoint selection ambiguity |
execute |
Validates params/body and executes by operation_id |
Gives one deterministic execution path |
auth_status |
Returns {authenticated, scopes, expires_at} |
Supports explicit auth-aware planning |
Agent MCP Server
│ │
├──search({query: "list apps"})──►│ rank operations from catalog/index
│◄──[GET /apps, ...]───────────────│
│ │
├──execute({operation_id: ...})───►│ validate + call Heroku API
│◄──{status, headers, body}────────│
Benchmarks were captured on February 22, 2026 on the same machine and account for both implementations. The numbers below compare this repo’s local HTTP MCP endpoint against the official Heroku MCP server over stdio.
| Metric | heroku-code-mcp |
official Heroku MCP | Delta |
|---|---|---|---|
| Tool count | 3 | 37 | 91.9% lower |
| Tool-list payload bytes | 1,469 | 25,500 | 94.2% lower |
| Tool-list approx tokens | 368 | 6,375 | 94.2% lower |
| Connect avg | 14.8 ms | 10,168.7 ms | 687x faster |
list_tools avg |
4.3 ms | 10.3 ms | 2.4x faster |
| Read op avg | 528.0 ms (execute GET /apps) |
9,697.4 ms (list_apps) |
18.4x faster |
These are static charts so labels stay readable in GitHub without giant auto-scaled Mermaid panels.
The strongest win is context footprint. A 3-tool interface materially lowers initial prompt overhead and reduces tool-choice branching for the model. The second win is connection and read-path latency under this benchmark harness. In measured runs, the official Heroku MCP paid a much larger connect-time cost, and its measured read operation was substantially slower than execute GET /apps on this server.
This does not mean every endpoint in every environment will always have the same multiplier. It means the measured default experience in this setup favored the Code Mode control surface for both context economy and latency.
ceil(list_tools_json_bytes / 4) for rough token approximation.execute GET /appslist_appsArtifacts:
benchmarks/results/context-footprint-2026-02-22.jsonbenchmarks/results/custom-local-http-2026-02-22.jsonbenchmarks/results/official-heroku-mcp-start-2026-02-22.jsonBENCHMARKS.mdMCP URL: http://127.0.0.1:3000/mcp
cd heroku
npm install
npm run build
npm test
Configure OAuth env vars and use /oauth/start + /oauth/callback.
heroku auth:whoami
npm run seed:token
Start server:
TOKEN_STORE_PATH=./data/tokens.integration.json \
TOKEN_ENCRYPTION_KEY_BASE64='<seed-output-key>' \
PORT=3000 HOST=127.0.0.1 npm run dev
Smoke test:
curl -sS http://127.0.0.1:3000/healthz
MCP_URL=http://127.0.0.1:3000/mcp USER_ID=default npm run smoke:mcp
{
"mcpServers": {
"heroku-code-mcp": {
"transport": "streamable_http",
"url": "http://127.0.0.1:3000/mcp",
"headers": {
"x-user-id": "default"
}
}
}
}
{
"mcpServers": {
"heroku-code-mcp": {
"command": "npx",
"args": ["mcp-remote", "http://127.0.0.1:3000/mcp"],
"env": {
"MCP_REMOTE_HEADERS": "{\"x-user-id\":\"default\"}"
}
}
}
}
auth_status.search with intent.operation_id.execute with path_params, query_params, and body as needed.dry_run=true first, then replay with confirm_write_token and ALLOW_WRITES=true.Example search:
{
"query": "list apps",
"limit": 5
}
Example read execute:
{
"operation_id": "GET /apps"
}
Example write dry-run:
{
"operation_id": "PATCH /apps/{app_identity}",
"path_params": {
"app_identity": "my-app"
},
"body": {
"maintenance": true
},
"dry_run": true
}
POST, PATCH, PUT, DELETE) are blocked by default.ALLOW_WRITES=true and a matching confirm_write_token.GET / HEAD) are enabled for transient failures.CATALOG_CACHE_PATH) avoids cold-start re-ingestion.ETag/Last-Modified) reduce refresh cost.READ_CACHE_TTL_MS) improves repeated read latency.EXECUTE_MAX_BODY_BYTES, EXECUTE_BODY_PREVIEW_CHARS) prevent oversized responses from dominating context.Key env vars:
ALLOW_WRITESREQUEST_TIMEOUT_MSMAX_RETRIESCATALOG_CACHE_PATHREAD_CACHE_TTL_MSEXECUTE_MAX_BODY_BYTESEXECUTE_BODY_PREVIEW_CHARSFull example: .env.example
src/schema/*: ingestion + operation normalization + cachesrc/search/*: search index + rankingsrc/execute/*: validation + Heroku API executionsrc/auth/*: OAuth + encrypted token storagetests/*: catalog/search/execute testsbenchmarks/results/*: benchmark artifactsBENCHMARKS.md: benchmark methodology detailsREFERENCES.md: external referenceshttp://127.0.0.1:3000/mcp and server is running.AUTH_REQUIRED: seed token or complete OAuth flow.ALLOW_WRITES=true and send matching confirm_write_token.Добавь это в claude_desktop_config.json и перезапусти Claude Desktop.
{
"mcpServers": {
"heroku-code-mcp": {
"command": "npx",
"args": []
}
}
}