Talonic

Name: Talonic
Availability: InStock
Author: talonicdev

Бесплатно

Official Talonic MCP server. Lets AI agents extract structured, schema-validated data from any document, including PDFs, scans, images, spreadsheets, and forms,

автор: talonicdev

GitHub

Описание

Official Talonic MCP server. Lets AI agents extract structured, schema-validated data from any document, including PDFs, scans, images, spreadsheets, and forms, directly inside the chat.

README

Official Talonic MCP server. Lets AI agents extract structured, schema-validated data from any document via the Model Context Protocol.

Status: Listed on the official MCP Registry as io.github.talonicdev/talonic-mcp. Seven tools and one resource live: talonic_extract, talonic_search, talonic_filter, talonic_get_document, talonic_to_markdown, talonic_list_schemas, talonic_save_schema, plus the talonic://schemas resource. Verified end-to-end against production.

Why an agent should use this

When an agent needs to pull structured data out of a PDF, scan, image, or messy document, the usual approach is raw OCR plus an LLM call. Results are unreliable; tables get mangled, dates get misread, totals drift.

With this MCP server installed, the agent has a talonic_extract tool that returns schema-validated JSON with per-field confidence scores, a detected document type, and stable IDs for follow-up calls. Six other tools cover the rest of the workflow: searching the workspace, filtering by extracted field values, fetching a document's metadata, getting OCR markdown, listing saved schemas, and saving new ones.

Get an API key (30 seconds)

Each user runs against their own isolated Talonic workspace. Your documents and schemas are private to you.

Sign up at https://app.talonic.com. Free tier: 50 extractions per day, no credit card.
Settings → API Keys → Create New Key.
Copy the tlnc_ value into your MCP client config (snippets below).

Install

The package is on npm. Every MCP client launches it the same way: a one-line npx invocation with your API key in the env block. No clone, no build.

{
  "command": "npx",
  "args": ["-y", "@talonic/mcp@latest"],
  "env": { "TALONIC_API_KEY": "tlnc_..." }
}

The -y flag skips the npm install prompt.

Version pinning. @latest is fine for trying things out and for personal use. For production deployments and CI, pin to a specific version (e.g. @talonic/[email protected]) so a future release cannot silently change tool descriptions, validation rules, or the response shape your agent depends on. Bump the pin manually after reviewing the CHANGELOG.

Talonic uses a single API key per workspace. The same key authorises all tools. There is no scoping mechanism in v0.1; treat the key like any other secret and store it in your client's secret store rather than in version control.

Per-client snippets are below.

MCP client setup

Claude Desktop

Edit ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows):

{
  "mcpServers": {
    "talonic": {
      "command": "npx",
      "args": ["-y", "@talonic/mcp@latest"],
      "env": {
        "TALONIC_API_KEY": "tlnc_your_key_here"
      }
    }
  }
}

Fully restart Claude Desktop (Cmd+Q on macOS, not just close the window). Talonic appears in the connected servers list with all seven tools.

Cursor

Edit ~/.cursor/mcp.json (or open Cursor settings → MCP → edit config):

{
  "mcpServers": {
    "talonic": {
      "command": "npx",
      "args": ["-y", "@talonic/mcp@latest"],
      "env": {
        "TALONIC_API_KEY": "tlnc_your_key_here"
      }
    }
  }
}

Cline (VS Code extension)

Open the Cline panel → settings (gear icon) → MCP Servers → Edit. Add the entry above. Save and restart the panel.

Continue (VS Code / JetBrains)

Edit ~/.continue/config.json. Add to the mcpServers array:

{
  "name": "talonic",
  "command": "npx",
  "args": ["-y", "@talonic/mcp@latest"],
  "env": {
    "TALONIC_API_KEY": "tlnc_your_key_here"
  }
}

Cowork

Open Cowork settings → MCP Servers → Add. Use the same shape as Claude Desktop above.

Tool reference

Each tool's description is written for an LLM, with explicit USE WHEN / DO NOT USE WHEN sections. Agents pick the right tool reliably without further prompting. Briefly:

talonic_extract, status: stable. Extract structured, schema-validated data from a document. Inputs: one of file_data + filename (recommended for chat clients, see below), file_path, file_url, or document_id, plus a schema or schema_id. Returns JSON with data, per-field confidence, and document metadata. Schema is required; the MCP layer rejects schema-less calls.
talonic_search, status: stable. Omnisearch across documents, fields, sources, and schemas in the workspace. Use for conceptual or fuzzy queries.
talonic_filter, status: stable. Filter documents by extracted field values using composable conditions (eq, gt, between, contains, is_empty, etc.). Accepts canonical field names (e.g. vendor.name) which the Talonic API resolves to ids server-side, or UUIDs directly. is_not_empty is intentionally not exposed in v0.1; see Known limitations.
talonic_get_document, status: stable. Fetch full metadata for a single document by id, including processing log and link URLs.
talonic_to_markdown, status: stable. Get OCR-converted markdown for a document. Accepts document_id (cheapest), file_data + filename, file_path, or file_url.
talonic_list_schemas, status: stable. List all saved schemas with their definitions. Returns both UUID and SCH-XXXXXXXX short id; either is accepted by talonic_extract.
talonic_save_schema, status: stable. Save a schema definition to the workspace for reuse.

Two resources are also exposed for clients that browse them separately (Claude Desktop and Cowork render these in the UI):

talonic://schemas: saved-schemas list.
talonic://webhooks/reference: webhook event types, delivery behavior, signature verification algorithms, and retry policies. Use this when an agent needs to set up or troubleshoot a webhook integration without leaving the MCP context.

Agent decision guide

Pick the right tool before you call. The wrong tool returns the wrong data, costs unnecessary credits, and slows the conversation.

User has a file (or just dropped one in)

They want specific fields (vendor, total, dates, parties): talonic_extract with schema or schema_id.
They want full text content for summarisation, translation, or analysis: talonic_to_markdown.
They want both: talonic_extract with include_markdown: true, one upload.

User is asking about existing documents

Conceptual or fuzzy ("any docs about indemnification"): talonic_search.
Value-based on extracted fields ("invoices over 1000 EUR"): talonic_filter.
They reference a document_id: talonic_get_document for metadata, talonic_to_markdown for text, talonic_extract to re-extract with a new schema. Re-using a document_id is cheaper than re-uploading.

User is working with schemas

One-off extraction: pass schema inline.
Same schema across many documents: talonic_save_schema once, then talonic_extract with schema_id.
Discover existing schemas first: talonic_list_schemas.
Iterate inline before saving. Avoid clutter.

Confidence and human review

confidence.overall below ~0.7: tell the user the extraction may be unreliable, surface low-confidence fields, confirm before any downstream action.
Per-field confidence below ~0.7: mark as "needs review", do not use silently in calculations or external API calls.
Critical fields (amounts, legal terms, names, dates): confirm with user before acting, even at high confidence.
Per-field provenance (page, bbox) is not surfaced in v0.1. Send users to links.dashboard to verify against the source.

When not to call Talonic

General-knowledge or chat questions. Do not pre-emptively extract.
The data is already in conversation history from a previous tool call. Re-use it.
The user wants to discuss or revise an extraction you already produced. Reason over the previous result instead of re-extracting.
Cost, EUR price, and remaining balance are not surfaced in v0.1 tool responses. If the user asks about cost or credit balance, point them to https://app.talonic.com.

Drag-and-drop in chat clients

When the user drag-drops a PDF (or any supported file) into a chat-style MCP host such as Claude Desktop, Cowork, or Cursor, the file lands in a host-owned sandbox directory the MCP server cannot read. The path the host then hands the agent (something like /mnt/user-data/uploads/abc.pdf) is meaningless to a separately-running npx MCP process, so file_path calls fail with a filesystem error.

@talonic/[email protected] and later solve this by accepting file_data (base64-encoded file bytes) and filename on talonic_extract and talonic_to_markdown. The agent reads the file bytes from the conversation, base64-encodes them, and passes them through the MCP tool call. The MCP server decodes, infers MIME type from the filename, and uploads to the Talonic API as a normal multipart request. The file never has to live on the MCP server's disk.

Tool descriptions advertise file_data as the recommended input for chat-style clients, so well-trained agents will reach for it automatically. No client-side configuration is required beyond installing @talonic/mcp@latest (or @0.1.4 or newer) as documented above.

How it works

Agent (Claude Desktop / Cursor / Cline / etc.)
  ↓ MCP protocol over stdio
Talonic MCP server (this package)
  ↓ HTTPS, Bearer auth
api.talonic.com

Each tool call is one HTTP request to the Talonic API, using your API key. The server handles auth, retries on transient failures (429, 5xx), MIME-type detection on file uploads, multipart serialisation, and structured error formatting.

Configuration

Set via the env block in your MCP client config:

Variable	Required	Description
`TALONIC_API_KEY`	yes	Your Talonic API key. Starts with `tlnc_`.
`TALONIC_BASE_URL`	no	Override the API base URL. Default: `https://api.talonic.com`.

Troubleshooting

Tool calls return Error: TALONIC_API_KEY environment variable is required. The env block in your MCP client config is missing or not being read. Double-check the JSON shape. After editing the config, fully restart the client (not just the conversation).

Talonic does not appear in the connected servers list. Make sure the command is npx and the args are exactly ["-y", "@talonic/mcp@latest"]. As a sanity check, in any terminal run npx -y @talonic/mcp@latest --version; it should print a version number. If you are on an older 0.1.x and see no output at all, you are hitting the silent-bin bug fixed in 0.1.3; upgrade by setting the args to ["-y", "@talonic/mcp@latest"] and restarting the client.

talonic_extract returns 500 INTERNAL_ERROR with auto-discovery. Known limitation. Always provide either an inline schema or a schema_id. The platform team is stabilising the auto-discovery code path.

talonic_extract rejects with unsupported_file_type. The MIME type was inferred as application/octet-stream. The SDK now infers from common file extensions; if your filename has an unusual extension, pass content_type explicitly to the SDK call (the MCP layer does not yet expose this; a future tool version will).

talonic_filter errors with VALIDATION_ERROR / "No field matches name". You passed a field value the API does not know. Field names must be canonical names from the field registry (e.g. vendor.name, policy.0_term_end). Call talonic_search first; the canonical names appear in fields[].canonicalName of the response. You can also pass a UUID via field_id if you have one.

talonic_filter returns no results when you expect data. The is_not_empty operator currently underreports. Use a more specific operator (eq, gt, gte, lt, lte, between, contains) against a known value when possible.

Tool descriptions look wrong in my client. Some MCP clients cache tool descriptions. Restart the client after a server update.

Known limitations (v0.1)

Auto-discovery extract (no schema) is not reliable on production. Always pass a schema or schema_id to talonic_extract.
Schema definition: prefer full JSON Schema for now. The flat key-type map ({ vendor_name: "string", ... }) is documented as supported and the API's own error message lists it as accepted, but as of writing the server-side normaliser does not actually translate it. If a flat-map save returns a "no fields" error, fall back to:
```
{
  "type": "object",
  "properties": {
    "vendor_name": { "type": "string", "title": "Vendor Name" },
    "total_amount": { "type": "number", "title": "Total Amount" }
  },
  "required": ["vendor_name", "total_amount"]
}
```
is_not_empty filter currently underreports. A condition with operator: "is_not_empty" may return zero documents even when fields are populated. Use specific operators (eq, gt, gte, lt, lte, between, contains) against known values instead.
schema_id on talonic_extract requires the UUID form. Other endpoints accept either UUID or SCH-XXXXXXXX, but /v1/extract currently only accepts UUIDs there. Pass the UUID from talonic_list_schemas.

Upgrading from 0.1.0 / 0.1.1 / 0.1.2

Versions before 0.1.3 had a bug where the bundled MCP server bin would exit silently when launched via the npm bin symlink, which is exactly how every MCP client invokes it via npx. If your client config is on an older version and you see no talonic_* tools surface despite the config looking correct, you are hitting that bug.

The fix is to point your args at @latest (or @0.1.3 explicitly) and fully restart the client:

"args": ["-y", "@talonic/mcp@latest"]

Develop

git clone https://github.com/talonicdev/talonic-mcp.git
cd talonic-mcp
npm install
npm run build
npm test
node dist/server.js --version

License

MIT (c) Talonic GmbH

Как установить

Добавь это в claude_desktop_config.json и перезапусти Claude Desktop.

{
  "mcpServers": {
    "talonic-mcp": {
      "command": "npx",
      "args": []
    }
  }
}

Talonic

Описание

README

Why an agent should use this

Get an API key (30 seconds)

Install

MCP client setup

Claude Desktop

Cursor

Cline (VS Code extension)

Continue (VS Code / JetBrains)

Cowork

Tool reference

Agent decision guide

Drag-and-drop in chat clients

How it works

Configuration

Troubleshooting

Known limitations (v0.1)

Upgrading from 0.1.0 / 0.1.1 / 0.1.2

Develop

License

Как установить

Похожие MCP

Gmail

Slack

Runbear

Discord Server

Command Palette

Talonic

Описание

README

Why an agent should use this

Get an API key (30 seconds)

Install

MCP client setup

Claude Desktop

Cursor

Cline (VS Code extension)

Continue (VS Code / JetBrains)

Cowork

Tool reference

Agent decision guide

Drag-and-drop in chat clients

How it works

Configuration

Troubleshooting

Known limitations (v0.1)

Upgrading from 0.1.0 / 0.1.1 / 0.1.2

Develop

License

Как установить

Похожие MCP

Gmail

Slack

Runbear

Discord Server