loading…
Search for a command to run...
loading…
Provides high-fidelity OCR for scientific PDFs using Meta's Nougat, with output format control and settings for agent workflows.
Provides high-fidelity OCR for scientific PDFs using Meta's Nougat, with output format control and settings for agent workflows.
PyPI version Python versions License: GPL v3 MCP Protocol
nougat-mcp is a Model Context Protocol (MCP) server for high-fidelity OCR of scientific PDFs using Meta's Nougat.
It is designed for agent workflows where you need equations, tables, and structure preserved better than traditional OCR.
mmd: raw Nougat/Mathpix-style output.md: renderer-friendly conversion (math delimiter and KaTeX compatibility fixes).Install from PyPI:
uv pip install nougat-mcp
This package installs nougat-ocr and pins known-sensitive dependencies for stability.
parse_research_paperArguments:
file_path (string): Absolute path to a local PDF.output_format (string, optional):default (default): uses server settings.mmd: raw Nougat output.md: converted markdown-friendly output.Returns:
get_output_settingsReturns resolved server output settings, including where settings were loaded from.
mmd -> md)When output_format="md", the server applies compatibility conversions:
\[ ... \] -> $$ ... $$\( ... \) -> $ ... $\tag{...} -> visible equation label \qquad\text{(...)}\bigl{\|} ... \bigr{\|} -> \bigl\| ... \bigr\|This avoids common renderer parse errors in markdown environments that are not fully MathJax-compatible.
Settings are read in this order:
NOUGAT_MCP_SETTINGS (if set)./settings.json (current working directory)Example settings.json:
{
"nougat_mcp": {
"default_output_format": "md",
"md_rewrite_tags": true,
"md_fix_sized_delimiters": true
}
}
Add to ~/.codex/config.toml:
[mcp_servers.nougat]
command = "uvx"
args = ["nougat-mcp"]
enabled = true
[mcp_servers.nougat.env]
NOUGAT_MCP_SETTINGS = "/absolute/path/to/settings.json"
Add to claude_desktop_config.json:
{
"mcpServers": {
"nougat": {
"command": "uvx",
"args": ["nougat-mcp"],
"env": {
"NOUGAT_MCP_SETTINGS": "/absolute/path/to/settings.json"
}
}
}
}
Add to ~/.gemini/settings.json:
{
"mcpServers": {
"nougat": {
"type": "stdio",
"command": "uvx",
"args": ["nougat-mcp"],
"env": {
"NOUGAT_MCP_SETTINGS": "/absolute/path/to/settings.json"
}
}
}
}
In Cursor MCP settings, add:
{
"mcpServers": {
"nougat": {
"command": "uvx",
"args": ["nougat-mcp"],
"env": {
"NOUGAT_MCP_SETTINGS": "/absolute/path/to/settings.json"
}
}
}
}
Note: Cursor MCP config location can vary by version/platform; use the MCP settings UI or your current JSON settings file.
A real extraction from page 5 of src/2405.08770v1.pdf is included:
mmd output: showcase/2405.08770v1_page5.mmdmd output: showcase/2405.08770v1_page5.mdQuick comparison:
# mmd
\[DV=V_{x}. \tag{3.2}\]
# md
$$
DV=V_{x}. \qquad\text{(3.2)}
$$
To keep Nougat stable across environments, the package pins sensitive dependency ranges:
transformers>=4.35,<4.38albumentations>=1.3,<1.4pypdfium2<5.0huggingface-hub<1.0fsspec<=2025.10.0GNU General Public License v3.0 (LICENSE).
Выполни в терминале:
claude mcp add nougat-mcp -- npx Web content fetching and conversion for efficient LLM usage.
Retrieval from AWS Knowledge Base using Bedrock Agent Runtime.
автор: modelcontextprotocolProvides auto-configuration for setting up an MCP server in Spring Boot applications.
A very streamlined mcp client that supports calling and monitoring stdio/sse/streamableHttp, and can also view request responses through the /logs page. It also
автор: xuzexin-hzНе уверен что выбрать?
Найди свой стек за 60 секунд
Автор?
Embed-бейдж для README
Похожее
Все в категории ai