loading…
Search for a command to run...
loading…
A self-improving knowledge agent that provides local retrieval, web research, and structured answer synthesis for Claude Code and VS Code Copilot via MCP. It en
A self-improving knowledge agent that provides local retrieval, web research, and structured answer synthesis for Claude Code and VS Code Copilot via MCP. It enables AI tools to manage a project-specific knowledge lifecycle through automated research and indexing.
AI that gets smarter in YOUR domain — every question compounds.
Scholar Agent is part of the MindPulse Academic Suite, forming a powerful synergy between local open-source tools and fully-managed cloud services:
| Feature | Scholar Agent (Local) | PaperPulse (Cloud SaaS) |
|---|---|---|
| Hosting & Mode | Local MCP Server (Open Source) | Fully-Managed SaaS (Closed Source) |
| Core Workflow | On-demand research query & knowledge synthesis | Automated daily crawling, scoring & email/WeChat push |
| Storage | Local Markdown Files / Vector DB | Cloud Postgres / Managed Index |
| IDE Integration | Deeply integrated with Claude Code, VS Code, Cursor | Web-based Dashboard & Chatbot |
| Pricing | Free & Open Source | Free Tier / Premium Subscriptions |
💡 Synergy (One-Click Local Sync):
knowledge/ directory via a secure local loopback interface, bypass browser sandbox constraints and rebuild your search index automatically!Every AI conversation generates knowledge — research findings, technical explanations, citations. But LLMs are stateless: each new session starts from zero. The research your AI completed yesterday is not available today.
Scholar Agent makes AI knowledge persistent. It saves research and answers as local knowledge cards — structured, citable, and interconnected. Before answering, the AI checks existing local knowledge first, building on what it has already learned rather than starting from scratch each time.
The result is a personal LLM-Wiki: structured, traceable, continuously growing — making your AI increasingly accurate in the domains you care about.
Ask → Research → Save as knowledge card → Knowledge compounds over time
When you ask a question, the agent routes the query through a local-first retrieval loop before falling back to external sources:
sequenceDiagram
actor User
participant Host as Claude Code / VS Code
participant MCP as Scholar Agent (MCP Server)
participant Local as Local Index (BM25)
participant Web as arXiv / Semantic Scholar
User->>Host: Prompt: "Explain MoE"
Host->>MCP: query_knowledge("MoE")
MCP->>Local: BM25 Query
alt Local Hit (BM25 Score >= Threshold)
Local-->>MCP: Match (e.g. mixture-of-experts.md)
MCP-->>Host: Local Note Context
else Local Miss
MCP->>Web: API Fallback (arxiv + web search)
Web-->>MCP: Raw Papers & Metadata
MCP->>MCP: Synthesize & Distill
MCP->>Local: Save Card (Staging -> Validate -> Promote)
MCP-->>Host: Synthesized Answer + Citations
end
Host->>User: Natural Language Response
Each conversation can produce a knowledge card — a structured record with:
These cards accumulate into a searchable local knowledge base. Next time a similar question comes up, the AI draws from what's already been researched.
Cards aren't isolated files. Scholar Agent:
draft → reviewed → trusted → stale → deprecated[[wiki-links]] between related cards~/scholar/) directly as an Obsidian Vault to navigate your visual knowledge graph.When researching a question, Scholar Agent:
For paper research, Scholar Agent provides:
pip install py-scholar-agent
Or with pipx (isolated environment):
pipx install py-scholar-agent
Or from source:
git clone https://github.com/zfy465914233/scholar-agent.git
cd scholar-agent
pip install -e .
scholar-agent init
One command creates data directories, writes config, and registers MCP with Claude Code. Done.
| Mode | Command | Data Location | Scope |
|---|---|---|---|
| Global (recommended) | scholar-agent init |
~/scholar/ |
Every project |
| Project-Local | SCHOLAR_HOME=./scholar scholar-agent init |
my-project/scholar/ |
Current project only |
| Docker | docker run -v ~/scholar:/data scholar-agent serve-mcp |
Container volume | Isolated |
Scholar Agent runs as an MCP server, integrating directly into your tools:
scholar-agent install claude --writescholar-agent install vscode --writescholar-agent install opencode --writeCore tools (always available): query_knowledge · save_research · list_knowledge · capture_answer · ingest_source · build_graph
Academic tools (set SCHOLAR_ACADEMIC=1): search_papers · search_conf_papers · download_paper · analyze_paper · extract_paper_images · paper_to_card · daily_recommend · link_paper_keywords
Add this to your claude_desktop_config.json:
{
"mcpServers": {
"scholar-agent": {
"command": "scholar-agent",
"args": ["serve-mcp"],
"env": {
"SCHOLAR_ACADEMIC": "1"
}
}
}
}
Knowledge is indexed with BM25 for fast keyword search — no external dependencies required. An optional embedding layer can be enabled for semantic similarity with scholar-agent index --build-embedding-index.
| Command | Description |
|---|---|
scholar-agent init |
One-command setup: data dirs + config + MCP registration |
scholar-agent serve-mcp |
Start the MCP server |
scholar-agent doctor |
Show environment and config diagnostics |
scholar-agent config show |
Show resolved configuration |
scholar-agent install claude --write |
Register MCP with Claude Code |
scholar-agent install vscode --write |
Register MCP with VS Code Copilot |
scholar-agent install opencode --write |
Register MCP with OpenCode |
| Variable | Required | Description |
|---|---|---|
SCHOLAR_ACADEMIC |
No | Set to 1 to enable academic tools |
SCHOLAR_HOME |
No | Override data directory (default: ~/scholar/) |
S2_API_KEY |
No | Semantic Scholar API key (get one free) |
LLM_API_KEY |
No | LLM API key for advanced synthesis pipeline |
See .scholar.example.json for a full example. Key sections:
knowledge_dir — Knowledge cards directoryindex_path — BM25 search indexacademic.research_interests — Your domains, keywords, arXiv categoriesacademic.scoring — Paper scoring weightsscholar/
├── config/ # Configuration files
├── knowledge/ # Knowledge cards
├── paper-notes/ # Paper analysis notes
├── daily-notes/ # Daily paper recommendations
├── indexes/ # BM25 search index
├── cache/ # Cached data
└── outputs/ # Generated outputs
Ask a question (via MCP)
→ Scholar Agent searches local knowledge first
→ Falls back to web/academic APIs when needed
→ Synthesizes answer with citations
→ Saves as a knowledge card
→ Next similar question draws from local knowledge
For best paper analysis quality:
download_paper("2510.24701", title="Paper Title", domain="LLM")extract_paper_images("2510.24701")analyze_paper(paper_json)paper_to_card(paper_json)Downloading the PDF first enables full-text extraction, producing notes with specific data, formulas, and experimental results.
make dev # Install with dev dependencies + pre-commit hooks
make lint # Run ruff + mypy
make test # Run test suite (1121 tests, ~20s, fully offline)
make coverage # Run tests with coverage report
make build # Build distribution package
make docker # Build Docker image
See CONTRIBUTING.md for detailed guidelines.
[[wiki-links]]. Your data, no lock-inWondering how Scholar Agent compares to mem0, MemGPT, or Zep? See docs/comparison.md for a detailed breakdown.
MIT — see LICENSE.
Выполни в терминале:
claude mcp add lore-agent -- npx CSA PROJECT - FZCO © 2026 IFZA Business Park, DDP, Premises Number 31174 - 001
Безопасность
Низкий рискАвтоматическая эвристика по публичным данным — не гарантия безопасности.