loading…
Search for a command to run...
loading…
A read-only Model Context Protocol server that exposes a semantic knowledge base to AI agents via 27 tools. It enables querying of documents and data integrated
A read-only Model Context Protocol server that exposes a semantic knowledge base to AI agents via 27 tools. It enables querying of documents and data integrated from sources like Notion, SharePoint, HubSpot, and Slack.
The data backbone powering Volterra's AI tools -- ingesting documents from 5 sources, generating embeddings, and enforcing GDPR compliance so every other platform component can search company knowledge.
Operational knowledge was scattered across five systems -- Notion wikis, SharePoint drives, HubSpot tickets, Slack threads, and local file shares. Support agents re-answered questions that had been solved months earlier. Product and leadership had no way to search across sources. And with EU-based customers, any AI tool touching this data needed automatic PII detection and GDPR-compliant handling.
| Metric | Detail |
|---|---|
| Ingestion sources | 5 systems unified into one searchable pipeline |
| MCP tools shipped | 27 read-only tools for AI agent access |
| Ticket categorization | 90.6% accuracy over 11,800+ tickets (auto-classifier built on this data) |
| PII handling | Automatic detection and sensitivity classification |
| Sync frequency | Daily automated ingestion via pg_cron + Edge Functions |
Knowledge Engine is the foundation layer. It generates the embeddings that power the Semantic Platform's 5 GPT apps, feeds the website AI chat via n8n, and provides ticket data for Call Intelligence analysis.
graph TB
CLI[CLI Commands] -->|Ingest| DP[Document Processor]
DP -->|Parse| Parsers[Format Parsers]
DP -->|Embed| OAI[OpenAI API]
DP -->|Store| DB[(PostgreSQL + pgvector)]
DP -->|Compliance| PII[PII Detector]
subgraph Sources
FS[Local Files]
NO[Notion API]
SP[SharePoint]
HS[HubSpot]
SL[Slack Export]
end
Sources -->|Fetch| DP
subgraph EF[Edge Functions]
HTS[HubSpot Ticket Sync]
NPS[Notion Pages Sync]
SCS[Slack Channel Sync]
MCP[MCP Server]
end
Cron[pg_cron] -->|Scheduled| EF
EF -->|Read/Write| DB
| Layer | Technology |
|---|---|
| Runtime | Node.js 18+ with TypeScript (ESM) |
| Database | PostgreSQL + pgvector (Supabase) |
| Embeddings | OpenAI text-embedding-3-small (1536d) |
| Parsers | pdfjs-dist, mammoth, xlsx, mailparser |
| Sources | Notion API, Microsoft Graph, HubSpot API, Slack API |
| Compliance | Custom PII detector with redact-pii, franc (language) |
| Scheduling | pg_cron + Supabase Edge Functions |
| CLI | Commander.js with structured logging (Winston) |
src/
├── core/
│ ├── document-processor.ts # Main orchestration (451 lines)
│ ├── embedding-service.ts # OpenAI embedding generation
│ └── metadata-inference.ts # Auto-classification
├── parsers/ # Format-specific text extraction
│ ├── pdf-parser.ts
│ ├── docx-parser.ts
│ ├── xlsx-parser.ts
│ ├── wod-parser.ts # Structured deal data extraction
│ └── ...
├── sources/ # Data source connectors
│ ├── notion-source.ts
│ ├── sharepoint-source.ts
│ ├── hubspot-source.ts
│ └── slack-source.ts
├── compliance/
│ ├── pii-detector.ts # PII pattern detection
│ └── gdpr-handler.ts # Sensitivity classification
├── services/
│ ├── n8n-api-client.ts # n8n REST API client
│ └── vision-service.ts # GPT-4o image analysis
└── scripts/ # CLI entry points
supabase/
├── functions/ # Edge Functions (sync, MCP)
└── migrations/ # PostgreSQL schema migrations
Install dependencies:
npm install
Configure environment:
cp .env.example .env
Set up database (run migrations in Supabase SQL Editor):
CREATE EXTENSION IF NOT EXISTS vector;
-- Then apply migration files in chronological order
Ingest documents:
# Local files
npm run ingest:file ./documents/
# From Notion
npm run ingest:notion
# From HubSpot
npm run ingest:hubspot
# Slack export
npm run ingest:slack -- --export-path /path/to/export
The system automatically detects PII (emails, phone numbers, SSNs, names) and classifies document sensitivity:
| Mode | Behavior |
|---|---|
| Flag | Detects and flags PII, stores original content |
| Redact | Replaces PII with placeholders before storing |
Documents with detected PII are automatically upgraded to restricted or confidential access levels.
Adrian Marten — GitHub
Добавь это в claude_desktop_config.json и перезапусти Claude Desktop.
{
"mcpServers": {
"volterra-knowledge-engine": {
"command": "npx",
"args": []
}
}
}Read, send and search emails from Claude
Send, search and summarize Slack messages
No-code MCP client for team chat platforms, such as Slack, Microsoft Teams, and Discord.
A community discord server dedicated to MCP by [Frank Fiegel](https://github.com/punkpeye)