loading…
Search for a command to run...
loading…
Enables natural language data queries with statistical rigor and a semantic layer, supporting 23+ analytical tools for product analytics.
Enables natural language data queries with statistical rigor and a semantic layer, supporting 23+ analytical tools for product analytics.
📦 Public Archive - November 2025
This project is being archived as a complete reference implementation of a local-first AI-powered analytics assistant. It demonstrates best practices for MCP integration, dbt semantic bridging, and extending data pipeline infrastructure. It was built to showcase duckDB, dlt, dbt, and Ibis (Boring Semantic Layer).
AI-powered data analyst with semantic layer, statistical rigor, and natural language insights
License: MIT Python 3.10+ Tests MCP Compatible
Status: v1.0 Production Ready | Works with Claude Desktop & ChatGPT Desktop | Quick Start →
# Ask Claude Desktop or ChatGPT Desktop:
"How many users do we have?"
"Compare engagement by plan type"
"Run a comprehensive conversion analysis"
How it works: Claude/ChatGPT Desktop → MCP Protocol → AI Analyst → Semantic Layer → Your Data
# 1. Install UV package manager
curl -LsSf https://astral.sh/uv/install.sh | sh
# 2. Clone and setup
git clone https://github.com/sbdk-dev/claude-analyst.git
cd claude-analyst
./scripts/setup.sh
# 3. Test the system
cd semantic-layer
uv run python test_all_functionality.py
# Expected: Tests Passed: 7 | Success Rate: 100.0%
# 1. Add to ~/Library/Application Support/Claude/claude_desktop_config.json:
{
"mcpServers": {
"ai-analyst": {
"command": "/opt/homebrew/bin/uv",
"args": ["run", "python", "run_mcp_server.py"],
"cwd": "/FULL/PATH/TO/claude-analyst/semantic-layer"
}
}
}
# 2. Find your uv path
which uv # Use this full path in config above
# 3. Restart Claude Desktop
# 4. Ask: "List available data models"
# 1. Set your OpenAI API key
export OPENAI_API_KEY="sk-your-key-here"
# 2. Start the OpenAI API server
cd semantic-layer
uv run python run_openai_server.py
# Server starts on http://localhost:8000
# 3. Configure ChatGPT Desktop
# Settings → Beta Features → Actions → Add Custom Action
# URL: http://localhost:8000
# Auth: Bearer token (optional)
# 4. Start chatting!
Detailed guides: QUICK_START.md | Claude Desktop Setup
Pattern: Build → Execute → Annotate
Every answer is based on REAL query results, not AI guesses.
Query: "What's our user breakdown by plan?"
→ Generates SQL from semantic model
→ Executes against real database
→ Interprets actual results
→ "700 Free (70%), 250 Pro (25%), 50 Enterprise (5%)"
Automatic significance testing when comparing groups:
Query: "Is the difference in engagement statistically significant?"
→ Auto-runs appropriate test (chi-square, t-test)
→ Reports p-values, effect sizes, confidence intervals
→ "Pro users 2.3x higher DAU (p<0.001, n=250 vs n=700)"
Built-in analytical workflows for comprehensive analysis:
The system includes sample product analytics data:
Users (1,000 users)
Events (34,000+ events)
Engagement
"How many users do we have?"
"What's our conversion rate from free to paid?"
"Show me the top 5 features by usage"
"What's our DAU trend this month?"
"Compare engagement between Pro and Free users"
"Is the difference in conversion rate statistically significant?"
"How does feature adoption vary by industry?"
"Run a comprehensive conversion analysis"
"Analyze feature usage patterns across user segments"
"What optimization opportunities do you see in our data?"
Traditional AI analysis can make up results when writing observations before executing queries.
┌─────────────────────────────────┐
│ Claude Desktop / ChatGPT │
└────────────┬────────────────────┘
│ MCP Protocol
┌────────────▼────────────────────┐
│ AI Analyst MCP Server │
│ ┌──────────────────────────┐ │
│ │ Multi-Query Workflows │ │
│ │ Query Optimization │ │
│ │ Conversation Memory │ │
│ │ Statistical Testing │ │
│ └───────────┬──────────────┘ │
│ │ │
│ ┌───────────▼──────────────┐ │
│ │ Semantic Layer │ │
│ │ (Business Metrics) │ │
│ └───────────┬──────────────┘ │
└──────────────┼──────────────────┘
│
┌─────▼──────┐
│ DuckDB │
└────────────┘
Key Components:
MCP Server: FastMCP - Production-grade MCP framework Semantic Layer: Boring Semantic Layer - Business metrics abstraction Query Engine: Ibis - Portable dataframe abstraction Database: DuckDB - Analytical database (prototype) Statistical Testing: scipy - Significance testing and effect sizes
After setup, confirm everything works:
cd semantic-layer
# Test semantic layer
uv run python -c "
from mcp_server.semantic_layer_integration import SemanticLayerManager
import asyncio
async def test():
manager = SemanticLayerManager()
await manager.initialize()
models = await manager.get_available_models()
print(f'SUCCESS: {len(models)} models loaded')
print(f'Available: {[m[\"name\"] for m in models]}')
asyncio.run(test())
"
Expected output:
SUCCESS: 3 models loaded
Available: ['users', 'events', 'engagement']
If you see this, restart Claude Desktop and try: "List available data models"
Current Version: 1.0 Production Ready Release Date: November 2025 Test Pass Rate: 100% (7/7 tests) Completion: All v1.0 features implemented and validated Performance: 95% cache hit rate, <100ms query response
Issues: Found a bug or have a feature request? Open an issue
Contributing: Pull requests welcome! Please follow the existing code style.
MIT License - See LICENSE for details
Inspiration & Research:
Technology:
Last Updated: November 2025 Version: 1.0 Author: Matt Strautmann Status: v1.0 Production Ready | 100% Test Pass Rate
Выполни в терминале:
claude mcp add ai-analyst -- npx Web content fetching and conversion for efficient LLM usage.
Retrieval from AWS Knowledge Base using Bedrock Agent Runtime.
автор: modelcontextprotocolProvides auto-configuration for setting up an MCP server in Spring Boot applications.
A very streamlined mcp client that supports calling and monitoring stdio/sse/streamableHttp, and can also view request responses through the /logs page. It also
автор: xuzexin-hzНе уверен что выбрать?
Найди свой стек за 60 секунд
Автор?
Embed-бейдж для README
Похожее
Все в категории ai