loading…
Search for a command to run...
loading…
Transforms YouTube videos into LLM-ready knowledge bases through transcription, semantic chunking, and vector embedding services. It provides 12 specialized MCP
Transforms YouTube videos into LLM-ready knowledge bases through transcription, semantic chunking, and vector embedding services. It provides 12 specialized MCP tools for video processing, semantic search, and SEO intelligence analysis.

Python 3.8+ License: MIT MCP Compatible Code style: black
Transform YouTube videos into LLM-ready knowledge bases with a production-ready MCP backend.
# Install
git clone https://github.com/leolech14/ytpipe.git
cd ytpipe
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
# Process a video
ytpipe "https://youtube.com/watch?v=dQw4w9WgXcQ"
Result: Metadata + Transcript + Semantic Chunks + Embeddings + Vector Storage
python -m ytpipe.mcp.server
Then from Claude Code:
"Process this video: https://youtube.com/watch?v=VIDEO_ID"
"Search video dQw4w9WgXcQ for 'machine learning'"
"Optimize SEO for video dQw4w9WgXcQ"
# Basic
ytpipe "https://youtube.com/watch?v=VIDEO_ID"
# Advanced
ytpipe URL --backend faiss --whisper-model large --verbose
from ytpipe.core.pipeline import Pipeline
pipeline = Pipeline(output_dir="./output")
result = await pipeline.process(url)
print(f"✅ {result.metadata.title}")
print(f" Chunks: {len(result.chunks)}")
print(f" Time: {result.processing_time:.1f}s")
ytpipe_process_video - Full pipelineytpipe_download - Download onlyytpipe_transcribe - Transcribe audioytpipe_embed - Generate embeddingsytpipe_search - Full-text searchytpipe_find_similar - Semantic searchytpipe_get_chunk - Get chunk by IDytpipe_get_metadata - Get video infoytpipe_seo_optimize - SEO recommendationsytpipe_quality_report - Quality metricsytpipe_topic_timeline - Topic evolutionytpipe_benchmark - Performance analysisMCP Server (12 tools) → Pipeline Orchestrator → 11 Services → Pydantic Models
Services:
8 Processing Phases:
| Metric | Value |
|---|---|
| Processing Speed | 4-13x real-time |
| Memory Usage | <2GB peak |
| Chunk Quality | 85%+ high quality |
| Embedding Dimension | 384 |
Contributions welcome! Please read CONTRIBUTING.md first.
MIT License - see LICENSE for details.
Built with:
Leonardo Lech
⭐ Star this repo if you find it useful!
Transform YouTube → Knowledge Base in seconds
Добавь это в claude_desktop_config.json и перезапусти Claude Desktop.
{
"mcpServers": {
"ytpipe": {
"command": "npx",
"args": []
}
}
}