loading…
Search for a command to run...
loading…
Provides free and unlimited vision capabilities for AI coding assistants using the Groq API and Meta Llama 4 Vision model. It enables users to perform image ana
Provides free and unlimited vision capabilities for AI coding assistants using the Groq API and Meta Llama 4 Vision model. It enables users to perform image analysis, OCR, UI layout description, and error diagnosis directly from screenshots and documents.
Free, unlimited vision capabilities for your AI coding assistant using Groq API and Meta Llama 4 Vision model.
cd vision-mcp-server
# Option 1: Using install script (recommended)
./install.sh
# Option 2: Manual installation
pip3 install mcp groq pillow aiofiles
Add to ~/.claude/config.json:
{
"mcpServers": {
"vision-mcp-server": {
"command": "python",
"args": ["-m", "vision_mcp_server.server"],
"env": {
"GROQ_API_KEY": "your-groq-api-key-here"
}
}
}
}
Add to OpenCode settings:
{
"$schema": "https://opencode.ai/config.json",
"mcp": {
"vision-mcp-server": {
"type": "local",
"command": ["python", "-m", "vision_mcp_server.server"],
"environment": {
"GROQ_API_KEY": "your-groq-api-key-here"
}
}
}
}
Add to Cline settings:
{
"mcpServers": {
"vision-mcp-server": {
"command": "python",
"args": ["-m", "vision_mcp_server.server"],
"env": {
"GROQ_API_KEY": "your-groq-api-key-here"
}
}
}
}
Describe this image: screenshot.png
Extract text from this document: scan.jpg
What's wrong with this error screenshot: error.png
Explain this architecture diagram: system-diagram.png
Compare these two UI screenshots: old-ui.png vs new-ui.png
analyze_image - General image analysisextract_text - OCR text extractiondescribe_ui - UI component analysisdiagnose_error - Error screenshot analysisunderstand_diagram - Diagram interpretationanalyze_chart - Chart and dashboard analysiscompare_images - Image comparisoncode_from_screenshot - Code extraction from screenshotsRun locally:
export GROQ_API_KEY=your-api-key
python -m vision_mcp_server.server
MIT
Выполни в терминале:
claude mcp add vision-mcp-server -- npx Extract design specs and assets
автор: FigmaEnables AI agents to read, write, and edit Office documents via LibreOffice with token-efficient design. Supports multiple formats including DOCX, XLSX, PPTX, a
автор: passerbyflutterSearch and retrieve company logos by brand or domain. Customize size, format, and theme to match your design needs. Accelerate design, prototyping, and content
автор: NOVA-3951Enables GUI automation for controlling PIX4Dmatic on Windows through MCP. Supports launching, focusing, capturing screenshots, sending hotkeys, clicking UI elem
автор: jangjo123Не уверен что выбрать?
Найди свой стек за 60 секунд
Автор?
Embed-бейдж для README
Похожее
Все в категории design