loading…
Search for a command to run...
loading…
An MCP server that provides agentic vision capabilities for visual regression testing by capturing and comparing screenshots using Gemini Flash. It enables user
An MCP server that provides agentic vision capabilities for visual regression testing by capturing and comparing screenshots using Gemini Flash. It enables users to detect UI changes and conduct conversational investigations to distinguish between intended and unintended visual modifications.
A Model Context Protocol (MCP) server that brings agentic vision capabilities to Claude Code for visual regression testing using Gemini 3 Flash.
Never again have ambiguous conversations about visual changes. See exactly what changed, circled and annotated, with intended vs unintended change detection.
Where's Waldo Rick provides:
# Install via uvx
uvx --from git+https://github.com/bretbouchard/gemini-vision-mcp wheres_waldo.server
# Or install locally
pip install -e .
Add to your Claude Code MCP configuration (~/.claude/mcp.json or project-specific):
{
"mcpServers": {
"wheres-waldo-rick": {
"command": "uvx",
"args": ["--from", "git+https://github.com/bretbouchard/gemini-vision-mcp", "wheres_waldo.server"],
"env": {
"GEMINI_API_KEY": "your-api-key-here"
}
}
}
}
# 1. Declare expected changes before work
/visual:prepare "Card padding increases by 2px, button moves to right"
# 2. Capture baseline screenshot
/visual:capture "Phase 3 - Before card update"
# 3. Development happens...
# 4. Capture current state
/visual:capture "Phase 4 - After card update"
# 5. Compare and see all changes
/visual:compare screenshots/phases/3-before.png screenshots/phases/4-after.png
visual_captureCapture a screenshot and store it for visual regression testing.
await visual_capture(
name="Phase 3 - Before card update",
platform="macos" # auto, macos, ios, web
)
visual_prepareDeclare a baseline with expected changes before development.
await visual_prepare(
phase="Phase 3 - Card Layout Update",
expected_changes="Card padding increases by 2px, button moves to right"
)
visual_compareCompare two screenshots with pixel-level precision and agentic vision.
await visual_compare(
before_path="screenshots/phases/3-before.png",
after_path="screenshots/phases/4-after.png",
threshold=2 # 1px, 2px, or 3px
)
visual_cleanupClean up old screenshots and cache.
await visual_cleanup(retention_days=7)
# Clone repository
git clone https://github.com/bretbouchard/gemini-vision-mcp
cd gemini-vision-mcp
# Install in development mode
pip install -e ".[dev]"
# Run tests
pytest
# Format code
black src/
ruff check src/
src/wheres_waldo/
├── __init__.py
├── server.py # MCP server with tool definitions
├── models/ # Pydantic domain models
├── services/ # Business logic (capture, compare, storage)
├── tools/ # MCP tool implementations
└── utils/ # Logging, hashing, path helpers
See ROADMAP.md for complete execution plan.
Contributions welcome! Please read REQUIREMENTS.md and ROADMAP.md before contributing.
MIT License - See LICENSE file for details
Built with:
Generated with Claude Code via Happy
Добавь это в claude_desktop_config.json и перезапусти Claude Desktop.
{
"mcpServers": {
"where-s-waldo-rick": {
"command": "npx",
"args": []
}
}
}