macOS automation MCP server with screen capture, UI interaction, and AI analysis

macOS 15.0+ (Sequoia) · Swift 6.2 · Node >=22 · License: MIT · Distributed via npm and Homebrew
Peekaboo brings high-fidelity screen capture, AI analysis, and complete GUI automation to macOS. Version 3 adds native agent flows and multi-screen automation across the CLI and MCP server.
# Install via Homebrew
brew install steipete/tap/peekaboo
# Or run via npx
npx -y @steipete/peekaboo
# Capture full screen at Retina scale and save to Desktop
peekaboo image --mode screen --retina --path ~/Desktop/screen.png
# Click a button by label (captures, resolves, and clicks in one go)
# (command substitution keeps SNAPSHOT set in bash as well as zsh; `| read` loses it in bash)
SNAPSHOT=$(peekaboo see --app Safari --json | jq -r '.data.snapshot_id')
peekaboo click --on "Reload this page" --snapshot "$SNAPSHOT"
# Directly set a text field value when the accessibility value is settable
peekaboo set-value --on T1 --value "hello" --snapshot "$SNAPSHOT"
# Invoke a named accessibility action on an element
peekaboo perform-action --on B1 --action AXPress --snapshot "$SNAPSHOT"
# Run a natural-language automation
peekaboo agent "Open Notes and create a TODO list with three items"
# Run as an MCP server (Codex, Claude Code, Cursor)
npx -y @steipete/peekaboo
# Minimal MCP client config snippet:
# {
#   "mcpServers": {
#     "peekaboo": {
#       "command": "npx",
#       "args": ["-y", "@steipete/peekaboo"],
#       "env": {
#         "PEEKABOO_AI_PROVIDERS": "openai/gpt-5.5,anthropic/claude-opus-4-7"
#       }
#     }
#   }
# }
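The commented config above can also be emitted as plain JSON. The following is a minimal sketch, not part of Peekaboo itself: `print_mcp_config` is a hypothetical helper name, and where the resulting JSON has to be saved depends on the MCP client, which this section does not specify.

```shell
#!/usr/bin/env bash
# Sketch: print the MCP client config from the snippet above as real JSON.
# print_mcp_config is a hypothetical helper, not a Peekaboo command.
# $1 (optional): value for PEEKABOO_AI_PROVIDERS; defaults to the example value.
print_mcp_config() {
  local providers="${1:-openai/gpt-5.5,anthropic/claude-opus-4-7}"
  cat <<EOF
{
  "mcpServers": {
    "peekaboo": {
      "command": "npx",
      "args": ["-y", "@steipete/peekaboo"],
      "env": {
        "PEEKABOO_AI_PROVIDERS": "${providers}"
      }
    }
  }
}
EOF
}
```

Redirect the output into whichever file your MCP client reads, for example `print_mcp_config > path/to/client-config.json` (the path here is a placeholder).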
Peekaboo can generate shell-native completions directly from the same Commander metadata that powers CLI help and docs:
# Current shell (recommended)
eval "$(peekaboo completions "$SHELL")"
# Explicit shells
eval "$(peekaboo completions zsh)"
eval "$(peekaboo completions bash)"
peekaboo completions fish | source
For persistent setup and troubleshooting, see docs/commands/completions.md.
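For a persistent setup, the same commands can be placed in your shell's startup file. The sketch below only prints the line to add; `completion_init_line` is a hypothetical helper, and the authoritative instructions remain in docs/commands/completions.md.

```shell
#!/usr/bin/env bash
# Sketch: print the startup-file line that loads Peekaboo completions.
# completion_init_line is a hypothetical helper, not a Peekaboo command.
# $1: shell name or path (for example "zsh" or "/bin/zsh").
completion_init_line() {
  local shell_name="${1##*/}"   # strip any leading path: /bin/zsh -> zsh
  case "$shell_name" in
    zsh|bash)
      # Single quotes keep $( ... ) literal so it is evaluated at shell startup.
      printf 'eval "$(peekaboo completions %s)"\n' "$shell_name"
      ;;
    fish)
      printf 'peekaboo completions fish | source\n'
      ;;
    *)
      echo "unsupported shell: $shell_name" >&2
      return 1
      ;;
  esac
}
```

For example, `completion_init_line "$SHELL" >> ~/.zshrc` would append the zsh line for a zsh user; adjust the target file to match your shell.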
| Command | Key flags / subcommands | What it does |
|---|---|---|
| see | --app, --mode screen/window, --retina, --json | Capture and annotate UI, return snapshot + element IDs |
| click | --on <id/query>, --snapshot, --wait-for, --coords | Click by element ID, label, or coordinates |
| type | --text, --clear, --profile, --delay | Enter text with pacing options |
| set-value | --on <id/query>, --value, --snapshot | Directly set a settable accessibility value |
| perform-action | --on <id/query>, --action, --snapshot | Invoke a named accessibility action |
| press | key names, --count, --delay, --hold | Special keys and sequences |
| hotkey | combos like cmd,shift,t | Modifier combos (cmd/ctrl/alt/shift) |
| scroll | --on <id>, --direction up/down, --amount | Scroll views or elements |
| swipe | --from/--to, --duration, --steps | Smooth gesture-style drags |
| drag | --from/--to, modifiers, Dock/Trash targets | Drag-and-drop between elements/coords |
| move | --to <id/coords>, --screen-index | Position the cursor without clicking |
| window | list, move, resize, focus, set-bounds | Move/resize/focus windows and Spaces |
| app | launch, quit, relaunch, switch, list | Launch, quit, relaunch, switch apps |
| space | list, switch, move-window | List or switch macOS Spaces |
| menu | list, list-all, click, click-extra | List/click app menus and extras |
| menubar | list, click | Target status-bar items by name/index |
| dock | launch, right-click, hide, show, list | Interact with Dock items |
| dialog | list, click, input, file, dismiss | Drive system dialogs (open/save/etc.) |
| image | --mode screen/window/menu, --retina, --analyze | Screenshot screen/window/menu bar (+analyze) |
| list | apps, windows, screens, menubar, permissions | Enumerate apps, windows, screens, permissions |
| tools | --verbose, --json, --no-sort | Inspect native Peekaboo tools |
| completions | [shell] | Generate zsh/bash/fish completion scripts from Commander metadata |
| config | init, show, add, login, models | Manage credentials/providers/settings |
| permissions | status, grant | Check/grant required macOS permissions |
| run | .peekaboo.json, --output, --no-fail-fast | Execute .peekaboo.json automation scripts |
| sleep | --duration (ms) | Millisecond delays between steps |
| clean | --all-snapshots, --older-than, --snapshot | Prune snapshots and caches |
| agent | --model, --dry-run, --resume, --max-steps, audio | Natural-language multi-step automation |
| mcp | serve (default) | Run Peekaboo as an MCP server |
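Several commands in the table take a `--snapshot` ID produced by `see`. The sketch below wraps that capture step with a basic failure check. It assumes `peekaboo` and `jq` are on PATH and that the ID lives at `.data.snapshot_id`, as in the quick-start example; `capture_snapshot` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Sketch: capture a snapshot for an app and print its ID, failing loudly
# when no ID comes back. capture_snapshot is a hypothetical helper.
# Assumes `peekaboo` and `jq` are installed and on PATH.
# $1: application name, passed to `peekaboo see --app`.
capture_snapshot() {
  local app="$1" id
  id=$(peekaboo see --app "$app" --json | jq -r '.data.snapshot_id')
  # jq -r prints the string "null" when the field is missing.
  if [ -z "$id" ] || [ "$id" = "null" ]; then
    echo "no snapshot ID returned for $app" >&2
    return 1
  fi
  printf '%s\n' "$id"
}
```

With the helper defined, `SNAPSHOT=$(capture_snapshot Safari)` followed by `peekaboo click --on "Reload this page" --snapshot "$SNAPSHOT"` mirrors the quick-start flow, but stops early when the capture fails.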
Peekaboo's provider list changes with Tachikoma and the tested model catalog. See docs/providers.md for the current provider reference, including OpenAI, Anthropic, xAI/Grok, Google Gemini, MiniMax, Ollama, LM Studio, and compatible custom endpoints.
Set providers via PEEKABOO_AI_PROVIDERS or peekaboo config add.
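Judging by the example value in the MCP config, `PEEKABOO_AI_PROVIDERS` appears to be a comma-separated list of `provider/model` entries. The sketch below splits such a value for a quick sanity check before exporting it; `split_providers` is a hypothetical helper, and the format is inferred from that one example rather than from docs/providers.md.

```shell
#!/usr/bin/env bash
# Sketch: split a PEEKABOO_AI_PROVIDERS-style value into provider and model.
# split_providers is a hypothetical helper; the "provider/model,provider/model"
# format is an assumption based on the example value in this README.
# $1: comma-separated list of provider/model entries.
split_providers() {
  local entry
  local IFS=','                 # split the unquoted $1 on commas only
  for entry in $1; do
    # Text before the first "/" is the provider, the rest is the model.
    printf '%s\t%s\n' "${entry%%/*}" "${entry#*/}"
  done
}
```

A typical use is to check the value first with `split_providers "openai/gpt-5.5,anthropic/claude-opus-4-7"`, then set it with `export PEEKABOO_AI_PROVIDERS="openai/gpt-5.5,anthropic/claude-opus-4-7"`.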
For development, run pnpm install, then pnpm run build:cli or pnpm run test:safe. Lint and format with pnpm run lint && pnpm run format.
License: MIT
Run in the terminal:
claude mcp add peekaboo -- npx -y @steipete/peekaboo