A standalone MCP server for Windows desktop control, enabling screenshots, mouse and keyboard input, app launch, window/display management, and clipboard access via natural language.
A top-level Windows skill with a bundled standalone runtime and MCP server.
Published on ClawHub as computer-use-windows.
clawhub install computer-use-windows
This repository is packaged skill-first, not Claude-first, so the same runtime can be adapted for multiple agent products.
The requirement is stricter than "wrap an existing install":
.node binaries
This project follows that rule on Windows.
Node.js + Python + pyautogui + mss + Pillow + psutil + pywin32
~/.codex/skills/computer-use-windows/project
Implemented in this repository:
Still recommended before production use:
This session did not have a live Windows machine attached, so runtime behavior on Windows has been implemented but not end-to-end verified here.
Version 0.1.1 fixes a Windows packaging regression in the shared system-key blocklist logic. The migrated file had a broken platform branch that could apply the wrong denylist when checking OS-level shortcuts.
That logic now uses the correct win32 blocklist again, and the fix has been synced into both the source tree and the bundled skill payload.
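A platform-branched blocklist of the kind described above can be sketched as follows. This is a minimal illustration, not the project's code: the names `BLOCKED_SYSTEM_KEYS`, `normalize`, and `is_blocked` are hypothetical, and the key combinations are placeholder examples rather than the real denylist.

```python
import sys
from typing import Dict, FrozenSet, Set

# Hypothetical denylists keyed by sys.platform value. The real project's
# entries are not shown in this README; these are placeholders.
BLOCKED_SYSTEM_KEYS: Dict[str, Set[FrozenSet[str]]] = {
    "win32": {
        frozenset({"win", "l"}),
        frozenset({"ctrl", "alt", "delete"}),
        frozenset({"alt", "f4"}),
    },
    "darwin": {
        frozenset({"cmd", "q"}),
        frozenset({"cmd", "tab"}),
    },
}


def normalize(combo: str) -> FrozenSet[str]:
    """Turn 'Ctrl + Alt + Delete' into an order-independent, lowercase key set."""
    return frozenset(part.strip().lower() for part in combo.split("+") if part.strip())


def is_blocked(combo: str, platform: str = sys.platform) -> bool:
    """Check a shortcut against the denylist for the given platform only."""
    blocklist = BLOCKED_SYSTEM_KEYS.get(platform, set())
    return normalize(combo) in blocklist
```

Selecting the list by platform key is what keeps a Windows run from being checked against another platform's denylist. In this sketch an unknown platform falls back to an empty list; a stricter variant could refuse all OS-level shortcuts instead.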
flowchart LR
A[Agent / MCP Client] --> B[windows-computer-use-skill]
B --> C[Extracted TypeScript MCP tools]
B --> D[Standalone Python bridge]
D --> E[pyautogui]
D --> F[mss + Pillow]
D --> G[psutil + pywin32]
E --> H[Mouse / Keyboard]
F --> I[Screenshots]
G --> J[Apps / Windows / Displays / Clipboard]
git clone https://github.com/wimi321/windows-computer-use-skill.git
cd windows-computer-use-skill
npm install
npm run build
node dist/cli.js
On first launch, the project will automatically:
create a Python virtual environment at .runtime/venv
bootstrap pip if needed
install the dependencies from runtime/requirements.txt

{
"mcpServers": {
"computer-use": {
"command": "node",
"args": [
"C:/absolute/path/to/windows-computer-use-skill/dist/cli.js"
],
"env": {
"CLAUDE_COMPUTER_USE_DEBUG": "0",
"CLAUDE_COMPUTER_USE_COORDINATE_MODE": "pixels"
}
}
}
}
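The absolute path in the configuration above has to match the local checkout, so it can be convenient to generate the entry rather than edit it by hand. The sketch below is one way to do that; `build_mcp_config` is a hypothetical helper and not part of this repository, and the example checkout path is made up.

```python
import json
from pathlib import PureWindowsPath


def build_mcp_config(checkout_dir: str, debug: bool = False) -> dict:
    """Build the mcpServers entry for a given absolute checkout directory."""
    cli_path = PureWindowsPath(checkout_dir) / "dist" / "cli.js"
    if not cli_path.is_absolute():
        raise ValueError(f"checkout_dir must be an absolute Windows path: {checkout_dir}")
    return {
        "mcpServers": {
            "computer-use": {
                "command": "node",
                # Forward slashes avoid backslash escaping inside JSON.
                "args": [cli_path.as_posix()],
                "env": {
                    "CLAUDE_COMPUTER_USE_DEBUG": "1" if debug else "0",
                    "CLAUDE_COMPUTER_USE_COORDINATE_MODE": "pixels",
                },
            }
        }
    }


if __name__ == "__main__":
    # Example path only; substitute the real checkout location.
    print(json.dumps(build_mcp_config(r"C:\repos\windows-computer-use-skill"), indent=2))
```

`PureWindowsPath` is used so the path handling behaves the same when the script is run on a non-Windows machine.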
This repo ships a top-level skill at skill/computer-use-windows.
clawhub install computer-use-windows
powershell -ExecutionPolicy Bypass -File .\skill\computer-use-windows\scripts\install.ps1
bash skill/computer-use-windows/scripts/install.sh
After installation, the bundled project lives at:
%USERPROFILE%\.codex\skills\computer-use-windows\project
If CODEX_HOME is set, use that location instead.
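The location rule above can be expressed as a small lookup. This sketch assumes that `CODEX_HOME`, when set, takes the place of `%USERPROFILE%\.codex` and that the `skills\computer-use-windows\project` suffix stays the same; the README does not spell that out, and `resolve_project_dir` is a hypothetical name.

```python
from pathlib import PureWindowsPath
from typing import Mapping


def resolve_project_dir(env: Mapping[str, str]) -> PureWindowsPath:
    """Return the bundled project directory for the given environment mapping."""
    codex_home = env.get("CODEX_HOME")
    if codex_home:
        # Assumption: CODEX_HOME replaces the default %USERPROFILE%\.codex root.
        base = PureWindowsPath(codex_home)
    else:
        base = PureWindowsPath(env["USERPROFILE"]) / ".codex"
    return base / "skills" / "computer-use-windows" / "project"
```

In practice the function would be called with `os.environ`; taking a mapping as an argument keeps it easy to check without touching the real environment.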
Validated in this session:
npm run check
npm run build
runtime/windows_helper.py
Not yet validated in this session:
Windows does not require the same Accessibility / Screen Recording prompts as macOS, but desktop control can still be limited by:
This standalone runtime reports screenshotFiltering: none.
That means screenshot filtering is not compositor-native; gating still happens at the MCP layer.
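Because the screenshot itself is unfiltered, any restriction has to be enforced before a tool call reaches the desktop. The following is a rough sketch of what such a gate could look like, assuming a per-session allowlist of application names. `SessionGrants` and `gate_action` are hypothetical names, and the project's actual gating rules are not described in this README.

```python
from dataclasses import dataclass, field
from typing import Set


@dataclass
class SessionGrants:
    """Applications the user has approved for this session (hypothetical model)."""
    allowed_apps: Set[str] = field(default_factory=set)


def gate_action(frontmost_app: str, grants: SessionGrants) -> None:
    """Raise PermissionError unless the frontmost app is on the allowlist."""
    allowed = {name.casefold() for name in grants.allowed_apps}
    if frontmost_app.casefold() not in allowed:
        raise PermissionError(f"{frontmost_app} has not been granted for this session")
```

A tool handler would call `gate_action` first and only proceed to the screenshot or input step if no exception is raised.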
This repository is intentionally Windows-only.
Covered capabilities:
npm run build
node dist/cli.js
node --input-type=module -e "import { callPythonHelper } from './dist/computer-use/pythonBridge.js'; console.log(await callPythonHelper('list_displays', {}));"
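The last command above calls `list_displays` through the Python bridge. The wire format between `pythonBridge.ts` and `runtime/windows_helper.py` is not documented here, so the sketch below only illustrates the general pattern of a JSON command dispatcher on the Python side. The request shape (`command`, `args`), the response shape (`ok`, `result`, `error`), and the display data are all assumptions.

```python
import json
from typing import Any, Callable, Dict

# Registry of command name -> handler function (hypothetical design).
HANDLERS: Dict[str, Callable[[Dict[str, Any]], Any]] = {}


def handler(name: str):
    """Decorator that registers a function under a command name."""
    def register(fn):
        HANDLERS[name] = fn
        return fn
    return register


@handler("list_displays")
def list_displays(args: Dict[str, Any]) -> Any:
    # Placeholder data; a real helper would query the OS, for example via mss.
    return [{"id": 0, "width": 1920, "height": 1080, "primary": True}]


def handle_request(raw: str) -> str:
    """Decode one JSON request, dispatch it, and return a JSON response."""
    try:
        request = json.loads(raw)
    except json.JSONDecodeError as exc:
        return json.dumps({"ok": False, "error": f"invalid JSON: {exc.msg}"})
    if not isinstance(request, dict) or not isinstance(request.get("command"), str):
        return json.dumps({"ok": False, "error": "request must be an object with a string 'command'"})
    fn = HANDLERS.get(request["command"])
    if fn is None:
        return json.dumps({"ok": False, "error": f"unknown command: {request['command']}"})
    return json.dumps({"ok": True, "result": fn(request.get("args") or {})})
```

Returning errors as JSON rather than raising keeps the calling process able to distinguish a failed command from a crashed helper.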
src/
  computer-use/
    executor.ts
    hostAdapter.ts
    pythonBridge.ts
  vendor/computer-use-mcp/
runtime/
  windows_helper.py
  requirements.txt
skill/
  computer-use-windows/
examples/
assets/
CLAUDE_COMPUTER_USE_DEBUG=1
CLAUDE_COMPUTER_USE_COORDINATE_MODE=pixels
CLAUDE_COMPUTER_USE_CLIPBOARD_PASTE=1
CLAUDE_COMPUTER_USE_MOUSE_ANIMATION=1
CLAUDE_COMPUTER_USE_HIDE_BEFORE_ACTION=0
MIT
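The five `CLAUDE_COMPUTER_USE_*` variables listed above could be read into a typed settings object along these lines. This is a sketch only: the variable names come from this README, but the `Settings` class, the `load_settings` function, the rule that `"1"` means enabled, and the defaults used when a variable is unset are assumptions.

```python
from dataclasses import dataclass
from typing import Mapping


def _flag(env: Mapping[str, str], name: str, default: bool) -> bool:
    """Interpret '1' as enabled and any other value as disabled (assumed convention)."""
    raw = env.get(name)
    if raw is None:
        return default
    return raw.strip() == "1"


@dataclass(frozen=True)
class Settings:
    debug: bool
    coordinate_mode: str
    clipboard_paste: bool
    mouse_animation: bool
    hide_before_action: bool


def load_settings(env: Mapping[str, str]) -> Settings:
    """Build Settings from an environment mapping; defaults here are assumptions."""
    return Settings(
        debug=_flag(env, "CLAUDE_COMPUTER_USE_DEBUG", False),
        coordinate_mode=env.get("CLAUDE_COMPUTER_USE_COORDINATE_MODE", "pixels"),
        clipboard_paste=_flag(env, "CLAUDE_COMPUTER_USE_CLIPBOARD_PASTE", True),
        mouse_animation=_flag(env, "CLAUDE_COMPUTER_USE_MOUSE_ANIMATION", True),
        hide_before_action=_flag(env, "CLAUDE_COMPUTER_USE_HIDE_BEFORE_ACTION", False),
    )
```

Passing `os.environ` to `load_settings` gives the live configuration, while a plain dictionary is enough for checking individual combinations.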
This project preserves and adapts reusable TypeScript computer-use logic recovered from the Claude Code workflow, then replaces the missing private runtime with a fully standalone public Windows implementation.