The Gemini Audio MCP server brings enterprise-grade generative audio directly to your AI assistant. Built in high-performance Rust, it leverages Google's state-of-the-art models to provide a unified bridge for environmental sound design, expressive narration, and professional music production.

✨ Key Capabilities

* 🎙️ Infinite Soundscapes: Generate complex, immersive environmental audio using the Gemini 2.0 Multimodal Live API.
* 🎵 Music & SFX: Create high-fidelity rhythmic loops, full songs, and discrete foley cues via Google's Lyria 3 Pro and Clip models.
* 🗣️ Expressive Voice: Convert text to speech with natural voice direction and emotional nuances.
* 🎲 Seamless Looping: Features a proprietary 100ms micro-crossfade algorithm to ensure click-free, non-repeating background audio.
* 🎭 Cinematic Transitions: Smoothly blend and crossfade between two distinct audio prompts for dynamic environment changes.
* 🎛️ Universal Encoding: Direct Stdin-to-FFmpeg piping allows for zero-latency transcoding into 10+ formats (MP3, OGG, FLAC, OPUS, WAV, etc.).

🎮 Use Cases

* Game Developers (UE5, Godot, Blender): Instantly generate procedural soundscapes and NPC dialogue lines during development.
* Content Creators: Automate foley and background texture generation for video projects.
* Productivity: Enhance your AI workspace with high-quality narration and focus-oriented ambient audio.

---

🛠️ Requirements

* FFmpeg: Must be installed on the system path for audio transcoding.
* API Key: A valid Google AI Studio (Gemini) API Key.
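The Seamless Looping capability above is described as proprietary, so its actual implementation is not shown here. As a rough illustration of the general idea only, the Python sketch below applies a plain linear crossfade at the loop seam: the last 100 ms of a clip is blended into its first 100 ms so that playback can wrap around without an audible jump. The function name `make_seamless_loop`, the mono float sample format, and the linear fade curve are all assumptions made for this example, not details of the server.

```python
def make_seamless_loop(samples, sample_rate, fade_ms=100):
    """Return a loopable copy of a mono clip (hypothetical helper).

    The clip's tail is crossfaded into its head, so the returned list
    is shorter than the input by one fade length.
    """
    n = int(sample_rate * fade_ms / 1000)  # fade length in samples
    if n <= 0 or 2 * n > len(samples):
        raise ValueError("clip is too short for the requested crossfade")

    head = samples[:n]
    body = samples[n:len(samples) - n]
    tail = samples[len(samples) - n:]

    # Linear crossfade: start fully on the tail, end almost fully on the head.
    blended = []
    for i in range(n):
        t = i / n
        blended.append(tail[i] * (1.0 - t) + head[i] * t)

    # Playing `blended + body` on repeat keeps both seams continuous:
    # body -> blended continues the original tail, and blended -> body
    # continues the original head.
    return blended + body
```

For a 24 kHz clip, `make_seamless_loop(clip, 24000)` would use a 2400-sample fade. An equal-power curve is often preferred over a linear one for uncorrelated material; the linear version is used here only because it is the simplest to read.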
Smithery CLI connects your agents to thousands of skills and MCP servers directly from the command line. To get started, simply run npx skills add smithery/cli.
npm install -g smithery@latest
Requires Node.js 20+.
smithery mcp search [term] # Search the Smithery registry
smithery mcp add <url> # Add an MCP server connection
smithery mcp list # List your connections
smithery mcp remove <ids...> # Remove connections
Interact with tools from MCP servers connected via smithery mcp.
smithery tool list [connection] # List tools from your connected MCP servers
smithery tool find [query] # Search tools by name or intent
smithery tool get <connection> <tool> # Show full details for one tool
smithery tool call <connection> <tool> [args] # Call a tool
Browse skills on the Smithery Skills Registry and install them with the upstream installer:
npx skills add <skill> # e.g. npx skills add smithery-ai/cli
smithery auth login # Login with Smithery (OAuth)
smithery auth logout # Log out
smithery auth whoami # Check current user
smithery auth token # Mint a service token
smithery auth token --policy '<json>' # Mint a restricted token
smithery namespace list # List your namespaces
smithery namespace use <name> # Set current namespace
smithery mcp publish <url> -n <org/server> # Publish an MCP server URL
smithery mcp publish <bundle.mcpb> -n <org/server> # Publish an MCP bundle
# Search and connect to an MCP server
smithery mcp search "github"
smithery mcp add github --id github
# Find and call tools from your connected MCP servers
smithery tool find "create issue"
smithery tool call github create_issue '{"title":"Bug fix","body":"..."}'
# Browse and install skills
smithery skill search "frontend" --json --page 2
smithery skill add anthropics/frontend-design --agent claude-code
# Publish your MCP server URL
smithery mcp publish https://my-mcp-server.com -n myorg/my-server
# Publish a built MCP bundle
smithery mcp publish ./server.mcpb -n myorg/my-server
git clone https://github.com/smithery-ai/cli
cd cli && pnpm install && pnpm run build
npx . --help
Contributions welcome! Please submit a Pull Request.
Run in your terminal:
claude mcp add gemini-audio-mcp -- npx -y @smithery/cli run jxoesneon/gemini-audio-mcp