A Model Context Protocol server for FlowSpeech text-to-speech. It lets MCP-compatible clients generate human-like audio with context-aware emotion control, pause control, multi-speaker dialogue, and 30+ available voices.
An MCP (Model Context Protocol) server that brings FlowSpeech text-to-speech capabilities directly into your LLM workflows.
FlowSpeech is a context-aware text-to-speech tool that converts text into human-like audio with emotion control (for example ***(say cheerfully: Hello!)***), pause control, multi-speaker dialogue, and 30+ voices.
Run the MCP server directly with npx (no install needed):
npx mcp-flowspeech-server
Or install globally:
npm install -g mcp-flowspeech-server
Add the server to your MCP client config (e.g. Claude Desktop's claude_desktop_config.json):
{
  "mcpServers": {
    "flowspeech": {
      "command": "npx",
      "args": ["-y", "mcp-flowspeech-server"],
      "env": {
        "FLOWSPEECH_OUTPUT_DIR": "~/flowspeech-audio"
      }
    }
  }
}
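If the client config already lists other servers, the entry has to be merged in rather than pasted over the file. A minimal Python sketch of that merge follows. `add_flowspeech_server` is a hypothetical helper, not part of the package, and expanding `~` before writing the value is an assumption: the source does not say whether the server expands a leading tilde itself.

```python
import json
import os


def add_flowspeech_server(config: dict, output_dir: str) -> dict:
    """Return a copy of an MCP client config with a flowspeech entry added."""
    merged = json.loads(json.dumps(config))  # deep copy of JSON-like data
    servers = merged.setdefault("mcpServers", {})
    servers["flowspeech"] = {
        "command": "npx",
        "args": ["-y", "mcp-flowspeech-server"],
        # Assumption: write an absolute path so nothing depends on "~" expansion.
        "env": {"FLOWSPEECH_OUTPUT_DIR": os.path.expanduser(output_dir)},
    }
    return merged


# Hypothetical existing config with one unrelated server.
existing = {"mcpServers": {"other": {"command": "node", "args": ["other.js"]}}}
print(json.dumps(add_flowspeech_server(existing, "~/flowspeech-audio"), indent=2))
```

The helper leaves the input untouched and returns a new dict, so the result can be reviewed before it is written back to disk.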
| Variable | Description | Default |
|---|---|---|
| FLOWSPEECH_OUTPUT_DIR | Directory to save generated audio files | ~/.flowspeech-mcp/audio |
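To find generated files from a script, it helps to resolve the output directory the same way the table describes: use the variable if set, otherwise the default. The sketch below is an illustration of that rule only. `resolve_output_dir` is a hypothetical name, and treating an empty value as unset is an assumption, not documented server behavior.

```python
import os
from pathlib import Path

DEFAULT_OUTPUT_DIR = "~/.flowspeech-mcp/audio"  # default listed in the table


def resolve_output_dir(env=None) -> Path:
    """Pick FLOWSPEECH_OUTPUT_DIR if set, else the documented default."""
    env = os.environ if env is None else env
    # Assumption: an empty string counts as "not set".
    raw = env.get("FLOWSPEECH_OUTPUT_DIR") or DEFAULT_OUTPUT_DIR
    return Path(raw).expanduser()


print(resolve_output_dir())
```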
flowspeech_tts
Convert text to speech with a single voice.
Parameters:
| Name | Type | Required | Description |
|---|---|---|---|
| text | string | ✅ | Text to synthesize. Supports emotion stage directions. |
| voice | string | | Voice name (default: Kore) |
| output_path | string | | Custom file path for the output audio |
Example:
Convert this to audio with a cheerful tone using the Puck voice:
"***(say cheerfully: Good morning, everyone!)*** Today we're going to explore something amazing."
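Behind a prompt like this, an MCP client sends a JSON-RPC `tools/call` request to the server. A minimal sketch of that payload in Python, assuming the argument names from the parameter table; `build_tts_call` is a hypothetical helper, and transport details (stdio framing, initialization) are left out:

```python
import json


def build_tts_call(text, voice=None, output_path=None, request_id=1):
    """Build an MCP tools/call request for the flowspeech_tts tool."""
    arguments = {"text": text}
    # Optional parameters are omitted so the server can apply its defaults.
    if voice is not None:
        arguments["voice"] = voice
    if output_path is not None:
        arguments["output_path"] = output_path
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "flowspeech_tts", "arguments": arguments},
    }


request = build_tts_call(
    "***(say cheerfully: Good morning, everyone!)*** "
    "Today we're going to explore something amazing.",
    voice="Puck",
)
print(json.dumps(request, indent=2))
```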
flowspeech_tts_multi
Convert a two-speaker dialogue to speech.
Parameters:
| Name | Type | Required | Description |
|---|---|---|---|
| text | string | ✅ | Dialogue with Speaker1: and Speaker2: prefixes |
| voice_a | string | | Voice for Speaker1 (default: Charon) |
| voice_b | string | | Voice for Speaker2 (default: Kore) |
| output_path | string | | Custom file path for the output audio |
Example:
Speaker1: Welcome to the show! I'm your host.
Speaker2: And I'm your co-host. Great to be here!
Speaker1: Today we're talking about AI voices.
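When the dialogue comes from structured data, the `text` argument can be assembled programmatically. The sketch below assumes one turn per line, as in the example; `format_dialogue` is a hypothetical helper and not part of the server.

```python
def format_dialogue(turns):
    """Join (speaker_number, line) pairs into Speaker1:/Speaker2: text."""
    lines = []
    for speaker, line in turns:
        # The tool is documented for two speakers only.
        if speaker not in (1, 2):
            raise ValueError(f"speaker must be 1 or 2, got {speaker!r}")
        lines.append(f"Speaker{speaker}: {line.strip()}")
    return "\n".join(lines)


script = format_dialogue([
    (1, "Welcome to the show! I'm your host."),
    (2, "And I'm your co-host. Great to be here!"),
    (1, "Today we're talking about AI voices."),
])
print(script)
```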
flowspeech_list_voices
List all available voices.
Parameters:
| Name | Type | Description |
|---|---|---|
| gender | male \| female \| all | Filter by gender (default: all) |
| Voice | Gender | Style |
|---|---|---|
| Puck | Male | Upbeat and playful |
| Fenrir | Male | Excitable and energetic |
| Zephyr | Female | Bright and cheerful |
| Autonoe | Female | Bright and welcoming |
| Charon | Male | Informative and narrative |
| Kore | Female | Firm and focused |
| Orus | Male | Firm and decisive |
| Iapetus | Male | Clear and professional |
| Umbriel | Male | Easy-going and relaxed |
| Callirrhoe | Female | Easy-going and soothing |
| Sulafat | Female | Warm and inviting |
| Achernar | Female | Soft and tender |
| Gacrux | Female | Mature and experienced |
| … | … | 30+ total — use flowspeech_list_voices to see all |
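The `gender` filter can be illustrated with a local copy of the rows shown above. This is a sketch of the filtering rule only: the list covers just the 13 voices in this table, and `list_voices` is a hypothetical stand-in, not the server's implementation.

```python
# Subset of voices copied from the table; the server reports 30+ in total.
VOICES = [
    ("Puck", "male", "Upbeat and playful"),
    ("Fenrir", "male", "Excitable and energetic"),
    ("Zephyr", "female", "Bright and cheerful"),
    ("Autonoe", "female", "Bright and welcoming"),
    ("Charon", "male", "Informative and narrative"),
    ("Kore", "female", "Firm and focused"),
    ("Orus", "male", "Firm and decisive"),
    ("Iapetus", "male", "Clear and professional"),
    ("Umbriel", "male", "Easy-going and relaxed"),
    ("Callirrhoe", "female", "Easy-going and soothing"),
    ("Sulafat", "female", "Warm and inviting"),
    ("Achernar", "female", "Soft and tender"),
    ("Gacrux", "female", "Mature and experienced"),
]


def list_voices(gender="all"):
    """Return voice names matching gender: 'male', 'female', or 'all'."""
    if gender not in ("male", "female", "all"):
        raise ValueError(f"unknown gender filter: {gender!r}")
    return [name for name, g, _style in VOICES if gender in ("all", g)]


print(list_voices("female"))
```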
FlowSpeech supports inline emotion control using the format ***(say <style>: text)***:
***(say cheerfully: Good morning!)***
Today I want to talk about something important.
***(say solemnly: This affects all of us.)***
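Scripts that mix styled and plain lines can be built with a small formatter for this markup. A minimal sketch, assuming the format is exactly as shown above; `say` is a hypothetical helper, and rejecting a colon in the style name is a precaution of this sketch, since the source does not say how the server parses one.

```python
def say(style, text):
    """Wrap text in FlowSpeech's inline emotion markup."""
    style, text = style.strip(), text.strip()
    if not style or not text:
        raise ValueError("style and text must be non-empty")
    # Assumption: the first colon separates style from text, so keep it unambiguous.
    if ":" in style:
        raise ValueError("style must not contain ':'")
    return f"***(say {style}: {text})***"


script = "\n".join([
    say("cheerfully", "Good morning!"),
    "Today I want to talk about something important.",
    say("solemnly", "This affects all of us."),
])
print(script)
```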
License: MIT
from github.com/waeckerlinfederowicz66-sketch/mcp-flowspeech-server