Piper TTS MCP Server
Integrates Piper TTS into the Model Context Protocol, allowing AI assistants to convert text to speech and play it through speakers with customizable voice settings and volume control.
README Documentation
🎤 Piper TTS MCP Server
A Model Context Protocol (MCP) server that integrates Piper TTS for high-quality text-to-speech functionality. This server provides a speak
tool that converts text to speech and plays it directly through your speakers with customizable volume control.
✨ Features
- 🔊 High-quality text-to-speech using Piper TTS
- 🎚️ Volume control (0.01 to 1.00)
- ⚡ Memory-based audio playback (no temporary files)
- 🔧 Customizable voice parameters (speaker, speed, voice variation)
- 🚀 Fast and lightweight MCP integration
- 🔇 Silent operation (no console output)
📋 Requirements
⚠️ Important Prerequisites:
- Python 3.12+ installed on your system
- Piper TTS service running on
localhost:5000
You can set this up using the included Docker configuration or by running Piper TTS separately.
🚀 Quick Start
1. 🐳 Start Piper TTS Service
Clone this repo and start the TTS service:
git clone https://github.com/CryptoDappDev/piper-tts-mcp.git
cd piper-tts-mcp
Option A: Using Docker Compose (Recommended)
docker compose up -d
Option B: Using Docker Build
docker build -t piper-tts-mcp .
docker run -p 5000:5000 piper-tts-mcp
2. 🔧 Configure MCP Client
Add the server to your MCP configuration (e.g., .mcp.json
for Claude Desktop):
{
"mcpServers": {
"speak": {
"command": "uv",
"args": [
"--directory",
"/path/to/your/piper-tts-mcp",
"run",
"server.py"
]
}
}
}
3. 🎉 Enjoy!
The speak
tool is now available in your MCP client!
🛠️ Usage
The MCP server provides a speak
tool with the following parameters:
Parameter | Type | Default | Description |
---|---|---|---|
text | string | required | Text to convert to speech |
speaker_id | int | 0 | Voice speaker ID |
length_scale | float | 1.1 | Speech speed (lower = faster) |
noise_scale | float | 0.667 | Voice variation control |
noise_w_scale | float | 0.333 | Pronunciation variation |
volume | float | 0.15 | Volume level (0.01 to 1.00) |
Example Usage
# Basic usage
speak("Hello, world!")
# With custom volume
speak("This is louder!", volume=0.5)
# With custom voice settings
speak("Fast and varied speech",
length_scale=0.8,
volume=0.3,
noise_scale=0.8)
🔧 Development
Dependencies
- Python 3.12+
mcp[cli]
- MCP frameworkrequests
- HTTP client for TTS APIpygame
- Audio playback
Local Development
# Install dependencies
uv sync
# Run the server
uv run server.py
📦 Docker Configuration
The included Docker setup provides:
- Piper TTS service on port 5000
- Pre-configured voice models
- Automatic startup
🎙️ Voice Models
The default voice model used in this repository is en_GB-cori-high
(British English, female voice).
🔍 Exploring Voice Options
- Voice Samples: Listen to different voice models at Piper Voice Samples
- Piper Documentation: Learn more about Piper TTS at rhasspy/piper
🛠️ Changing Voice Models
To use a different voice model:
- Choose a voice from the voice samples page
- Update the Dockerfile - Replace
en_GB-cori-high
with your chosen voice model:# Download your preferred voice model RUN python3 -m piper.download_voices your-chosen-voice-model # Update the server command CMD ["sh", "-c", "python3 -m piper.http_server -m your-chosen-voice-model"]
- Rebuild the Docker image:
docker build -t piper-tts-mcp . docker compose up -d
🎨 Creating Custom Voices
Creating your own custom voice requires additional effort and research. Please refer to the Piper documentation for guidance on voice training and customization.
🤝 Contributing
Contributions are welcome! Please feel free to submit issues and pull requests.
📄 License
This project is open source and available under the MIT License.
🙏 Acknowledgments
- Piper TTS - Fast, local neural text to speech
- Model Context Protocol - Standardized protocol for AI model interactions
Made with ❤️ for the MCP community