MiniMax Speech 2.8 Turbo Text to Speech
Create a text-to-speech generation task using the speech-2.8-turbo model.
Overview
Generate speech from text using the MiniMax Speech 2.8 Turbo model. Speech 2.8 Turbo delivers fast, high-quality synthesis:
- Fast generation: optimized for speed with Turbo-tier performance
- Pause tags: insert precise pauses with <#x#> syntax (x = 0.01-99.99 seconds)
- Interjection tags: add natural expressions like (laughs), (sighs), (coughs), (clears throat), (gasps), (sniffs), (groans), (yawns)
- Voice settings: control speed, volume, pitch, and emotion
- 40+ languages: extensive language support with language boost
- Audio customization: configurable format (MP3/PCM/FLAC), sample rate, channel, and bitrate
- Voice modification: fine-tune pitch, intensity, and timbre
- Pronunciation dictionary: custom pronunciation replacements
- Loudness normalization: professional audio level control
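The pause and interjection tags are plain text markers, so they can be assembled and checked on the client before a request is sent. The following is a minimal Python sketch of that idea. The helper names (`pause`, `interjection`, `invalid_pauses`) are hypothetical and not part of the API; only the tag syntax, the 0.01-99.99 second range, and the interjection names come from the list above.

```python
import re

# Limits and tag names taken from the feature list above.
PAUSE_MIN_S = 0.01
PAUSE_MAX_S = 99.99
INTERJECTIONS = frozenset({
    "laughs", "sighs", "coughs", "clears throat",
    "gasps", "sniffs", "groans", "yawns",
})

# Matches any <#...#> tag so malformed durations can be reported as well.
_PAUSE_TAG = re.compile(r"<#(.*?)#>")


def pause(seconds: float) -> str:
    """Return a pause tag such as <#1.5#> for a duration in the documented range."""
    if not PAUSE_MIN_S <= seconds <= PAUSE_MAX_S:
        raise ValueError(
            f"pause must be {PAUSE_MIN_S}-{PAUSE_MAX_S} seconds, got {seconds}"
        )
    # Round to two decimals and drop trailing zeros (1.50 -> 1.5).
    return f"<#{round(seconds, 2):g}#>"


def interjection(name: str) -> str:
    """Return an interjection tag such as (laughs), rejecting unlisted names."""
    if name not in INTERJECTIONS:
        raise ValueError(f"unsupported interjection: {name!r}")
    return f"({name})"


def invalid_pauses(text: str) -> list[str]:
    """Return the contents of every pause tag that is not a number in range."""
    bad = []
    for raw in _PAUSE_TAG.findall(text):
        try:
            ok = PAUSE_MIN_S <= float(raw) <= PAUSE_MAX_S
        except ValueError:
            ok = False
        if not ok:
            bad.append(raw)
    return bad
```

For example, `"Hello" + pause(0.5) + " there " + interjection("laughs")` produces `Hello<#0.5#> there (laughs)`, and `invalid_pauses` can be run over user-supplied text to flag tags the service would likely reject.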
Supported Voice IDs
Speech 2.8 Turbo supports the same 223+ system voice IDs as Speech 2.8 HD, covering 20+ languages including Chinese (Mandarin), Chinese (Cantonese), English, Japanese, Korean, Spanish, Portuguese, French, Indonesian, German, Russian, Italian, Arabic, Turkish, Ukrainian, Dutch, Vietnamese, Thai, Polish, Romanian, Greek, Czech, Finnish, and Hindi. See the Speech 2.8 HD documentation for the complete voice ID list.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
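As a small illustration of the header format described above, the Python sketch below builds the `Authorization` header from a token. The function name `auth_headers` and the environment variable `MULEROUTER_API_KEY` are assumptions made for this example; this page does not name them.

```python
import os


def auth_headers(token: str) -> dict[str, str]:
    """Build the Bearer authentication header for a request."""
    if not token or token != token.strip():
        raise ValueError("token must be non-empty with no surrounding whitespace")
    return {"Authorization": f"Bearer {token}"}


if __name__ == "__main__":
    # MULEROUTER_API_KEY is a hypothetical variable name used for illustration.
    token = os.environ.get("MULEROUTER_API_KEY", "")
    if token:
        # Print only the header names so the token is never echoed.
        print(sorted(auth_headers(token)))
```

Reading the token from the environment keeps it out of source control; the resulting dictionary can be passed to whichever HTTP client is in use.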
Body
Text to convert to speech. Use <#x#> for pauses (x = 0.01-99.99 seconds). Supports interjection tags: (laughs), (sighs), (coughs), (clears throat), (gasps), (sniffs), (groans), (yawns).
Length: 1 - 10000
Voice configuration settings (optional, defaults to Wise_Woman).
Audio configuration settings.
Enhance recognition of specified languages and dialects.
Available options: Chinese, Chinese,Yue, English, Arabic, Russian, Spanish, French, Portuguese, German, Turkish, Dutch, Ukrainian, Vietnamese, Indonesian, Japanese, Italian, Korean, Thai, Polish, Romanian, Greek, Czech, Finnish, Hindi, Bulgarian, Danish, Hebrew, Malay, Slovak, Swedish, Croatian, Hungarian, Norwegian, Slovenian, Catalan, Nynorsk, Afrikaans, auto
Format of the output content (non-streaming only). Default: hex.
Available options: url, hex
Custom pronunciation dictionary for text replacement.
Loudness normalization settings for the audio.
Loudness normalization settings for the audio.
Voice modification settings to adjust pitch, intensity, and timbre.
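Some of the body constraints above can be checked locally before a task is created, and hex output needs decoding before it can be saved as audio. The Python sketch below shows one way to do both. It assumes the 1 - 10000 limit counts characters, and the helper names are hypothetical. Because this page does not show the request or response field names, the helpers take plain values and do not build a full request body.

```python
MAX_TEXT_LENGTH = 10000          # upper bound of the documented 1 - 10000 range
OUTPUT_FORMATS = ("url", "hex")  # documented output formats; hex is the default


def check_text(text: str) -> str:
    """Return text unchanged if its length is within 1 - 10000 characters."""
    if not 1 <= len(text) <= MAX_TEXT_LENGTH:
        raise ValueError(
            f"text length must be 1-{MAX_TEXT_LENGTH}, got {len(text)}"
        )
    return text


def check_output_format(value: str = "hex") -> str:
    """Return value if it is one of the documented output formats."""
    if value not in OUTPUT_FORMATS:
        raise ValueError(f"output format must be one of {OUTPUT_FORMATS}")
    return value


def decode_hex_audio(hex_audio: str) -> bytes:
    """Convert hex-encoded audio into raw bytes ready to write to a file."""
    # bytes.fromhex raises ValueError if the string is not valid hex.
    return bytes.fromhex(hex_audio)
```

For instance, `decode_hex_audio("494433")` yields `b"ID3"`, the tag that commonly opens an MP3 file. With the url format no decoding is needed, since the audio is downloaded from the returned address instead.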
Response
Accepted - Task created successfully

