Convert text to speech using minimax/speech-02-turbo with advanced voice control
The /audio-generate command converts text to speech using the Minimax Speech-02-Turbo model. Use it to create professional voiceovers, multilingual audio content, podcasts, audiobooks, and voice-assistant output, with extensive control over voice characteristics, emotion, and audio quality.
text - Text to convert to speech (max 5000 characters). Use <#x#> for pause control, where x is the pause length in seconds (0.01-99.99s)
pitch - Speech pitch: -12 to 12 (defaults to 0)
speed - Speech speed: 0.5 to 2 (defaults to 1)
volume - Speech volume: 0 to 10 (defaults to 1)
bitrate - Audio bitrate: 32000, 64000, 128000, 256000 (defaults to 128000)
channel - Audio channels: “mono”, “stereo” (defaults to “mono”)
emotion - Speech emotion: “auto”, “neutral”, “happy”, “sad”, “angry”, “fearful”, “disgusted”, “surprised” (defaults to “auto”)
voice_id - Voice selection (defaults to “Wise_Woman”). See available voices below
sample_rate - Sample rate: 8000, 16000, 22050, 24000, 32000, 44100 (defaults to 32000)
language_boost - Language enhancement (defaults to “None”). See language options below
english_normalization - Enable English text normalization for better number reading (boolean, defaults to false)
fileLinksExpireInDays - How long generated files remain accessible: 1-7 days (defaults to 7)
fileLinksExpireInMinutes - How long generated files remain accessible, in minutes (takes precedence over the fileLinksExpireInDays parameter)
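The documented ranges and defaults can be checked before a request is sent. The following is a minimal sketch, not part of the command itself: the helper name build_params is hypothetical, the numeric ranges are assumed to be inclusive at both ends, and dropping fileLinksExpireInDays when fileLinksExpireInMinutes is supplied is only this sketch's way of expressing the documented precedence.

```python
from typing import Any

MAX_TEXT_CHARS = 5000  # documented limit for the text parameter

# Defaults as documented in the parameter list.
DEFAULTS: dict[str, Any] = {
    "pitch": 0,
    "speed": 1,
    "volume": 1,
    "bitrate": 128000,
    "channel": "mono",
    "emotion": "auto",
    "voice_id": "Wise_Woman",
    "sample_rate": 32000,
    "language_boost": "None",
    "english_normalization": False,
    "fileLinksExpireInDays": 7,
}

# Numeric ranges; treating both endpoints as inclusive is an assumption.
RANGES = {
    "pitch": (-12, 12),
    "speed": (0.5, 2),
    "volume": (0, 10),
    "fileLinksExpireInDays": (1, 7),
}

# Parameters restricted to a fixed set of documented values.
CHOICES = {
    "bitrate": {32000, 64000, 128000, 256000},
    "channel": {"mono", "stereo"},
    "emotion": {"auto", "neutral", "happy", "sad",
                "angry", "fearful", "disgusted", "surprised"},
    "sample_rate": {8000, 16000, 22050, 24000, 32000, 44100},
}


def build_params(text: str, **overrides: Any) -> dict[str, Any]:
    """Hypothetical helper: merge overrides into the defaults and validate."""
    if len(text) > MAX_TEXT_CHARS:
        raise ValueError(f"text exceeds {MAX_TEXT_CHARS} characters")
    allowed = set(DEFAULTS) | {"fileLinksExpireInMinutes"}
    unknown = set(overrides) - allowed
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")

    params = {"text": text, **DEFAULTS, **overrides}
    for name, (low, high) in RANGES.items():
        if not low <= params[name] <= high:
            raise ValueError(f"{name} must be between {low} and {high}")
    for name, values in CHOICES.items():
        if params[name] not in values:
            raise ValueError(f"{name} must be one of {sorted(values)}")

    # Minutes take precedence over days, so this sketch omits days entirely.
    if "fileLinksExpireInMinutes" in params:
        params.pop("fileLinksExpireInDays")
    return params
```

For example, build_params("Hello there", speed=1.5, emotion="happy") returns a dictionary with the two overrides applied and every other parameter at its documented default, while build_params("Hello", pitch=13) raises ValueError.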
Use <#x#> notation for precise pause control, where x is the pause length in seconds (0.01-99.99s). For example, <#1.5#> inserts a 1.5 second pause.
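Pause markers are easy to mistype, so it can help to generate and check them in code. The sketch below is illustrative only: the helper names pause and invalid_pauses are hypothetical, the regular expression assumes markers are written as plain decimal numbers, and the 0.01-99.99 bounds are assumed to be inclusive.

```python
import re

# Matches markers such as <#1.5#> or <#20#> (plain decimal numbers only).
PAUSE_PATTERN = re.compile(r"<#(\d+(?:\.\d+)?)#>")

MIN_PAUSE = 0.01  # seconds, documented lower bound
MAX_PAUSE = 99.99  # seconds, documented upper bound


def pause(seconds: float) -> str:
    """Return a pause marker for the given length, rounded to 2 decimals."""
    value = round(float(seconds), 2)
    if not MIN_PAUSE <= value <= MAX_PAUSE:
        raise ValueError(f"pause must be {MIN_PAUSE}-{MAX_PAUSE} seconds")
    return f"<#{value}#>"


def invalid_pauses(text: str) -> list[str]:
    """List pause markers in the text whose length is out of range."""
    return [
        match.group(0)
        for match in PAUSE_PATTERN.finditer(text)
        if not MIN_PAUSE <= float(match.group(1)) <= MAX_PAUSE
    ]


script = f"Welcome back.{pause(1.5)}Today we look at pause control."
```

Here script contains the marker <#1.5#> between the two sentences, and invalid_pauses("Wait<#120#>now") reports the out-of-range marker <#120#>.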