Converts text to speech using various voice models and formats.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Request to generate audio from text.
The text to generate audio for. The maximum length is 4096 characters.
1 - 4096"Hello, this is a test of the text to speech system."
The language of the input text. Only supported by Qwen 3 TTS models. If not specified, the language is auto-detected.
Auto, English, Chinese, Spanish, French, German, Italian, Japanese, Korean, Portuguese, Russian "English"
The model ID of a Venice TTS model.
tts-kokoro, tts-qwen3-0-6b, tts-qwen3-1-7b "tts-kokoro"
A style prompt to control the emotion and delivery of the speech. Only supported by Qwen 3 TTS models. Examples: "Very happy.", "Sad and slow.", "Excited and energetic."
500"Very happy."
The format to audio in.
mp3, opus, aac, flac, wav, pcm "mp3"
The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.
0.25 <= x <= 41
Should the content stream back sentence by sentence or be processed and returned as a complete audio file.
true
Sampling temperature for speech generation. Higher values produce more varied output. Only supported by Qwen 3 TTS models. Default is 0.9.
0 <= x <= 20.9
Nucleus sampling parameter. Only supported by Qwen 3 TTS models. Default is 1.0.
0 <= x <= 11
The voice to use when generating the audio. Voices are model-specific: Kokoro voices (e.g. af_sky, af_bella, am_adam) work with tts-kokoro; Qwen 3 voices (e.g. Vivian, Serena, Dylan, Eric, Ryan, Aiden) work with tts-qwen3-0-6b and tts-qwen3-1-7b. Using an incompatible voice returns a 400 error.
af_alloy, af_aoede, af_bella, af_heart, af_jadzia, af_jessica, af_kore, af_nicole, af_nova, af_river, af_sarah, af_sky, am_adam, am_echo, am_eric, am_fenrir, am_liam, am_michael, am_onyx, am_puck, am_santa, bf_alice, bf_emma, bf_lily, bm_daniel, bm_fable, bm_george, bm_lewis, zf_xiaobei, zf_xiaoni, zf_xiaoxiao, zf_xiaoyi, zm_yunjian, zm_yunxi, zm_yunxia, zm_yunyang, ff_siwis, hf_alpha, hf_beta, hm_omega, hm_psi, if_sara, im_nicola, jf_alpha, jf_gongitsune, jf_nezumi, jf_tebukuro, jm_kumo, pf_dora, pm_alex, pm_santa, ef_dora, em_alex, em_santa, Vivian, Serena, Ono_Anna, Sohee, Uncle_Fu, Dylan, Eric, Ryan, Aiden "af_sky"
Audio content generated successfully
The response is of type file.