Audio
Speech API (Beta)
Converts text to speech using various voice models and formats.
POST
Authorizations
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Body
application/json
Request to generate audio from text.
The text to generate audio for. The maximum length is 4096 characters.
Required string length:
1 - 4096
Example:
"Hello, this is a test of the text to speech system."
The model ID of a Venice TTS model.
Available options:
tts-kokoro
Example:
"tts-kokoro"
The voice to use when generating the audio.
Available options:
af_alloy
, af_aoede
, af_bella
, af_heart
, af_jadzia
, af_jessica
, af_kore
, af_nicole
, af_nova
, af_river
, af_sarah
, af_sky
, am_adam
, am_echo
, am_eric
, am_fenrir
, am_liam
, am_michael
, am_onyx
, am_puck
, am_santa
, bf_alice
, bf_emma
, bf_lily
, bm_daniel
, bm_fable
, bm_george
, bm_lewis
, ef_dora
, em_alex
, em_santa
, ff_siwis
, hf_alpha
, hf_beta
, hm_omega
, hm_psi
, if_sara
, im_nicola
, jf_alpha
, jf_gongitsune
, jf_nezumi
, jf_tebukuro
, jm_kumo
, pf_dora
, pm_alex
, pm_santa
, zf_xiaobei
, zf_xiaoni
, zf_xiaoxiao
, zf_xiaoyi
, zm_yunjian
, zm_yunxi
, zm_yunxia
, zm_yunyang
Example:
"af_sky"
The format to audio in.
Available options:
mp3
, opus
, aac
, flac
, wav
, pcm
Example:
"mp3"
The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.
Required range:
0.25 <= x <= 4
Example:
1
Should the content stream back sentence by sentence or be processed and returned as a complete audio file.
Example:
true
Response
200
audio/mpeg
Audio content generated successfully
The response is of type file
.