Generate speech
Audio
TTS (Text to Speech) (Beta)
Generate TTS audio from text using the specified voice and format.
POST
Voice Mapping
voice is normalized by provider with an internal alias map:
- openai: common voices (for example alloy, ash, ballad, coral, echo, fable, onyx, nova, sage, shimmer, verse, cedar, marin).
- google (AI Studio / Gemini TTS): canonical prebuilt voices: Zephyr, Puck, Charon, Kore, Fenrir, Leda, Orus, Aoede, Callirrhoe, Autonoe, Enceladus, Iapetus, Umbriel, Algieba, Despina, Erinome, Algenib, Rasalgethi, Laomedeia, Achernar, Alnilam, Schedar, Gacrux, Pulcherrima, Achird, Zubenelgenubi, Vindemiatrix, Sadachbia, Sadaltager, Sulafat.
- elevenlabs: common public starter voices are mapped (for example rachel, domi, bella, antoni, elli, josh, arnold, adam, sam).
If voice is not valid for the routed provider/model mapping, the request returns 400 invalid_request_error with param: "voice".
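The validation failure described above can be detected on the client side. The following is a minimal sketch, not the service's documented error schema: it assumes the 400 response body is JSON with an error object carrying type and param fields, and falls back to top-level fields. Only the 400 status, invalid_request_error, and param: "voice" come from this page; the helper name is_invalid_voice_error is hypothetical.

```python
import json


def is_invalid_voice_error(status_code: int, body: str) -> bool:
    """Return True when a response looks like the 400 voice-validation error."""
    if status_code != 400:
        return False
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        return False
    if not isinstance(payload, dict):
        return False
    # Assumed envelope: {"error": {"type": ..., "param": ...}}.
    # If there is no "error" key, look for the fields at the top level.
    error = payload.get("error", payload)
    if not isinstance(error, dict):
        return False
    return (
        error.get("type") == "invalid_request_error"
        and error.get("param") == "voice"
    )
```

A caller could use this check to retry with a different voice or to surface a clearer message to the user.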
Provider Overrides
You can still pass provider-native voice settings:
- ElevenLabs: config.elevenlabs.voice_id / config.elevenlabs.voice / config.elevenlabs.voiceName
- Google: config.google.voice_name / config.google.voiceName
Use the top-level voice parameter by default, since it is portable across providers.
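As an illustration of how the portable voice field and the provider overrides might sit together in one request body, here is a hedged sketch. The config.elevenlabs.voice_id and config.google.voice_name paths come from this page; the input field name for the text and the helper name build_speech_body are assumptions, so check the Body schema before relying on them.

```python
from typing import Optional


def build_speech_body(
    text: str,
    voice: str,
    elevenlabs_voice_id: Optional[str] = None,
    google_voice_name: Optional[str] = None,
) -> dict:
    """Build a JSON-serializable request body with optional provider overrides."""
    # "input" is an assumed field name for the text to synthesize.
    body = {"input": text, "voice": voice}

    # Provider-native settings are nested under "config", keyed by provider.
    config = {}
    if elevenlabs_voice_id is not None:
        config["elevenlabs"] = {"voice_id": elevenlabs_voice_id}
    if google_voice_name is not None:
        config["google"] = {"voice_name": google_voice_name}

    # Only attach "config" when at least one override was supplied.
    if config:
        body["config"] = config
    return body
```

Leaving both override arguments unset produces a body that relies only on the portable voice value.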
Voice Samples
- OpenAI: openai.fm
- Google TTS voice examples: Gemini text-to-speech docs
- ElevenLabs voice library: ElevenLabs Voice Library
Authorizations
Bearer token authentication
Body
application/json
Response
200 - audio/mpeg
Audio file
The response is of type file.
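Putting the authorization, body, and response sections together, a request might look like the sketch below. It uses only the Python standard library. The endpoint URL is a placeholder because this page does not state it, and the helper names build_speech_request and save_speech are hypothetical; the Bearer header, the application/json body, and the audio/mpeg response type are taken from this page.

```python
import json
import urllib.request

# Placeholder only: this page does not state the endpoint URL.
SPEECH_URL = "https://api.example.com/REPLACE_WITH_SPEECH_ENDPOINT"


def build_speech_request(
    api_key: str, body: dict, url: str = SPEECH_URL
) -> urllib.request.Request:
    """Prepare a POST request with Bearer auth and a JSON body."""
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
            "Accept": "audio/mpeg",
        },
        method="POST",
    )


def save_speech(request: urllib.request.Request, out_path: str) -> None:
    """Send the request and write the returned audio bytes to out_path."""
    # urlopen raises urllib.error.HTTPError for non-2xx statuses such as 400.
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as out:
        out.write(response.read())
```

Splitting request construction from sending keeps the first function free of network access, which makes it easy to inspect or test before calling save_speech against a real endpoint.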
Last modified on April 21, 2026