Generate natural-sounding speech from text or SSML in 50+ languages and 380+ voices. Supports pitch and speaking-rate control, audio effects profiles, MP3/WAV/OGG output, long-form synthesis, and Neural2/Studio voice models for production audio.
Use for voiceovers, IVR and phone prompts, accessibility audio, audiobook generation, language learning, pronunciation previews, product narration, support bots, podcast snippets, SSML-controlled speech, and long-form narration.
| Method | Path | Description |
|---|---|---|
| POST | v1/text:synthesize | Synthesizes speech synchronously: results are returned after all text input has been processed. |
| GET | v1/voices | Returns the list of voices supported for synthesis. |
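
As a rough illustration of the POST endpoint above, the following Python sketch builds a request for `v1/text:synthesize` without sending it and decodes a response. Several details are assumptions not stated in this document: the base URL, bearer-token authentication, the JSON field names (`input`, `voice`, `audioConfig`, `audioContent`), and the helper names `build_synthesize_request` and `decode_audio`, which are hypothetical. Check them against the official API reference before relying on them.

```python
import base64
import json
import urllib.request

# Assumption: the service's base URL; this document lists only relative paths.
BASE_URL = "https://texttospeech.googleapis.com/"


def build_synthesize_request(text, access_token, language_code="en-US",
                             audio_encoding="MP3", speaking_rate=1.0, pitch=0.0):
    """Build (but do not send) a POST request for v1/text:synthesize."""
    # Assumed request shape: text input, voice selection, and audio settings.
    body = {
        "input": {"text": text},
        "voice": {"languageCode": language_code},
        "audioConfig": {
            "audioEncoding": audio_encoding,
            "speakingRate": speaking_rate,
            "pitch": pitch,
        },
    }
    return urllib.request.Request(
        BASE_URL + "v1/text:synthesize",
        data=json.dumps(body).encode("utf-8"),
        headers={
            # Assumption: OAuth-style bearer token authentication.
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json; charset=utf-8",
        },
        method="POST",
    )


def decode_audio(response_body):
    """Return raw audio bytes from a JSON response body.

    Assumes the audio arrives base64-encoded in an 'audioContent' field.
    """
    payload = json.loads(response_body)
    return base64.b64decode(payload["audioContent"])
```

To actually call the service, the request could be passed to `urllib.request.urlopen(req)` and the bytes from `resp.read()` handed to `decode_audio`, then written to a file such as `output.mp3`. Keeping request construction separate from sending makes the payload easy to inspect and test offline.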