Text-to-Speech (REST)
Synchronous text-to-speech with full audio returned in one response.
POST
TTS Inference
Currently in beta. You’re on the priority waitlist and among the first to get access.
Overview
Get the complete synthesized audio in one response. Best for downloads or batch processing. For streaming playback, see TTS Streaming or TTS Realtime.
Passing numbers, IDs, dates, or currency as raw strings causes mispronunciations. See the Input Formatting Guide for correct formatting of phone numbers, account numbers, PINs, Aadhaar, vehicle registration numbers, GSTIN, currency, and more.
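To illustrate the kind of pre-formatting involved, the sketch below spells out a digit string one digit at a time so that an ID or PIN is not read as a single large number. The helper name `spell_digits` and its output convention are assumptions made for this example, not rules taken from the Input Formatting Guide, which remains the authority on the expected format.

```python
# Hypothetical pre-processing helper; not part of the Vachana API or SDK.
DIGIT_WORDS = {
    "0": "zero", "1": "one", "2": "two", "3": "three", "4": "four",
    "5": "five", "6": "six", "7": "seven", "8": "eight", "9": "nine",
}


def spell_digits(raw: str) -> str:
    """Return the digits of `raw` as space-separated words.

    Characters that are not ASCII digits (spaces, dashes) are skipped.
    """
    return " ".join(DIGIT_WORDS[ch] for ch in raw if ch in DIGIT_WORDS)


print(spell_digits("4821"))  # four eight two one
```

Whether spelled-out digits are the right form for a given field (currency and dates usually need different handling) should be checked against the guide before relying on a helper like this.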
Available Voices
| Voice | Gender | Description |
|---|---|---|
| Karan | Male | Bold, Trustworthy |
| Simran | Female | Confident, Bright |
| Nara | Female | Gentle, Expressive |
| Riya | Female | Cheerful, Energetic |
| Viraj | Male | Commanding, Dynamic |
| Raju | Male | Grounded, Conversational |
Python SDK
The official Python SDK lets you synthesize speech in one line, without constructing JSON payloads or handling binary audio responses manually.
Installation
Authentication
The TTS client requires only your API key.
Synthesize Speech
The `synthesize` method returns the complete audio as bytes, which you can write to a file or pass directly to an audio player.
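Writing the returned bytes to disk can be sketched as follows. This is a minimal example that uses only the standard library: `save_audio` is a hypothetical helper, and `placeholder_audio` stands in for the bytes that the SDK's `synthesize` call would return, since that call is not reproduced here.

```python
import tempfile
from pathlib import Path


def save_audio(audio: bytes, path) -> int:
    """Write audio bytes to `path` in binary mode; return the byte count."""
    if not audio:
        raise ValueError("no audio data to write")
    return Path(path).write_bytes(audio)


# Stand-in for the bytes returned by synthesize (assumption for this sketch).
placeholder_audio = b"\x00\x01" * 4

# A temporary directory keeps the example free of side effects.
with tempfile.TemporaryDirectory() as tmp:
    target = Path(tmp) / "speech_output"
    print(save_audio(placeholder_audio, target))  # 8
```

In real use, give the file an extension that matches the container format you requested so that audio players recognize it.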
Custom Audio Config
Control the sample rate, encoding, and container format of the output audio.
List Available Voices
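Independently of the SDK call, the voices from the Available Voices table can be kept in a small local lookup, for instance to validate a voice name before sending a request. The sketch below makes that concrete. `Voice`, `VOICES`, and `voices_by_gender` are names invented for this example, and the list is transcribed from the table rather than fetched from the API.

```python
from typing import NamedTuple


class Voice(NamedTuple):
    name: str
    gender: str
    description: str


# Transcribed from the Available Voices table; not fetched from the API.
VOICES = [
    Voice("Karan", "Male", "Bold, Trustworthy"),
    Voice("Simran", "Female", "Confident, Bright"),
    Voice("Nara", "Female", "Gentle, Expressive"),
    Voice("Riya", "Female", "Cheerful, Energetic"),
    Voice("Viraj", "Male", "Commanding, Dynamic"),
    Voice("Raju", "Male", "Grounded, Conversational"),
]


def voices_by_gender(gender: str) -> list[str]:
    """Return voice names whose gender matches, ignoring case."""
    return [v.name for v in VOICES if v.gender.lower() == gender.lower()]


print(voices_by_gender("female"))  # ['Simran', 'Nara', 'Riya']
```

A hard-coded list goes stale when voices are added or removed, so treat it as a convenience and prefer the live list from the API where available.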
Supported Languages
The Vachana API supports 2 languages.
| Language | Native Script | Example |
|---|---|---|
| English | Latin | “I am going to the market” |
| Hindi | Devanagari (हिन्दी) | “मैं बाज़ार जा रहा हूँ” |
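Because the two supported languages use different scripts, input text can be classified before it is sent by checking for characters in the Unicode Devanagari block (U+0900 to U+097F). The sketch below shows one way to do that. `guess_language` is a hypothetical helper, and whether the API expects an explicit language parameter at all is not stated here.

```python
def guess_language(text: str) -> str:
    """Return 'Hindi' if any character is Devanagari, otherwise 'English'."""
    # U+0900..U+097F is the Unicode Devanagari block.
    if any("\u0900" <= ch <= "\u097f" for ch in text):
        return "Hindi"
    return "English"


print(guess_language("I am going to the market"))  # English
print(guess_language("मैं बाज़ार जा रहा हूँ"))  # Hindi
```

Text that mixes both scripts is reported as Hindi by this check, and Hindi written in Latin script is reported as English, so treat the result as a heuristic.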
Headers
Body
application/json
Request body for TTS inference.
Response
Successful audio synthesis
The response is of type file.