Text-to-Speech (Streaming)
Stream audio in chunks as it’s generated via Server-Sent Events.
POST
TTS Stream
Currently in beta. You’re on the priority waitlist and among the first to get access.
Overview
Receive audio in chunks as it’s generated, so playback can start immediately. This reduces latency compared to TTS REST.
Passing numbers, IDs, dates, or currency as raw strings causes mispronunciations. See the Input Formatting Guide for correct formatting of phone numbers, account numbers, PINs, Aadhaar, vehicle registration numbers, GSTIN, currency, and more.
Available Voices
| Voice | Gender | Description |
|---|---|---|
| Karan | Male | Bold, Trustworthy |
| Simran | Female | Confident, Bright |
| Nara | Female | Gentle, Expressive |
| Riya | Female | Cheerful, Energetic |
| Viraj | Male | Commanding, Dynamic |
| Raju | Male | Grounded, Conversational |
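A client may want to check a voice name before sending a request. The sketch below mirrors the table above as a lookup; `VOICES`, `voices_by_gender`, and `require_voice` are hypothetical helper names, and whether the API treats voice names as case-sensitive is an assumption not confirmed on this page.

```python
from typing import Dict, List, Tuple

# Voice name -> (gender, description), copied from the table above.
VOICES: Dict[str, Tuple[str, str]] = {
    "Karan": ("Male", "Bold, Trustworthy"),
    "Simran": ("Female", "Confident, Bright"),
    "Nara": ("Female", "Gentle, Expressive"),
    "Riya": ("Female", "Cheerful, Energetic"),
    "Viraj": ("Male", "Commanding, Dynamic"),
    "Raju": ("Male", "Grounded, Conversational"),
}


def voices_by_gender(gender: str) -> List[str]:
    """Return voice names for a gender, in table order (case-insensitive match)."""
    wanted = gender.lower()
    return [name for name, (g, _) in VOICES.items() if g.lower() == wanted]


def require_voice(name: str) -> str:
    """Return `name` if it is a listed voice, else raise ValueError.

    Assumption: names are matched exactly as spelled in the table.
    """
    if name not in VOICES:
        raise ValueError(f"Unknown voice {name!r}; expected one of {sorted(VOICES)}")
    return name
```

Failing early on an unknown voice avoids a round trip to the server for a request that would presumably be rejected.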
Python SDK
The SDK’s streaming client handles SSE parsing and chunk reassembly for you; you just iterate and write.
Installation
Authentication
Stream Audio to a File
synthesize_stream yields audio chunks as they arrive. Playback or writing can begin before the full response is complete.
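The iterate-and-write pattern can be sketched as follows. This is a minimal illustration, assuming `synthesize_stream` yields raw `bytes` chunks; `write_audio_stream` and `fake_chunks` are hypothetical names, and `fake_chunks` only stands in for the SDK stream so the example runs without credentials.

```python
from typing import Iterable, Iterator


def write_audio_stream(chunks: Iterable[bytes], path: str) -> int:
    """Write audio chunks to `path` as they arrive; return total bytes written."""
    total = 0
    with open(path, "wb") as f:
        for chunk in chunks:
            if not chunk:  # skip empty chunks
                continue
            f.write(chunk)
            total += len(chunk)
    return total


def fake_chunks() -> Iterator[bytes]:
    """Hypothetical stand-in for the SDK stream; yields a few byte chunks."""
    yield b"RIFF"
    yield b""
    yield b"data"
```

In real use, the first argument would be the iterator returned by the SDK call rather than `fake_chunks()`. Because the file is written chunk by chunk, a player reading the same file can start before the stream ends.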
With Custom Audio Config
Supported Languages
The Vachana API supports two languages.
| Language | Native Script | Example |
|---|---|---|
| English | Latin | “I am going to the market” |
| Hindi | Devanagari (हिन्दी) | “मैं बाज़ार जा रहा हूँ” |
Headers
Body
application/json
Request body for TTS inference.
Response
Successful Server-Sent Events stream
The response is of type string.
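Clients that do not use the SDK have to decode the stream themselves. The sketch below parses generic Server-Sent Events framing (events end at a blank line, payloads arrive in `data:` fields) from an iterable of text lines. It assumes, without confirmation from this page, that each event’s `data` string is a base64-encoded audio chunk; `iter_sse_data` and `decode_audio_events` are hypothetical helper names.

```python
import base64
from typing import Iterable, Iterator, List


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield the data payload of each complete SSE event."""
    buf: List[str] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":  # a blank line terminates the current event
            if buf:
                yield "\n".join(buf)
                buf = []
            continue
        if line.startswith(":"):  # comment line, often used as keep-alive
            continue
        field, _, value = line.partition(":")
        if field == "data":
            # SSE strips a single leading space after the colon.
            buf.append(value[1:] if value.startswith(" ") else value)
    # An event not terminated by a blank line is dropped, as in the SSE format.


def decode_audio_events(lines: Iterable[str]) -> Iterator[bytes]:
    """Decode each event payload to bytes (assumption: payloads are base64)."""
    for payload in iter_sse_data(lines):
        yield base64.b64decode(payload)
```

If the actual payload is JSON or another encoding, only `decode_audio_events` needs to change; the framing logic in `iter_sse_data` stays the same.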