TTS Inference
Voice Cloning
Voice Cloned TTS (REST)
Generate cloned voice audio in a single synchronous response.
POST
TTS Inference
Overview
Step 1 — Generate a voice embedding
Upload a reference audio clip to get a
speaker_embedding. See Voice Clone Embeddings.speaker_embedding obtained from Voice Clone Embeddings along with your text. The full audio is returned in one response. For streaming playback, see Voice Cloning Streaming or Voice Cloning Realtime.
Endpoint
Authentication
| Header | Required | Description | Example |
|---|---|---|---|
X-API-Key-ID | Yes | Your API key for authentication | your-api-key-id |
Content-Type | Yes | Must be application/json | application/json |
Request Body
The text to synthesize into speech
Voice cloning model to use. Currently supported:
vachana-vc-v1Audio output configuration
Voice clone embedding obtained from the Voice Clone Embeddings endpoint
Response
Returns binary audio data in the format specified byaudio_config.container:
audio/wavfor WAV filesaudio/mpegfor MP3 filesaudio/oggfor OGG files
Example Request
Error Responses
Invalid text or audio configuration
Rate limit exceeded
Unexpected error occurred
Headers
Body
application/json
Request body for TTS inference.
Response
Successful audio synthesis
The response is of type file.