Voice Clone Embeddings
Voice Cloning
Voice Clone Embeddings
Generate voice clone embeddings from an audio file.
POST
Voice Clone Embeddings
Voice Cloning Flow
Voice cloning is a two-step process. Complete Step 1 once per voice, then reuse the embedding across any synthesis endpoint.Generate a voice embedding — this page
Upload 5–30 seconds of clean reference audio to extract a
speaker_embedding. Cache the result — you only need to generate it once per voice.Overview
Generate aspeaker_embedding from a reference audio clip. Upload the file and receive a multi-dimensional embedding you can pass to any Voice Cloned TTS endpoint.Headers
API Key ID for authentication
Body
multipart/form-data
The audio file to generate embeddings for