Voice cloning

Your voice, on demand

Clone a voice from a single recording of about 60 seconds with studio-grade fidelity, then generate consistent narration across every project. Available on Pro and Scale.

One short sample

From 60 seconds to a full voice model

No studio day, no re-records. Capture roughly a minute of clean speech and Voice Production AI trains a reusable voice model you can call on at any time. That makes it a good fit for keeping a brand voice consistent across thousands of renders.
  • A single ~60-second clean recording is all it takes
  • Models train in minutes, not days
  • Reuse the cloned voice across projects and the API
Voice Production AI

Fidelity

Studio-grade likeness

Clones preserve the timbre, cadence, and character that make a voice recognizable, close enough that listeners can't tell the clone from the original talent. Pair a clone with SSML to direct pacing and emphasis as you would in a session.
  • Captures timbre and cadence, not just words
  • Directable with the same SSML controls as preset voices
  • Consistent delivery across long-form and batch jobs
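
As an illustration of directing pacing and emphasis with SSML, here is a minimal Python sketch that assembles a small SSML document. The speak, prosody, emphasis, and break elements come from the W3C SSML standard; which of them Voice Production AI accepts is an assumption, and the build_ssml helper is a hypothetical name, not part of any product SDK.

```python
import xml.etree.ElementTree as ET


def build_ssml(lead: str, emphasized: str, trail: str,
               rate: str = "95%", pause_ms: int = 400) -> str:
    """Return an SSML string: slower pacing, one stressed phrase, one pause.

    Hypothetical helper for illustration only. The element names are
    standard SSML; support for each one in a given product is assumed.
    """
    speak = ET.Element("speak")
    # <prosody rate="..."> controls the speaking pace of everything inside it.
    prosody = ET.SubElement(speak, "prosody", rate=rate)
    prosody.text = f"{lead} "
    # <emphasis> stresses a single phrase, such as a brand name.
    emphasis = ET.SubElement(prosody, "emphasis", level="strong")
    emphasis.text = emphasized
    # <break> inserts a pause before the closing sentence.
    pause = ET.SubElement(prosody, "break", time=f"{pause_ms}ms")
    pause.tail = f" {trail}"
    # ElementTree escapes characters such as & and < in the text for us.
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_ssml("Welcome to", "Acme.", "Your story starts here."))
```

Building the markup with an XML library rather than string concatenation keeps the document well-formed even when a script contains characters such as an ampersand.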

Plans & limits

Clone as your library grows

Voice cloning is a paid capability. Start on Pro for up to 10 cloned voices, or move to Scale for up to 100 — ideal for agencies managing a distinct voice per client. Commercial usage rights are included on both paid plans.
  • Pro: up to 10 cloned voices, commercial rights included
  • Scale: up to 100 cloned voices, plus API access
  • Free plan uses preset voices only (no cloning)
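
For teams tracking how many clones they have left, the plan limits above can be captured in a few lines of Python. This is only a sketch: the limits are the ones stated on this page, while the PLAN_CLONE_LIMITS table and the can_add_clone function are hypothetical names for illustration, not part of the product.

```python
# Cloned-voice limits per plan, as stated on this page.
# The Free plan uses preset voices only, so its limit is zero.
PLAN_CLONE_LIMITS = {"free": 0, "pro": 10, "scale": 100}


def can_add_clone(plan: str, existing_clones: int) -> bool:
    """Return True if a workspace on `plan` has room for one more clone.

    Hypothetical helper: raises KeyError for an unknown plan name.
    """
    limit = PLAN_CLONE_LIMITS[plan.lower()]
    return existing_clones < limit


if __name__ == "__main__":
    for plan, used in [("free", 0), ("pro", 9), ("pro", 10), ("scale", 42)]:
        print(plan, used, can_add_clone(plan, used))
```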

How it works

Record, upload, clone, generate

1. Record a clean sample

Capture roughly 60 seconds of clear speech in a quiet room. The cleaner the input, the more faithful the clone.

2. Upload to your workspace

Drop the recording into Voice Production AI. We process it securely inside your own workspace, isolated from other tenants.

3. Train the clone

Voice Production AI builds a studio-grade voice model from the sample, typically ready in minutes, with no audio engineering required.

4. Generate on demand

Use the cloned voice anywhere you use a preset voice: scripts, batch jobs, and the API, with SSML direction.
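
Before uploading, it can help to confirm that a recording is close to the recommended length. The sketch below uses only Python's standard wave module and assumes the sample is an uncompressed WAV file. The 15-second tolerance is an arbitrary assumption for illustration, not a product requirement, and both function names are hypothetical.

```python
import wave


def sample_duration_seconds(path: str) -> float:
    """Return the length of an uncompressed WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        # Duration is the number of audio frames divided by frames per second.
        return wav.getnframes() / wav.getframerate()


def is_usable_sample(path: str, target: float = 60.0,
                     tolerance: float = 15.0) -> bool:
    """Return True if the recording is within `tolerance` seconds of `target`.

    The 60-second target comes from this page; the tolerance is an
    assumption chosen for the example.
    """
    return abs(sample_duration_seconds(path) - target) <= tolerance


if __name__ == "__main__":
    import sys
    if len(sys.argv) == 2:
        seconds = sample_duration_seconds(sys.argv[1])
        verdict = "ok" if is_usable_sample(sys.argv[1]) else "re-record"
        print(f"{seconds:.1f}s: {verdict}")
```

A length check says nothing about noise or microphone quality, so it complements rather than replaces listening to the sample.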

Responsible cloning

A cloned voice is a likeness of a real person. You must own the voice or have explicit, documented consent to clone it. Voice Production AI gates cloning to paid plans and prohibits impersonation, fraud, and deceptive use.

  • Only clone a voice you own or have explicit, documented consent to use
  • Cloning is gated to paid plans, with audit-friendly workspace controls
  • Cloned voices stay scoped to your workspace — never shared across tenants
  • We prohibit impersonation, fraud, and deceptive use in our terms

Voice cloning FAQ

How much audio do I need to clone a voice?

A single clean recording of about 60 seconds is enough to produce a high-fidelity clone. A quiet room and a consistent microphone meaningfully improve the result.

Clone your first voice today

Voice cloning is included on Pro and Scale. Upgrade in seconds and train a model from a single recording.