Report Bug / Feature Request

Chatterbox TTS

Resemble AI's state-of-the-art zero-shot voice cloning model with independent emotion control.

Text
Files

0/500 characters · Sign up for 5,000 per generation →

SSML Mode (Speech Synthesis Markup Language for fine control)

Wrap your text in SSML tags for precise control:

<speak><prosody rate="slow">Slow speech</prosody></speak>

Emotion / Style Tags

Tags the selected model understands — click to drop one into your text where it happens:

Pronunciation Dictionary

Define custom pronunciations (word = pronunciation):

Pitch 0

-12 +12

AI Model

Voice

Language

Output Format

Speed 1.0x

0.5x 2.0x

Free with Piper, VITS, MeloTTS

Your generated audio will appear here. Choose a model, enter text, and click Generate.

About Chatterbox

Chatterbox by Resemble AI is a leading open-source zero-shot voice cloning model that replicates a voice from a single audio sample, capturing not just timbre but speaking style and emotional nuance. Its distinctive feature is fine-grained emotion control that operates independently of the voice identity, so you can keep a cloned voice but shift its emotional intensity. Built around ResembleEnhance and flow matching, it targets professional-grade cloning for content creation, dubbing, and character voices. Released under the permissive MIT license, Chatterbox has become a popular foundation for derivative models — TTS.ai also runs a Saudi-Arabic fine-tune of it. It favors quality, with a modest per-request character limit.

Best for: Professional voice cloning with emotional control, content creation

Browse all Chatterbox voices

At a glance

Developer: Resemble AI
License: MIT
Tier: premium
Speed: medium
Voice cloning: Yes
Languages: English
Max characters: 300

Chatterbox voices

Default

English

Premium Neutral

Chatterbox TTS — FAQ

A single audio sample is enough. Chatterbox is a zero-shot cloning model, so it captures a voice — including its style and emotional nuance — from one reference clip without any fine-tuning.

It offers fine-grained emotion control that works independently from the voice identity, letting you adjust the emotional tone of the output while keeping the same cloned voice.

Yes. Chatterbox is released by Resemble AI under the MIT license, which permits commercial use, and it serves as the base for several fine-tuned derivative models.

← All voices

Chatterbox TTS

Love TTS.ai? Tell your friends!

About Chatterbox

At a glance

Chatterbox voices

Default

Chatterbox TTS — FAQ

How much audio does Chatterbox need to clone a voice?

What is Chatterbox's emotion control?

Is Chatterbox free to use commercially?