Chatterbox Turbo

Chatterbox Turbo TTS

A faster Chatterbox with sub-200ms latency and inline paralinguistic tags for laughs, coughs, and chuckles.

Chatterbox Turbo is Resemble AI's 350M-parameter speed-focused upgrade to Chatterbox, reaching up to 6x real-time generation with sub-200-millisecond latency. It keeps the original's voice cloning while adding inline paralinguistic tags — you can drop [laugh], [cough], or [chuckle] directly into your text and have the model perform them. Every generation carries Perth watermarking for provenance tracking, a nod to responsible-AI deployment. The combination of low latency and expressive non-speech sounds makes it well suited to real-time voice agents and interactive characters. Like the original Chatterbox, it is MIT-licensed and English-focused.

At a glance

Developer
Resemble AI
License
MIT
Tier
standard
Speed
fast
Voice cloning
Yes
Languages
English
Max characters
1000

Chatterbox Turbo AI Voices

Default

English
Standardne Neutral
Kasutamine

Best for

Real-time voice agents, expressive speech with natural sounds

Chatterbox Turbo TTS — FAQ

Turbo is a 350M-parameter model that runs at up to 6x real-time with sub-200ms latency, making it suitable for real-time voice agents where the original Chatterbox would be too slow.

They are inline cues like [laugh], [cough], and [chuckle] that you place directly in the text. The model performs the corresponding non-speech sounds, adding natural expressiveness.

Yes. All generated audio includes Perth watermarking, which supports provenance tracking and helps identify the output as AI-generated.
← All voices