Chatterbox Turbo

Chatterbox Turbo TTS

A faster Chatterbox with sub-200ms latency and inline paralinguistic tags for laughs, coughs, and chuckles.

Chatterbox Turbo is Resemble AI's 350M-parameter speed-focused upgrade to Chatterbox, reaching up to 6x real-time generation with sub-200-millisecond latency. It keeps the original's voice cloning while adding inline paralinguistic tags — you can drop [laugh], [cough], or [chuckle] directly into your text and have the model perform them. Every generation carries Perth watermarking for provenance tracking, a nod to responsible-AI deployment. The combination of low latency and expressive non-speech sounds makes it well suited to real-time voice agents and interactive characters. Like the original Chatterbox, it is MIT-licensed and English-focused.

At a glance

Developer
Resemble AI
License
MIT
Tier
standard
Speed
fast
Voice cloning
Yes
Languages
English
Max characters
1000

Chatterbox Turbo AI Voices

Default

English
অবিকল্পিত Neutral
ব্যৱহাৰ কৰক

Best for

Real-time voice agents, expressive speech with natural sounds

Chatterbox Turbo TTS — FAQ

Turbo is a 350M-parameter model that runs at up to 6x real-time with sub-200ms latency, making it suitable for real-time voice agents where the original Chatterbox would be too slow.

They are inline cues like [laugh], [cough], and [chuckle] that you place directly in the text. The model performs the corresponding non-speech sounds, adding natural expressiveness.

Yes. All generated audio includes Perth watermarking, which supports provenance tracking and helps identify the output as AI-generated.
← All voices