Pocket TTS

Cosette

Free English Female Pocket TTS

Cosette is a female AI voice powered by the Pocket TTS text-to-speech model. This free-tier voice speaks English and delivers high-quality speech synthesis. With near-instant generation speed and a quality rating of 4/5, Cosette is well-suited for lightweight deployment, cpu-only environments, quick voice cloning. The Pocket TTS engine is developed by Kyutai under the MIT license, making it safe for commercial use. Key capabilities include: 100m parameters, cpu inference, voice cloning, single-sample cloning, edge-ready. The Pocket TTS model also supports voice cloning — upload a short audio sample to create a custom voice that retains the same quality characteristics.

No ratings yet

Pocket TTSModel Information

Model Pocket TTS
Developer Kyutai
Quality
Speed Fast
License MIT
Cloning Supported
Tier Free (no characters used)
Parameters 100M
Architecture Transformer + Mimi Codec
Training Data 50000 hours
Year 2025

Best Use Cases for Cosette

Recommended applications based on this voice's characteristics

Audiobooks & Narration

Use Cosette to narrate long-form content with natural prosody and expression.

Video Voiceovers

Add professional narration to YouTube videos, ads, and social media content.

Apps & Accessibility

Fast generation makes this voice ideal for real-time apps, screen readers, and accessibility tools.

Custom Brand Voice

Clone this voice style with your own audio to create a unique branded TTS voice.

More Pocket TTS Voices

Other voices from the same TTS model

Alba

English Female

Azelma

English Female

Eponine

English Female

Fantine

English Female

Javert

English Male

Jean

English Male

Frequently Asked Questions

Pocket TTS by Kyutai (creators of Moshi) is a compact 100M parameter text-to-speech model that punches well above its weight. It runs efficiently on CPU, supports zero-shot voice cloning from a single audio sample, and produces natural-sounding speech. The small model size makes it ideal for edge deployment and low-resource environments.

Pocket TTS was developed by Kyutai and is released under the MIT license, which permits commercial use of generated audio.

Pocket TTS supports 2 languages: English, French.

Pocket TTS is in the Free tier — free — no credits required. You can preview any Pocket TTS voice for free before generating full audio.

Pocket TTS has very fast generation speed. It runs in near real-time, making it suitable for streaming and interactive applications.

Pocket TTS is rated 4/5 for audio quality on TTS.ai. It produces high-quality, natural-sounding speech.

Yes, Pocket TTS supports zero-shot voice cloning. Upload 5-30 seconds of reference audio to create a custom voice.

Yes, Pocket TTS is specifically recommended for lightweight deployment, cpu-only environments, quick voice cloning. Its 100m parameters, cpu inference, voice cloning capabilities make it an excellent choice for this use case.

Yes, Pocket TTS is licensed under MIT, which allows commercial use. Audio generated with Pocket TTS voices can be used in videos, podcasts, apps, games, and any other commercial project.

Yes, all voices on TTS.ai use commercially-licensed open-source models (MIT, Apache 2.0). The generated audio is yours to use in videos, podcasts, apps, games, and any other commercial application.

Send a POST request to /api/v1/tts/ with the model name and voice ID. See our API Documentation page for code examples in Python, JavaScript, Go, and cURL.

Yes, click the play button on this page to hear a sample. You can also type custom text on the Text to Speech page and generate a free preview with any voice.

Try Cosette Now

Type any text and hear it spoken by Cosette. Free to use with no characters required.