Kani TTS 2

An ultra-lightweight 400M-parameter English model that runs in 3GB of VRAM at a real-time factor of about 0.2.

Kani-TTS-2 by NineNineSix is an ultra-lightweight 400M-parameter text-to-speech model built on a Liquid AI LFM2 backbone with NVIDIA's NanoCodec. It runs in 3GB of VRAM and generates roughly ten seconds of speech in about two seconds on an A100, a real-time factor (generation time divided by audio duration) near 0.2. The current public release ships an English-only checkpoint and, unlike its predecessor, does not expose the speaker-embedding hook needed for voice cloning. Its strength is fast, low-cost English generation on modest hardware, which makes it a good fit for quick previews and high-volume English narration. It is released under Apache 2.0 and offered on the free tier.
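The real-time factor quoted above is simple arithmetic, and a short sketch may make it easier to estimate generation time for longer jobs. The function names here are illustrative and not part of any Kani TTS API. The estimate assumes throughput stays roughly constant at the quoted A100 figure, which may not hold on other hardware.

```python
def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """Real-time factor: time spent generating divided by audio duration produced."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


def estimated_synthesis_seconds(audio_seconds: float, rtf: float) -> float:
    """Rough generation time for a given amount of audio at a fixed RTF (an assumption)."""
    return audio_seconds * rtf


# Figures quoted on this page: about 2 s of compute for about 10 s of speech.
rtf = real_time_factor(synthesis_seconds=2.0, audio_seconds=10.0)

# Extrapolating to one hour of narration at the same rate.
one_hour_estimate = estimated_synthesis_seconds(audio_seconds=3600.0, rtf=rtf)
print(f"RTF ~ {rtf:.2f}, one hour of audio ~ {one_hour_estimate / 60:.0f} min of compute")
```

A value below 1.0 means speech is generated faster than it plays back. At the quoted rate, an hour of narration would take on the order of twelve minutes of A100 time.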

At a glance

Developer: NineNineSix
License: Apache 2.0
Tier: free
Speed: fast
Voice cloning: No
Languages: English
Max characters: 1000
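
For long narration, the 1000-character limit listed above means the input text has to be split before synthesis. The following is a minimal sketch of one way to do that. It assumes the limit applies per request, and `split_for_tts` is a hypothetical helper, not something provided by Kani TTS 2. It prefers sentence boundaries and falls back to a hard cut only when a single sentence exceeds the limit.

```python
import re

MAX_CHARS = 1000  # limit listed in the table above; assumed to apply per request


def split_for_tts(text: str, max_chars: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most max_chars, preferring sentence boundaries."""
    # Split after ., ! or ? when followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # A single sentence longer than the limit is cut hard (possibly mid-word).
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            # The sentence does not fit; close the current chunk and start a new one.
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


chunks = split_for_tts("Hello world. " * 200)
print(len(chunks), [len(c) for c in chunks])
```

Each chunk can then be sent as a separate request and the resulting audio concatenated in order.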

Kani TTS 2 AI Voices

Default

English
Default Neutral

Best for

Fast English generation on low-VRAM hardware, quick previews

Kani TTS 2 FAQ

Kani TTS 2 is fast and light because of its 400M-parameter LFM2 backbone and NanoCodec: it runs in 3GB of VRAM and produces about ten seconds of speech in roughly two seconds on an A100, a real-time factor near 0.2.

Voice cloning is not supported: the current v2 release removed the public speaker-embedding hook. For cloning, use Chatterbox, IndexTTS-2, or GPT-SoVITS.

Kani TTS 2 supports English only. The public release ships a single English checkpoint; for non-English speech, use a model such as Kokoro or MeloTTS.