Kani TTS 2 TTS
An ultra-lightweight 400M English model that runs in just 3GB of VRAM at a 0.2 real-time factor.
Kani-TTS-2 by NineNineSix is an ultra-lightweight 400M-parameter text-to-speech model built on a Liquid AI LFM2 backbone with NVIDIA's NanoCodec. It runs in just 3GB of VRAM and generates roughly ten seconds of speech in about two seconds on an A100 — a real-time factor near 0.2. The current public release ships an English-only checkpoint and, unlike its predecessor, does not expose the speaker-embedding hook needed for voice cloning. Its strength is fast, low-cost English generation on modest hardware, which makes it a good fit for quick previews and high-volume English narration. It is released under Apache 2.0 and offered on the free tier.
At a glance
- Developer
- NineNineSix
- License
- Apache 2.0
- Tier
- free
- Speed
- fast
- Voice cloning
- No
- Languages
- English
- Max characters
- 1000
Kani TTS 2 voices
Best for
Fast English generation on low-VRAM hardware, quick previews