KhanomTan TTS

KhanomTan TTS TTS

An open, commercially-licensed Thai-first text-to-speech model with multiple speaker voices.

KhanomTan TTS was built by Thai NLP contributor Wannaphong Phatthiyaphaibun on top of the YourTTS multilingual VITS architecture, and trained on CC0 and other permissively-licensed Thai corpora including TSync. Where most open TTS models only cover Thai under research-only or non-commercial terms, KhanomTan ships under Apache 2.0, making it a rare commercially-safe choice for the language. It offers a small roster of speaker voices and runs fast at roughly five seconds per generation in around 2GB of VRAM. It fits Thai voiceovers, narration for Thai-language apps, and accessibility tooling where licensing clarity matters.

At a glance

Developer
Wannaphong Phatthiyaphaibun
License
Apache 2.0
Tier
standard
Speed
fast
Voice cloning
No
Languages
Thai
Max characters
500

KhanomTan TTS AI Voices

Bernard

Thai
Standar Male
Nggunakake

Default

Thai
Standar Neutral
Nggunakake

Kerstin

Thai
Standar Female
Nggunakake

Linda

Thai
Standar Female
Nggunakake

Thorsten

Thai
Standar Male
Nggunakake

Best for

Thai voiceovers, Thai-language content and apps

KhanomTan TTS TTS — FAQ

Yes. KhanomTan is released under the Apache 2.0 license, so it can be used commercially — which is unusual for open Thai TTS, where most alternatives carry non-commercial restrictions.

On TTS.ai it is configured for Thai. The underlying YourTTS architecture is multilingual, but the deployed model is focused on natural Thai pronunciation and prosody.

No. KhanomTan does not support voice cloning here; it generates speech from its built-in speaker voices. For cloning, use a model like Chatterbox, GPT-SoVITS, or IndexTTS-2.
← All voices