Free AI Tọghata ngwe ka ọsụsọ
33+ Open-source models, 273+ ụda 33+ Achọrọghị akaụntụ.
Ihe niile ịchọrọ maka ụda AI
30+ tools powered by open-source AI models
33+ Ụdị ụda AI
Nchịkọta zuru ezu nke open-source TTS models na mbido otu
Kokoro Free
Kokoro bụ 82 nde parameters ngwe-na-asị model nke punches mma n'elu ya weightclass. N'agbanyeghị ya obere size, ọ na-emepụta na-asị na-asị na-asị. Kokoro na-akwado asụsụ ndị ọzọ gụnyere English, Japanese, Chinese, na Korean na ụdị okwu ndị ọzọ. Ọ na-arụ ọrụ n'ụzọ dị mfe - na-emepụta ụda dị ka 100x n'ụzọ dị mfe karịa oge n'oge na GPU.
Ọkachasị maka: TTS nke dị elu na-enweghị mmebi, usoroiheomume ntụgharị
Nwalee
Piper Free
Piper bụ engine ngwe-na-asụsụ na-asụgharị nke Rhasspy na-eji VITS na larynx architectures. Ọ na-arụ ọrụ nke ọma na CPU, na-eme ka ọ dị mma maka ngwaọrụ edge, ụlọ ọrụ na-arụ ọrụ, na ngwa ọrụ chọrọ TTS na-enweghị njikọ. Na okwu 100 n'elu asụsụ 30 +, Piper na-enye okwu na-asụgharị na-asụgharị na-asụgharị na-asụgharị na Raspberry Pi 4.
Ọkachasị maka: Nlebiritụanya nkịtị, ikikembanye, nakwa usoroiheomume embedded
Nwalee
VITS Free
VITS (Variational Inference na-amụ ihe na-abịanụ maka ngwụcha-na-abịanụ Text-to-Speech) bụ ụzọ TTS na-abịanụ na-abịanụ nke na-emepụta ụda dị mma karịa ụdị abụọ nke ugbu a. Ọ na-ahọrọ ntụgharị dị iche iche na-agbakwunye na ntụgharị na usoro nkuzi na-abịanụ, na-enwetakwa mmelite dị mkpa na nghọta.
Ọkachasị maka: General-purpose text-to-speech na narịị prosody
Nwalee
MeloTTS Free
MeloTTS site na MyShell.ai bụ TTS multilingual library na-akwado English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, na Korean. Ọ dị ngwa ngwa, na-arụ ọrụ ngwe na-adịgide adịgide na CPU naanị. MeloTTS ejirila maka iji mmepụta na-akwado CPU na GPU inference.
Ọkachasị maka: Usoroiheomume mmepe na-achọ ngwa ngwa, TTS n'asụsụ dị iche iche
Nwalee
Kani TTS 2 Free
Kani-TTS-2 site na NineNineSix bụ ihe nlereanya nke ihe nlereanya 400M dị elu nke e wuru na backbone Liquid AI LFM2 na NVIDIA NanoCodec. Ọ na-arụ ọrụ na 3GB VRAM na-emepụta ~ 10 sekọnd nke okwu na ~ 2 sekọnd na A100 (RTF 0.2). N'oge a, ndị mmadụ na-ahapụ ụgbọ mmiri `kani-tts-2-en` na-eche echiche na-akọwapụta ihe nlereanya nke mkpa maka ịkọ okwu - jiri Chatterbox / IndexTTS2 / F5-TTS maka ịkọ, ma ọ bụ Kokoro / MeloTTS maka non-English.
Ọkachasị maka: Nhazi English n'ụzọ nkịtị na VRAM nke ala, nlebiritụanya n'ụzọ nkịtị
Nwalee
OuteTTS Free
OuteTTS na-eweta ụdị asụsụ dị ukwuu na ngwe-na-asụgharị n'oge na-echekwa ọdịnala. Ọ na-akwado ọtụtụ backends gụnyere llama.cpp (CPU / GPU), Hugging Face Transformers, ExLlamaV2, VLLM, na ọbụna ntụgharị ntụgharị na-eji Transformers.js. Ọ na-enyekwa ụda ụda site na profaịlụ ndị na-ekwu okwu echekwara dịka JSON.
Ọkachasị maka: Nhazi n'akụkụ, TTS nke na-adabere na brauịzaị, gburugburu ebe obibi nke na-enweghị uru
Nwalee
Pocket TTS Free
Pocket TTS site na Kyutai (ndị na-emepụta Moshi) bụ ụdị 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke
Ọkachasị maka: Nhazi dị n'okpuru, CPU-ọbụla gburugburu, ịkọgharị ụda n'ụzọ ngwa ngwa
Nwalee
Kitten TTS Free
Kitten TTS site na KittenML bụ ultra-lightweight ngwe-na-ekwu okwu model rụpụtara na ONNX. Na ụdị site na 15M ruo 80M parameters (25-80 MB na disk), ọ na-enye ụda ụda dị elu na CPU na-enweghị mkpa GPU. Features 8 built-in voices, adjustable speech speed, na built-in text preprocessing maka nọmba, ego, na yunit. Ideal maka edge deployment na low-latency applications.
Ọkachasị maka: Nnukwu TTS dị nro, ịkwado etiti, usoroiheomume nke na-echekwa obere
Nwalee
Ming-Omni TTS Free
Ming-omni-tts-0.5B site na inclusionAI bụ ụdị okwu omni-modal nke e wuru na backbone BailingMM na-eguzogide ọgwụ na-eguzogide ọgwụ na-eguzogide ọgwụ na-eguzogide ọgwụ. Na-enye 44.1kHz output (n'akụkụ CD quality), na-akwado ịkọ okwu nke ụda nke ụda site na 3 + nke abụọ reference, na-agụnye n'ime imewe / dialect / BGM control site na JSON instructions. Excellent stability - 0.83% WER on Chinese benchmarks.
Ọkachasị maka: Nkọwa asụsụ abụọ dị elu, ụda na-emetụta mmetụta, ihenhọrọ Chinese audiobook
Nwalee
MOSS-TTS Nano Free
MOSS-TTS-Nano-100M bụ OpenMOSS's compact 100M-paramita varians nke MOSS-TTS ezinụlọ, na-ekerịta delay-transformer architecture. Na-eweta 8B model's peak quality maka ~80x obere ibu na nke dị ala nke ukwuu per-request VRAM, na-eme ka ọ dị mma maka free-tier na elu-throughput deployments. 20-asụsụ dị ka nke ahụ.
Ọkachasị maka: Free-tier TTS, mmepụta nke ukwuu, ojiji nke na-emekọrịta ihe na-abawanye
Nwalee
Bark Dìfọ́ọ̀ltụ̀
Transform-based text-to-audio model nke na-emepụta okwu, egwu, na mmetụta ụda.
Debanye aha: Suno · Ikikere: MIT
Jiri ya
Bark Small Dìfọ́ọ̀ltụ̀
Ụdị dị n'okpuru nke Bark na-eji nghọta dị n'okpuru.
Debanye aha: Suno · Ikikere: MIT
Jiri ya
CosyVoice 2 Dìfọ́ọ̀ltụ̀
Alibaba's scalable streaming TTS na human-parity naturalness na nso-zero latency.
Debanye aha: Alibaba (Tongyi Lab) · Ikikere: Apache 2.0
Jiri ya
Dia TTS Dìfọ́ọ̀ltụ̀
Multi-speaker dialog generation model nke na-emepụta nchọgharị n'etiti ndị na-ekwu okwu.
Debanye aha: Nari Labs · Ikikere: Apache 2.0
Jiri ya
Parler TTS Dìfọ́ọ̀ltụ̀
Depụta ụda ịchọrọ n'asụsụ na-emeghị n'aka na Parler ga-eweta ụda dị n'otu.
Debanye aha: Hugging Face · Ikikere: Apache 2.0
Jiri ya
IndexTTS-2 Dìfọ́ọ̀ltụ̀
Zero-shot TTS na-ejikwa mmetụta uche nke dị mma nakwa n'ụzọ dị elu.
Debanye aha: Index Team · Ikikere: Bilibili Model License
Jiri ya
Spark TTS Dìfọ́ọ̀ltụ̀
Klọnaịsị ụda TTS n'ụdị ụda na-achịkwa ya nakwa n'ụdị okwu site n'ịjụjụ.
Debanye aha: SparkAudio · Ikikere: CC BY-NC-SA 4.0
Jiri ya
GPT-SoVITS Dìfọ́ọ̀ltụ̀
Few-shot ụda na-ebuli TTS nke na-ebuli ụda ọbụla site na sekọnd 5 nke ụda.
Debanye aha: RVC-Boss · Ikikere: MIT
Jiri ya
Orpheus Dìfọ́ọ̀ltụ̀
Human-level emotional TTS model trained on 100K hours of speech data.
Debanye aha: Canopy Labs · Ikikere: Llama 3.2 Community
Jiri ya
Qwen3 TTS Dìfọ́ọ̀ltụ̀
Alibaba's multilingual TTS na preset ụda na ụda nhazi site na ngwe.
Debanye aha: Alibaba (Qwen) · Ikikere: Apache 2.0
Jiri ya
VieNeu-TTS-v2 Dìfọ́ọ̀ltụ̀
Vietnamese + English kood-swap TTS na ụda 7 preset na ụda nke na-agaghị adọta. CPU-ọbụla, GPU ọbụla chọrọ.
Debanye aha: Phạm Nguyễn Ngọc Bảo · Ikikere: Apache 2.0
Jiri ya
Chatterbox Turbo Dìfọ́ọ̀ltụ̀
Chatterbox n'ụzọ nkịtị na sub-200ms latency na paralinguistic tags maka nnụnụ, nkụda mmụọ, na ndị ọzọ.
Debanye aha: Resemble AI · Ikikere: MIT
Jiri ya
VoxCPM Dìfọ́ọ̀ltụ̀
Tokenizer-free TTS na-eweta 44.1kHz ụda na n'ozuzu ya na-aghọta paragraf.
Debanye aha: OpenBMB · Ikikere: Apache 2.0
Jiri ya
VibeVoice Dìfọ́ọ̀ltụ̀
Móòdù Microsoft maka ihenhọrọ ndị na-ekwusa ọtụtụ ihe dị ka podcasts na audiobooks.
Debanye aha: Microsoft · Ikikere: MIT
Jiri ya
CosyVoice3 Dìfọ́ọ̀ltụ̀
TTS nke nsụgharị ọzọ na-asụ asụsụ abụọ na bi-streaming, nlekọta mmetụta, nakwa ịkọgharị ụda nke enweghị ntọala.
Debanye aha: Alibaba (FunAudioLLM) · Ikikere: Apache 2.0
Jiri ya
NAMAA Saudi TTS Dìfọ́ọ̀ltụ̀
Mepee TTS Saudi-Arabic mbụ. Naịlọn Saudi dialọg na-echekwa ụda Chatterbox-ọdịnaya.
Debanye aha: NAMAA Space · Ikikere: MIT
Jiri ya
Darwin TTS Dìfọ́ọ̀ltụ̀
Cross-modal Qwen3-TTS varians na FFN weights na-agbanye site na Qwen3-1.7B asụsụ model maka nsụgharị asụsụ dị iche iche dị ike.
Debanye aha: FINAL-Bench · Ikikere: Apache 2.0
Jiri ya
MOSS-TTSD Dìfọ́ọ̀ltụ̀
Multi-speaker dialog continuation model - mepụta podcast-style conversations na ruo 5 speakers na 60 nkeji nke coherent audio.
Debanye aha: OpenMOSS · Ikikere: Apache 2.0
Jiri ya
CosyVoice 2
Alibaba's scalable streaming TTS na human-parity naturalness na nso-zero latency.
Asụsụ: en, zh, ja, ko, fr, de, it, es
Kpọnye ụda
IndexTTS-2
Zero-shot TTS na-ejikwa mmetụta uche nke dị mma nakwa n'ụzọ dị elu.
Asụsụ: en, zh
Kpọnye ụda
Spark TTS
Klọnaịsị ụda TTS n'ụdị ụda na-achịkwa ya nakwa n'ụdị okwu site n'ịjụjụ.
Asụsụ: en, zh
Kpọnye ụda
GPT-SoVITS
Few-shot ụda na-ebuli TTS nke na-ebuli ụda ọbụla site na sekọnd 5 nke ụda.
Asụsụ: en, zh, ja, ko
Kpọnye ụda
Chatterbox
State-of-the-art zero-shot ụda ịkọsa na nchịkwa mmetụta site na Resemble AI.
Asụsụ: en
Kpọnye ụda
Tortoise TTS
Multi-voice text-to-speech na-atụle na mma na-eji autoregressive architecture.
Asụsụ: en
Kpọnye ụda
OpenVoice
Nkwado ụda na-akpaghị aka na nlekọta n'elu ụdị, mmetụta, nakwa ntụgharị.
Asụsụ: en, zh, ja, ko, fr, es
Kpọnye ụda
VieNeu-TTS-v2
Vietnamese + English kood-swap TTS na ụda 7 preset na ụda nke na-agaghị adọta. CPU-ọbụla, GPU ọbụla chọrọ.
Asụsụ: vi, en
Kpọnye ụda
Chatterbox Turbo
Chatterbox n'ụzọ nkịtị na sub-200ms latency na paralinguistic tags maka nnụnụ, nkụda mmụọ, na ndị ọzọ.
Asụsụ: en
Kpọnye ụda
VoxCPM
Tokenizer-free TTS na-eweta 44.1kHz ụda na n'ozuzu ya na-aghọta paragraf.
Asụsụ: en, zh
Kpọnye ụda
OuteTTS
LLM-n'okpuru TTS na-agbagharị na CPU, GPU, mọọbụ nchọgharị site na llama.cpp na Transformers.js.
Asụsụ: en
Kpọnye ụda
Pocket TTS
Lightweight 100M parameter model site na Kyutai na ụda na-ebuli site na saịmpọn.
Asụsụ: en, fr
Kpọnye ụda
CosyVoice3
TTS nke nsụgharị ọzọ na-asụ asụsụ abụọ na bi-streaming, nlekọta mmetụta, nakwa ịkọgharị ụda nke enweghị ntọala.
Asụsụ: en, zh, ja, ko, de, es, fr, it, ru
Kpọnye ụda
NAMAA Saudi TTS
Mepee TTS Saudi-Arabic mbụ. Naịlọn Saudi dialọg na-echekwa ụda Chatterbox-ọdịnaya.
Asụsụ: ar
Kpọnye ụda
Darwin TTS
Cross-modal Qwen3-TTS varians na FFN weights na-agbanye site na Qwen3-1.7B asụsụ model maka nsụgharị asụsụ dị iche iche dị ike.
Asụsụ: en, ko, ja, zh
Kpọnye ụda
MOSS-TTSD
Multi-speaker dialog continuation model - mepụta podcast-style conversations na ruo 5 speakers na 60 nkeji nke coherent audio.
Asụsụ: en, zh
Kpọnye ụda
Ming-Omni TTS
Compact 0.5B omni-modal okwu model site inclusionAI na elu-fidelity 44.1kHz output na zero-shot okwu cloning.
Asụsụ: en, zh
Kpọnye ụda
MOSS-TTS Nano
Tiny 100M MOSS-TTS variant - architecture dị ka, 80x obere, free-tier latency.
Asụsụ: en, zh, de, es, fr, ja, it, ko, ru, ar, pt
Kpọnye ụdaDeveloper-First API
OpenAI-compatible REST API. One endpoint, 22+ models. Streaming support for real-time applications.
- OpenAI-compatible format
- TTS na-edebata maka usoroiheomume oge n'eziokwu
- Nhazi batch maka ọrụ ndị dị ukwuu
- Ndesịta ozi ndị ahụ
pip install ttsai
npm install @ttsainpm/ttsai
from tts_ai import TTSClient
client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
text="Hello from TTS.ai!",
model="kokoro",
voice="af_bella",
)
client.save(audio, "output.mp3")
Ajụjụ ndị a na-ajụkarị
Gịnị ka anyị ga-eme ka ọ dịrị mma? Ntụziaka gị na-enyere anyị aka idozi nsogbu.