Free AI Tọghata ngwe ka ọsụsọ
31+ Open-source models, 231+ ụda 34+ Achọrọghị akaụntụ.
Ihe niile ịchọrọ maka ụda AI
30+ tools powered by open-source AI models
31+ Ụdị ụda AI
Nchịkọta zuru ezu nke open-source TTS models na mbido otu
Kokoro Free
Kokoro bụ 82 nde parameters ngwe-na-asị model nke punches mma n'elu ya weightclass. N'agbanyeghị ya obere size, ọ na-emepụta na-asị na-asị na-asị. Kokoro na-akwado asụsụ ndị ọzọ gụnyere English, Japanese, Chinese, na Korean na ụdị okwu ndị ọzọ. Ọ na-arụ ọrụ n'ụzọ dị mfe - na-emepụta ụda dị ka 100x n'ụzọ dị mfe karịa oge n'oge na GPU.
Ọkachasị maka: TTS nke dị elu na-enweghị mmebi, usoroiheomume ntụgharị
Nwalee
Piper Free
Piper bụ engine ngwe-na-asụsụ na-asụgharị nke Rhasspy na-eji VITS na larynx architectures. Ọ na-arụ ọrụ nke ọma na CPU, na-eme ka ọ dị mma maka ngwaọrụ edge, ụlọ ọrụ na-arụ ọrụ, na ngwa ọrụ chọrọ TTS na-enweghị njikọ. Na okwu 100 n'elu asụsụ 30 +, Piper na-enye okwu na-asụgharị na-asụgharị na-asụgharị na-asụgharị na Raspberry Pi 4.
Ọkachasị maka: Nlebiritụanya nkịtị, ikikembanye, nakwa usoroiheomume embedded
Nwalee
VITS Free
VITS (Variational Inference na-amụ ihe na-abịanụ maka ngwụcha-na-abịanụ Text-to-Speech) bụ ụzọ TTS na-abịanụ na-abịanụ nke na-emepụta ụda dị mma karịa ụdị abụọ nke ugbu a. Ọ na-ahọrọ ntụgharị dị iche iche na-agbakwunye na ntụgharị na usoro nkuzi na-abịanụ, na-enwetakwa mmelite dị mkpa na nghọta.
Ọkachasị maka: General-purpose text-to-speech na narịị prosody
Nwalee
MeloTTS Free
MeloTTS site na MyShell.ai bụ TTS multilingual library na-akwado English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, na Korean. Ọ dị ngwa ngwa, na-arụ ọrụ ngwe na-adịgide adịgide na CPU naanị. MeloTTS ejirila maka iji mmepụta na-akwado CPU na GPU inference.
Ọkachasị maka: Usoroiheomume mmepe na-achọ ngwa ngwa, TTS n'asụsụ dị iche iche
Nwalee
OuteTTS Free
OuteTTS na-eweta ụdị asụsụ dị ukwuu na ngwe-na-asụgharị n'oge na-echekwa ọdịnala. Ọ na-akwado ọtụtụ backends gụnyere llama.cpp (CPU / GPU), Hugging Face Transformers, ExLlamaV2, VLLM, na ọbụna ntụgharị ntụgharị na-eji Transformers.js. Ọ na-enyekwa ụda ụda site na profaịlụ ndị na-ekwu okwu echekwara dịka JSON.
Ọkachasị maka: Nhazi n'akụkụ, TTS nke na-adabere na brauịzaị, gburugburu ebe obibi nke na-enweghị uru
Nwalee
Pocket TTS Free
Pocket TTS site na Kyutai (ndị na-emepụta Moshi) bụ ụdị 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke
Ọkachasị maka: Nhazi dị n'okpuru, CPU-ọbụla gburugburu, ịkọgharị ụda n'ụzọ ngwa ngwa
Nwalee
Kitten TTS Free
Kitten TTS by KittenML is an ultra-lightweight text-to-speech model built on ONNX. With variants from 15M to 80M parameters (25-80 MB on disk), it delivers high-quality voice synthesis on CPU without requiring a GPU. Features 8 built-in voices, adjustable speech speed, and built-in text preprocessing for numbers, currencies, and units. Ideal for edge deployment and low-latency applications.
Ọkachasị maka: Fast lightweight TTS, edge deployment, low-latency applications
Nwalee
Bark Standard
Transform-based text-to-audio model nke na-emepụta okwu, egwu, na mmetụta ụda.
Debanye aha: Suno · Ikikere: MIT
Jiri ya
Bark Small Standard
Ụdị dị n'okpuru nke Bark na-eji nghọta dị n'okpuru.
Debanye aha: Suno · Ikikere: MIT
Jiri ya
CosyVoice 2 Standard
Alibaba's scalable streaming TTS na human-parity naturalness na nso-zero latency.
Debanye aha: Alibaba (Tongyi Lab) · Ikikere: Apache 2.0
Jiri ya
Dia TTS Standard
Multi-speaker dialog generation model nke na-emepụta nchọgharị n'etiti ndị na-ekwu okwu.
Debanye aha: Nari Labs · Ikikere: Apache 2.0
Jiri ya
Parler TTS Standard
Depụta ụda ịchọrọ n'asụsụ na-emeghị n'aka na Parler ga-eweta ụda dị n'otu.
Debanye aha: Hugging Face · Ikikere: Apache 2.0
Jiri ya
GLM-TTS Standard
Na-enwe ụkpụrụ nke ndehie akara ala n'etiti okporo ụzọ TTS model.
Debanye aha: Zhipu AI · Ikikere: GLM-4 License
Jiri ya
IndexTTS-2 Standard
Zero-shot TTS na-ejikwa mmetụta uche nke dị mma nakwa n'ụzọ dị elu.
Debanye aha: Index Team · Ikikere: Bilibili Model License
Jiri ya
Spark TTS Standard
Klọnaịsị ụda TTS n'ụdị ụda na-achịkwa ya nakwa n'ụdị okwu site n'ịjụjụ.
Debanye aha: SparkAudio · Ikikere: CC BY-NC-SA 4.0
Jiri ya
GPT-SoVITS Standard
Few-shot ụda na-ebuli TTS nke na-ebuli ụda ọbụla site na sekọnd 5 nke ụda.
Debanye aha: RVC-Boss · Ikikere: MIT
Jiri ya
Orpheus Standard
Human-level emotional TTS model trained on 100K hours of speech data.
Debanye aha: Canopy Labs · Ikikere: Llama 3.2 Community
Jiri ya
Qwen3 TTS Standard
Alibaba's multilingual TTS na ụda cloning, preset ụda, na ụda nhazi site na ngwe.
Debanye aha: Alibaba (Qwen) · Ikikere: Apache 2.0
Jiri ya
Chatterbox Turbo Standard
Chatterbox n'ụzọ nkịtị na sub-200ms latency na paralinguistic tags maka nnụnụ, nkụda mmụọ, na ndị ọzọ.
Debanye aha: Resemble AI · Ikikere: MIT
Jiri ya
Dia 2 Standard
Ntụgharị-ọhụrụ TTS na-asụgharị na-asụgharị na-asụgharị na-asụgharị na-asụgharị.
Debanye aha: Nari Labs · Ikikere: Apache 2.0
Jiri ya
VoxCPM Standard
Tokenizer-free TTS na-eweta 44.1kHz ụda na n'ozuzu ya na-aghọta paragraf.
Debanye aha: OpenBMB · Ikikere: Apache 2.0
Jiri ya
TADA Standard
Zero-hallucination TTS na ngwe-acoustic dual alignment, 5x ngwa ngwa karịa dị ka LLM TTS.
Debanye aha: Hume AI · Ikikere: MIT
Jiri ya
VibeVoice Standard
Móòdù Microsoft maka ihenhọrọ ndị na-ekwusa ọtụtụ ihe dị ka podcasts na audiobooks.
Debanye aha: Microsoft · Ikikere: MIT
Jiri ya
CosyVoice3 Standard
Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.
Debanye aha: Alibaba (FunAudioLLM) · Ikikere: Apache 2.0
Jiri ya
CosyVoice 2
Alibaba's scalable streaming TTS na human-parity naturalness na nso-zero latency.
Asụsụ: en, zh, ja, ko, fr, de, it, es
Kpọnye ụda
IndexTTS-2
Zero-shot TTS na-ejikwa mmetụta uche nke dị mma nakwa n'ụzọ dị elu.
Asụsụ: en, zh
Kpọnye ụda
Spark TTS
Klọnaịsị ụda TTS n'ụdị ụda na-achịkwa ya nakwa n'ụdị okwu site n'ịjụjụ.
Asụsụ: en, zh
Kpọnye ụda
GPT-SoVITS
Few-shot ụda na-ebuli TTS nke na-ebuli ụda ọbụla site na sekọnd 5 nke ụda.
Asụsụ: en, zh, ja, ko
Kpọnye ụda
Chatterbox
State-of-the-art zero-shot ụda ịkọsa na nchịkwa mmetụta site na Resemble AI.
Asụsụ: en
Kpọnye ụda
Tortoise TTS
Multi-voice text-to-speech na-atụle na mma na-eji autoregressive architecture.
Asụsụ: en
Kpọnye ụda
OpenVoice
Nkwado ụda na-akpaghị aka na nlekọta n'elu ụdị, mmetụta, nakwa ntụgharị.
Asụsụ: en, zh, ja, ko, fr, de, es, it
Kpọnye ụda
Qwen3 TTS
Alibaba's multilingual TTS na ụda cloning, preset ụda, na ụda nhazi site na ngwe.
Asụsụ: en, zh, ja, ko, de, fr, ru, pt, es, it
Kpọnye ụda
Chatterbox Turbo
Chatterbox n'ụzọ nkịtị na sub-200ms latency na paralinguistic tags maka nnụnụ, nkụda mmụọ, na ndị ọzọ.
Asụsụ: en
Kpọnye ụda
VoxCPM
Tokenizer-free TTS na-eweta 44.1kHz ụda na n'ozuzu ya na-aghọta paragraf.
Asụsụ: en, zh
Kpọnye ụda
OuteTTS
LLM-n'okpuru TTS na-agbagharị na CPU, GPU, mọọbụ nchọgharị site na llama.cpp na Transformers.js.
Asụsụ: en
Kpọnye ụda
Pocket TTS
Lightweight 100M parameter model site na Kyutai na ụda na-ebuli site na saịmpọn.
Asụsụ: en, fr
Kpọnye ụda
CosyVoice3
Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.
Asụsụ: en, zh, ja, ko, de, es, fr, it, ru
Kpọnye ụda
MOSS-TTS
Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.
Asụsụ: en, zh, de, es, fr, ja, it, hu, ko, ru, fa, ar, pl, pt, cs, da, sv, el, tr
Kpọnye ụda
MegaTTS3
ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.
Asụsụ: en, zh
Kpọnye ụdaDeveloper-First API
OpenAI-compatible REST API. One endpoint, 22+ models. Streaming support for real-time applications.
- OpenAI-compatible format
- TTS na-edebata maka usoroiheomume oge n'eziokwu
- Nhazi batch maka ọrụ ndị dị ukwuu
- Ndesịta ozi ndị ahụ
pip install ttsai
npm install @ttsainpm/ttsai
from tts_ai import TTSClient
client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
text="Hello from TTS.ai!",
model="kokoro",
voice="af_bella",
)
client.save(audio, "output.mp3")
Ajụjụ ndị a na-ajụkarị
Gịnị ka anyị ga-eme ka ọ dịrị mma? Ntụziaka gị na-enyere anyị aka idozi nsogbu.