I-Free AI Okubhaliweyo ukuya kuSpeechName
31+ iimodeli zomthombo ovulekileyo, 231+ ii-voices, 34+ Iinkqubo zekhompyutha
Yonke into oyifunayo kwi Voice AI
Izixhobo ezingaphezu kwe-30 ezixhaswa ziimodyuli ze-AI ezivulekileyo
31+ Iimodeli zesandi ze-AI
Uluhlu olupheleleyo lweemodeli ze-TTS ezivulekileyo kwinkqubo enye
Kokoro Free
I-Kokoro yimodeli yombhalo-ukuthetha eneparameter ezili-82 ezili-million eyenza ungqubano oluhle ngaphezulu kweqela layo lobunzima. Nangona ubungakanani bayo buncinci, ivelisa ukuthetha okucacileyo nobucacileyo. I-Kokoro ixhasa ulwimi oluninzi oluquka isiNgesi, isiJaphani, isiTshayina, nesiKorea ngeendlela ezahlukeneyo zesandi ezicacileyo. Isebenza ngokukhawuleza kakhulu — ivelisa isandi esimalunga ne-100x ngokukhawuleza kunexesha elibonakalayo kwi-GPU.
Elungileyo ku: I-TTS esezingeni eliphezulu enexesha lokulibaziseka elincinci, iinkqubo zokudlulisa
Zama simahla
Piper Free
I-Piper yinjini elula yombhalo-ukuthetha ephuhliswe yi Rhasspy esebenzisa i VITS kunye ne-larynx architectures. Isebenza ngokupheleleyo kwi CPU, iyenza ibe yindawo efanelekileyo yezixhobo zesiphelo, ulawulo lwasekhaya, kunye neenkqubo ezifuna i-offline TTS. Ngeelizwi ezingaphezu kwe-100 ezisuka kwiilwimi ezingaphezu kwe-30, i-Piper inikezela ngokuthetha okuziva ngathi kuqhelekanga kwisantya sexesha elibonakalayo nakwi-Raspberry Pi 4.
Elungileyo ku: Imboniselo yabucala ekhawulezayo, ufikelelo, kunye neenkqubo ezifakelweyo
Zama simahla
VITS Free
VITS (I-Variation Inference ne-adversarial learning for end-to-end Text-to-Speech) yindlela efana ne-end-to-end TTS evelisa isandi esininzi esiqhelekileyo kunezikhokelo zenqanaba elinye. Isebenzisa i-variation inference ephuculweyo ngokuhamba okuqhelekileyo kunye nenkqubo yoqeqesho oluchaphazelayo, efumana ukuphuculwa okubalulekileyo kwindalo.
Elungileyo ku: Umbhalo-usuka-ku-ukuthetha osetyenziswa ngokubanzi nge-prosody eqhelekileyo
Zama simahla
MeloTTS Free
MeloTTS yi MyShell. ai yi TTS yelayibrari exhasa isiNgesi (iMelika, iBrithani, i-Indian, i-Australian), isiSpanyol, isiFrentshi, isiTshayina, isiJaphani, nesiKorea. Ikhawuleza kakhulu, iqhubekekisa umbhalo kwisantya esifutshane sexesha elibonakalayo kwi CPU kuphela. MeloTTS icwangciswe ukusetyenziswa kokwenza imveliso kwaye ixhasa zombini i CPU ne GPU inference.
Elungileyo ku: Iinkqubo zokuvelisa ezifuna i-TTS ekhawulezayo, eneelwimi ezininzi
Zama simahla
OuteTTS Free
OuteTTS iqhuba iimodeli ezinkulu zolwimi ngemisebenzi yokubhala-ukuze-uthethe ngelixa igcina uyilo oluphambili. Ixhasa ii-backends ezininzi kubandakanya i-lama.cpp (CPU/GPU), Ukutsala i-Face Transformers, ExLlamaV2, VLLM, naphi na ukuqonda kwebrowser nge-Transformers.js. Iimpawu zokuklona kwelizwi elingenanto-eyenziweyo ngeeprofayili zomthumeli ezigcinwe njenge-JSON.
Elungileyo ku: Unikezelo lwe-edge, i-TTS esekelwe kwi-browser, imigangatho ephantsi-yomthombo
Zama simahla
Pocket TTS Free
I Pocket TTS ngu Kyutai (abavelisi be Moshi) yimodeli yombhalo- ukuya- ku- kuthetha encinci eneparameter ye 100M eyenza ubunzima bayo. Isebenza kakuhle kwi CPU, ixhasa ukuklona kwesandi esingenanto ukusuka kwisampuli yesandi, kwaye ivelisa ulwimi oluzimeleyo. Ubungakanani bemodeli encinci yenza ukuba ibe yindawo efanelekileyo yokubekwa kwesiphelo kunye nemeko- bume ephantsi yecebo.
Elungileyo ku: Unikezelo olusezantsi, i CPU- kuphela iimeko- bume, ukuclona kwelizwi ngokukhawuleza
Zama simahla
Kitten TTS Free
Kitten TTS by KittenML is an ultra-lightweight text-to-speech model built on ONNX. With variants from 15M to 80M parameters (25-80 MB on disk), it delivers high-quality voice synthesis on CPU without requiring a GPU. Features 8 built-in voices, adjustable speech speed, and built-in text preprocessing for numbers, currencies, and units. Ideal for edge deployment and low-latency applications.
Elungileyo ku: Fast lightweight TTS, edge deployment, low-latency applications
Zama simahla
Bark Standard
Imodeli yombhalo-ukuya-kwisandi esekelwe kwi-transformer evelisa ukuthetha okunyanisekileyo, umculo, kunye neziphumo zesandi.
Umbhekisi phambili: Suno · Ilayisensi: MIT
Zama kwakhona
Bark Small Standard
Uguqulelo olusezantsi lwe Bark olunolwazi olukhawulezayo nokusetyenziswa okuphantsi kovimba wolwazi.
Umbhekisi phambili: Suno · Ilayisensi: MIT
Zama kwakhona
CosyVoice 2 Standard
I-Alibaba's scalable streaming TTS ene-human-parity naturalness kunye ne-zero-near latency.
Umbhekisi phambili: Alibaba (Tongyi Lab) · Ilayisensi: Apache 2.0
Zama kwakhona
Dia TTS Standard
Imodeli yokudala ingxoxo yomthumeli-omninzi eyenza ingxoxo eqhelekileyo phakathi kwamathumeli.
Umbhekisi phambili: Nari Labs · Ilayisensi: Apache 2.0
Zama kwakhona
Parler TTS Standard
Ichaza ilizwi ofuna ngayo kwilwimi oluqhelekileyo kwaye i-Parler ivelise ukuthetha okuhambelanayo.
Umbhekisi phambili: Hugging Face · Ilayisensi: Apache 2.0
Zama kwakhona
GLM-TTS Standard
Ifumana umyinge womlinganiselo womonakalo wophawu olusezantsi phakathi kweemodeli ze-TTS ezivulekileyo.
Umbhekisi phambili: Zhipu AI · Ilayisensi: GLM-4 License
Zama kwakhona
IndexTTS-2 Standard
I-TTS engapheliyo ene-fine-grained emotional control kunye nokubonisa okuphezulu.
Umbhekisi phambili: Index Team · Ilayisensi: Bilibili Model License
Zama kwakhona
Spark TTS Standard
Uklone lwelizwi le TTS ngeemvakalelo ezilawulwayo kunye nesitayile sokuthetha ngeempendulo.
Umbhekisi phambili: SparkAudio · Ilayisensi: CC BY-NC-SA 4.0
Zama kwakhona
GPT-SoVITS Standard
Ilizwi elincinci-eliqhutywa lokuklonya i-TTS ephindayo nayiphi na ilizwi ukusuka kwimizuzu emihlanu kuphela yesandi.
Umbhekisi phambili: RVC-Boss · Ilayisensi: MIT
Zama kwakhona
Orpheus Standard
Imodeli ye-TTS evakalelwa ngamandla enqanaba lomuntu eqeqeshwe kwi-100K yeeyure zedatha yokuthetha.
Umbhekisi phambili: Canopy Labs · Ilayisensi: Llama 3.2 Community
Zama kwakhona
Qwen3 TTS Standard
I-Alibaba's multilingual TTS enesandi sokukrola, ilizwi elimiselweyo, kunye noyilo lwesandi ukusuka kumbhalo.
Umbhekisi phambili: Alibaba (Qwen) · Ilayisensi: Apache 2.0
Zama kwakhona
Chatterbox Turbo Standard
Ibhokisi yencoko yababini ekhawulezayo ene sub-200ms latency kunye nee-tags zeparalinguistic zoluvo, ukuphefumla, kunye nezinye izinto.
Umbhekisi phambili: Resemble AI · Ilayisensi: MIT
Zama kwakhona
Dia 2 Standard
Ukusasazwa-kuqala kwe-TTS yonxibelelwano kunye nonxibelelwano lomntu othetha-ninzi kunye neengcebiso zeparalinguistic.
Umbhekisi phambili: Nari Labs · Ilayisensi: Apache 2.0
Zama kwakhona
VoxCPM Standard
I-Tokenizer-free TTS ivelisa i-44.1kHz yesandi ngemeko-bume eyaziyo iparagraph consistency.
Umbhekisi phambili: OpenBMB · Ilayisensi: Apache 2.0
Zama kwakhona
TADA Standard
I-TTS engabonakaliyo-ngamanzi enemigca emibini yokulungelelanisa umbhalo-ukubonakala, ikhawuleza ka-5x kune-LLM TTS elinganisekayo.
Umbhekisi phambili: Hume AI · Ilayisensi: MIT
Zama kwakhona
VibeVoice Standard
Imodeli ye-Microsoft yezinto eziqulethe i-multi-speaker ezifana nepodcasts kunye neencwadi zesandi.
Umbhekisi phambili: Microsoft · Ilayisensi: MIT
Zama kwakhona
CosyVoice3 Standard
Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.
Umbhekisi phambili: Alibaba (FunAudioLLM) · Ilayisensi: Apache 2.0
Zama kwakhona
CosyVoice 2
I-Alibaba's scalable streaming TTS ene-human-parity naturalness kunye ne-zero-near latency.
Iilwimi: en, zh, ja, ko, fr, de, it, es
Ilizwi lika-Clone
GLM-TTS
Ifumana umyinge womlinganiselo womonakalo wophawu olusezantsi phakathi kweemodeli ze-TTS ezivulekileyo.
Iilwimi: en, zh
Ilizwi lika-Clone
IndexTTS-2
I-TTS engapheliyo ene-fine-grained emotional control kunye nokubonisa okuphezulu.
Iilwimi: en, zh
Ilizwi lika-Clone
Spark TTS
Uklone lwelizwi le TTS ngeemvakalelo ezilawulwayo kunye nesitayile sokuthetha ngeempendulo.
Iilwimi: en, zh
Ilizwi lika-Clone
GPT-SoVITS
Ilizwi elincinci-eliqhutywa lokuklonya i-TTS ephindayo nayiphi na ilizwi ukusuka kwimizuzu emihlanu kuphela yesandi.
Iilwimi: en, zh, ja, ko
Ilizwi lika-Clone
Chatterbox
Uhlobo olutsha lwesandi esingena-nto esifana nesandi esilawulwa ngumnqweno ovela kwiResemble AI.
Iilwimi: en
Ilizwi lika-Clone
Tortoise TTS
Umbhalo-ukuthetha-ngezwi oluninzi olujolise kwixabiso kunye noyilo oluya ezantsi ngokuzenzekelayo.
Iilwimi: en
Ilizwi lika-Clone
OpenVoice
Uklonelo lwesandi olukhawulezayo nolawulo oluthe kratya kwindlela, imvakalelo, nesiqhelo.
Iilwimi: en, zh, ja, ko, fr, de, es, it
Ilizwi lika-Clone
Qwen3 TTS
I-Alibaba's multilingual TTS enesandi sokukrola, ilizwi elimiselweyo, kunye noyilo lwesandi ukusuka kumbhalo.
Iilwimi: en, zh, ja, ko, de, fr, ru, pt, es, it
Ilizwi lika-Clone
Chatterbox Turbo
Ibhokisi yencoko yababini ekhawulezayo ene sub-200ms latency kunye nee-tags zeparalinguistic zoluvo, ukuphefumla, kunye nezinye izinto.
Iilwimi: en
Ilizwi lika-Clone
VoxCPM
I-Tokenizer-free TTS ivelisa i-44.1kHz yesandi ngemeko-bume eyaziyo iparagraph consistency.
Iilwimi: en, zh
Ilizwi lika-Clone
OuteTTS
I-LLM-based TTS esebenza kwi-CPU, GPU, okanye kwi-browser nge-lama.cpp ne-Transformers.js.
Iilwimi: en
Ilizwi lika-Clone
Pocket TTS
Imodeli elula yeparamitha ye-100M ye-Kyutai enesandi esifana nesona esivela kwisikhokelo esifanayo.
Iilwimi: en, fr
Ilizwi lika-Clone
CosyVoice3
Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.
Iilwimi: en, zh, ja, ko, de, es, fr, it, ru
Ilizwi lika-Clone
MOSS-TTS
Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.
Iilwimi: en, zh, de, es, fr, ja, it, hu, ko, ru, fa, ar, pl, pt, cs, da, sv, el, tr
Ilizwi lika-Clone
MegaTTS3
ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.
Iilwimi: en, zh
Ilizwi lika-CloneUmbhekisi phambili-Okuqalayo API
I-REST API ehambelana ne-OpenAI. Incopho enye yesiphelo, iimodeli ezingaphezu kwe-22. Inkxaso yosasazo lwezicelo zexesha elibonakalayo.
- Ifomati ehambelana ne-OpenAI
- Unikezelo lwe-TTS lweenkqubo zexesha elibonakalayo
- Uqhubekeko lweqela lomsebenzi omkhulu
- Isaziso se Webhook
pip install ttsai
npm install @ttsainpm/ttsai
from tts_ai import TTSClient
client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
text="Hello from TTS.ai!",
model="kokoro",
voice="af_bella",
)
client.save(audio, "output.mp3")
Ixabiso elilula, elicacileyo
Qala ngokukhululekileyo. Ubungakanani njengoko ukhula.
Ekhululekileyo
15,000 iimpawu
- Kokoro, Piper, VITS, MeloTTS
- Umda we-500 char
- 3 gen/iyure (akukho akhawunti)
Isiqalisi
500,000 iimpawu/inyanga
- Zonke iimodeli ezingaphezu kwe-22
- 100,000 iimpawu ngenkqubo
- I-Voice Cloning
I-Pro
2,000 iikhredithi/inyanga
- Yonke into kwisiqalisi
- Ufikelelo lwe-API
- Ukuqhubekeka okuphambili
Imisebenzi
10,000 iikhredithi/inyanga
- Yonke into kwi-Pro
- I-Bulk API
- Ufolo oluphambili
Imibuzo ebuzwa rhoqo
Yintoni esinokuyilungisa? Ulwazi lwakho olufunyenweyo lunceda silungise iingxaki.
Qala Ukusebenzisa i-AI Voice Namhlanje
Dibanisa abavelisi, abaphuhlisi, kunye neenkampani usebenzisa i-TTS.ai