I-Free AI Umbhalo usuka kumazwi
20+ imodeli yomthombo ovulekile 107+ izizwi, 32+ Izilimi. Akukho akhawunti edingekayo.
Konke okudingayo ngezwi AI
Amathuluzi angama-30+ asebenza ngemodeli ye-AI evulekile
20+ Amamodeli omsindo we-AI
Uhlelo oluphelele kakhulu lwezimo ze-TTS ezivulekile ezikhona kwi-platform eyodwa
Kokoro Free
I-Kokoro iyimodeli ye-text-to-speech eneparameter engu-82 million eyenza kahle ngaphezu kwe-weight class yayo. Nakuba incane kakhulu, ikhiqiza amagama acacile futhi acacile. I-Kokoro isekela izilimi eziningi kufaka phakathi isiNgisi, isiJaphani, isiTshayina, nesiKoreane ngezinhlobonhlobo zamazwi acacile. Isebenza ngokushesha kakhulu — ikhiqiza umsindo osheshayo cishe ngama-100x kunosikhathi sangempela kwi-GPU.
Okungcono kakhulu: Ikhwalithi ephezulu ye-TTS enesikhathi sokuphuma esincane, izisebenziso zokusakaza
Zama mahhala
Piper Free
I-Piper iyinjini elula yokubhala-ukukhuluma ethuthukiswe yi-Rhasspy esebenzisa i-VITS ne-larynx architectures. Isebenza ngokuphelele ku-CPU, iyenza ibe ngcono kakhulu kumadivayisi e-edge, ukuphathwa kwekhaya, namathuluzi adinga i-TTS engenayo. Ngezwi elingaphezu kuka-100 lidlula ulwimi olungaphezu kuka-30, i-Piper inikeza ukukhuluma okubukekayo ngokuzenzakalela ngejubane lesikhathi sangempela ngisho ne-Raspberry Pi 4.
Okungcono kakhulu: Ukubukeka okukhawulelwe, ukufinyeleleka, kanye nezisebenziso ezifakwe ngaphakathi
Zama mahhala
VITS Free
VITS (Izibalo ezishintshayo ezifunda ngokuphikisanayo ukuqala ukubhala-ukukhuluma-ukuphela-ku-kuphela) yindlela ye-TTS elinganayo ekugcineni-ku-kuphela ekhiqiza umsindo ozwakalayo ojwayelekile kunalezo ezingemuva-ezimbili. Isebenzisa izibalo ezishintshayo ezithuthukisiwe ngokuhamba okujwayelekile kanye nenqubo yokuqeqeshwa okuphikisanayo, ethola ukukhula okuphawulekayo ekungavamile.
Okungcono kakhulu: Umbhalo-ku-ukukhuluma okusetshenziswa kakhulu nge-prosody ejwayelekile
Zama mahhala
MeloTTS Free
MeloTTS ngu MyShell.ai yi-TTS library eminingi ye-languages exhasa isiNgisi (i-American, i-British, i-Indian, i-Australian), isiShayina, isiJalimane, isiKorean. Ishesha kakhulu, isebenza umbhalo ngejubane elifanayo nesikhathi sangempela kwi-CPU kuphela. MeloTTS isetshenziselwa ukusetshenziswa kokukhiqizwa futhi ixhasa i-CPU ne-GPU inference.
Okungcono kakhulu: Izisebenziso zokukhiqiza ezidinga i-TTS esheshayo, enezilimi eziningi
Zama mahhala
Bark Standard
Imodeli yokubhala-kuya-kwesandi esekelwe ku-transformer ekhiqiza amagama acacile, umculo, kanye nemiphumela yomsindo.
Umthuthukisi: Suno · Ilayisense: MIT
Zama
Bark Small Standard
Uhlobo oluncane lwe-Bark olunezincazelo ezisheshayo nokusetshenziswa okuphansi kwememori.
Umthuthukisi: Suno · Ilayisense: MIT
Zama
CosyVoice 2 Standard
I-Alibaba's scalable streaming TTS ne-human-parity naturalness ne-near-zero latency.
Umthuthukisi: Alibaba (Tongyi Lab) · Ilayisense: Apache 2.0
Zama
Dia TTS Standard
Imodeli yokukhiqiza umsindo oningi owenza ukuxhumana okujwayelekile phakathi kwama-speakers.
Umthuthukisi: Nari Labs · Ilayisense: Apache 2.0
Zama
Parler TTS Standard
Sichaza umsindo ofuna ngesilimi esijwayelekile futhi i-Parler ikhiqiza umsindo olinganayo.
Umthuthukisi: Hugging Face · Ilayisense: Apache 2.0
Zama
GLM-TTS Standard
Ithola iphutha lophawu oluphansi phakathi kwemodeli ye-TTS yomthombo ovulekile.
Umthuthukisi: Zhipu AI · Ilayisense: GLM-4 License
Zama
IndexTTS-2 Standard
I-TTS engekho emthethweni ene-fine-grained emotional control ne-high expressionality.
Umthuthukisi: Index Team · Ilayisense: Bilibili Model License
Zama
Spark TTS Standard
Uhlu lwezwi lokuklonya i-TTS nge-emoji elawulwayo nesimo sokukhuluma nge-prompts.
Umthuthukisi: SparkAudio · Ilayisense: CC BY-NC-SA 4.0
Zama
GPT-SoVITS Standard
Uhlu lwezwi lokuklonya TTS oluncane oluphindayo noma yiluphi ulwimi kusuka kumasekondi angama-5 kuphela wesandi.
Umthuthukisi: RVC-Boss · Ilayisense: MIT
Zama
Orpheus Standard
Imodeli ye-TTS enamandla okuqonda esezingeni lomuntu eqeqeshiwe ngehora le-100K ledatha yokukhuluma.
Umthuthukisi: Canopy Labs · Ilayisense: Llama 3.2 Community
Zama
Qwen3 TTS Standard
I-Alibaba's multilingual TTS nezwi lokuklonya, izizwi ezisetshenzisiwe, kanye nobuciko bezwi kusuka kumbhalo.
Umthuthukisi: Alibaba (Qwen) · Ilayisense: Apache 2.0
Zama
CosyVoice 2
I-Alibaba's scalable streaming TTS ne-human-parity naturalness ne-near-zero latency.
Izilimi: en, zh, ja, ko, fr, de, it, es
Umsindo
GLM-TTS
Ithola iphutha lophawu oluphansi phakathi kwemodeli ye-TTS yomthombo ovulekile.
Izilimi: en, zh
Umsindo
IndexTTS-2
I-TTS engekho emthethweni ene-fine-grained emotional control ne-high expressionality.
Izilimi: en, zh
Umsindo
Spark TTS
Uhlu lwezwi lokuklonya i-TTS nge-emoji elawulwayo nesimo sokukhuluma nge-prompts.
Izilimi: en, zh
Umsindo
GPT-SoVITS
Uhlu lwezwi lokuklonya TTS oluncane oluphindayo noma yiluphi ulwimi kusuka kumasekondi angama-5 kuphela wesandi.
Izilimi: en, zh, ja, ko
Umsindo
Chatterbox
Uhlelo olusha lokuklonya umsindo olungenalutho olune-emotion control oluvela ku-Resemble AI.
Izilimi: en
Umsindo
Tortoise TTS
Umbhalo-ku-ukukhuluma okhuluma ngezilimi eziningi obhekene nekhwalithi ngesakhiwo esibuyela emuva.
Izilimi: en
Umsindo
OpenVoice
Ukuklonya umsindo ngokuzenzakalela ngokulawula okuqinile ngesitayela, inkanuko, nesimo.
Izilimi: en, zh, ja, ko, fr, de, es, it
Umsindo
Qwen3 TTS
I-Alibaba's multilingual TTS nezwi lokuklonya, izizwi ezisetshenzisiwe, kanye nobuciko bezwi kusuka kumbhalo.
Izilimi: en, zh, ja, ko, de, fr, ru, pt, es, it
UmsindoUmthuthukisi-kuqala API
I-REST API ehambisana ne-OpenAI. Ingxenye eyodwa, amamodeli angama-22+ Ukusakazwa kwengxoxo yesikhathi sangempela.
- Ifomethi ehambisana ne-OpenAI
- Ukusakazwa kwe-TTS kwezinhlelo zokusebenza zesikhathi sangempela
- Uhlelo lwe-batch lwemisebenzi enkulu
- Ulwaziso lwe-Webhook
pip install ttsai
npm install @ttsainpm/ttsai
from tts_ai import TTSClient
client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
text="Hello from TTS.ai!",
model="kokoro",
voice="af_bella",
)
client.save(audio, "output.mp3")
Intengo elula, ecacile
Qalisa ngokukhululekileyo. Ukukala njengoba ukhula.
Ikhululekile
15,000 characters
- Kokoro, Piper, VITS, MeloTTS
- Iphutha lophawu lwe-500
- 3 gen/ihora (akukho akhawunti)
Isiqalisi
500,000 characters/month
- Zonke imodeli ezingu-22+
- 100,000 chars per generation
- Ukulungiswa kwezwi
I-Pro
2,000,000 characters/month
- Konke ku-Starter
- Ukungena kwe-API
- Ukulungiswa kokuqala
Ibhizinisi
10,000,000 characters/month
- Konke ku-Pro
- I-bulk API
- Ifolokhwe yesinqumo
Bona zonke izilungiselelo kufaka phakathi izilungiselelo zophawu →
Imibuzo ebuzwa kaningi
Qala ukusebenzisa umsindo we-AI namhlanje
Xhumana nabakhiqizi, abathuthukisi, namabhizinisi asebenzisa i-TTS.ai