I-Free AI Okubhaliweyo ukuya kuSpeechName
20+ iimodeli zomthombo ovulekileyo, 107+ ii-voices, 32+ iilwimi. Akukho akhawunti ifunekayo.
Yonke into oyifunayo kwi Voice AI
Izixhobo ezingaphezu kwe-30 ezixhaswa ziimodyuli ze-AI ezivulekileyo
20+ Iimodeli zesandi ze-AI
Uluhlu olupheleleyo lweemodeli ze-TTS ezivulekileyo kwinkqubo enye
Kokoro Free
I-Kokoro yimodeli yombhalo-ukuthetha eneparameter ezili-82 ezili-million eyenza ungqubano oluhle ngaphezulu kweqela layo lobunzima. Nangona ubungakanani bayo buncinci, ivelisa ukuthetha okucacileyo nobucacileyo. I-Kokoro ixhasa ulwimi oluninzi oluquka isiNgesi, isiJaphani, isiTshayina, nesiKorea ngeendlela ezahlukeneyo zesandi ezicacileyo. Isebenza ngokukhawuleza kakhulu — ivelisa isandi esimalunga ne-100x ngokukhawuleza kunexesha elibonakalayo kwi-GPU.
Elungileyo ku: I-TTS esezingeni eliphezulu enexesha lokulibaziseka elincinci, iinkqubo zokudlulisa
Zama simahla
Piper Free
I-Piper yinjini elula yombhalo-ukuthetha ephuhliswe yi Rhasspy esebenzisa i VITS kunye ne-larynx architectures. Isebenza ngokupheleleyo kwi CPU, iyenza ibe yindawo efanelekileyo yezixhobo zesiphelo, ulawulo lwasekhaya, kunye neenkqubo ezifuna i-offline TTS. Ngeelizwi ezingaphezu kwe-100 ezisuka kwiilwimi ezingaphezu kwe-30, i-Piper inikezela ngokuthetha okuziva ngathi kuqhelekanga kwisantya sexesha elibonakalayo nakwi-Raspberry Pi 4.
Elungileyo ku: Imboniselo yabucala ekhawulezayo, ufikelelo, kunye neenkqubo ezifakelweyo
Zama simahla
VITS Free
VITS (I-Variation Inference ne-adversarial learning for end-to-end Text-to-Speech) yindlela efana ne-end-to-end TTS evelisa isandi esininzi esiqhelekileyo kunezikhokelo zenqanaba elinye. Isebenzisa i-variation inference ephuculweyo ngokuhamba okuqhelekileyo kunye nenkqubo yoqeqesho oluchaphazelayo, efumana ukuphuculwa okubalulekileyo kwindalo.
Elungileyo ku: Umbhalo-usuka-ku-ukuthetha osetyenziswa ngokubanzi nge-prosody eqhelekileyo
Zama simahla
MeloTTS Free
MeloTTS yi MyShell. ai yi TTS yelayibrari exhasa isiNgesi (iMelika, iBrithani, i-Indian, i-Australian), isiSpanyol, isiFrentshi, isiTshayina, isiJaphani, nesiKorea. Ikhawuleza kakhulu, iqhubekekisa umbhalo kwisantya esifutshane sexesha elibonakalayo kwi CPU kuphela. MeloTTS icwangciswe ukusetyenziswa kokwenza imveliso kwaye ixhasa zombini i CPU ne GPU inference.
Elungileyo ku: Iinkqubo zokuvelisa ezifuna i-TTS ekhawulezayo, eneelwimi ezininzi
Zama simahla
Bark Standard
Imodeli yombhalo-ukuya-kwisandi esekelwe kwi-transformer evelisa ukuthetha okunyanisekileyo, umculo, kunye neziphumo zesandi.
Umbhekisi phambili: Suno · Ilayisenisi: MIT
Zama kwakhona
Bark Small Standard
Uguqulelo olusezantsi lwe Bark olunolwazi olukhawulezayo nokusetyenziswa okuphantsi kovimba wolwazi.
Umbhekisi phambili: Suno · Ilayisenisi: MIT
Zama kwakhona
CosyVoice 2 Standard
I-Alibaba's scalable streaming TTS ene-human-parity naturalness kunye ne-zero-near latency.
Umbhekisi phambili: Alibaba (Tongyi Lab) · Ilayisenisi: Apache 2.0
Zama kwakhona
Dia TTS Standard
Imodeli yokuveliswa kwencoko yababini yesandi esininzi eyenza unxibelelwano oluqhelekileyo phakathi kwamasandi.
Umbhekisi phambili: Nari Labs · Ilayisenisi: Apache 2.0
Zama kwakhona
Parler TTS Standard
Ichaza ilizwi ofuna ngayo kwilwimi oluqhelekileyo kwaye i-Parler ivelise ukuthetha okuhambelanayo.
Umbhekisi phambili: Hugging Face · Ilayisenisi: Apache 2.0
Zama kwakhona
GLM-TTS Standard
Ifumana umyinge womlinganiselo womonakalo wophawu olusezantsi phakathi kweemodeli ze-TTS ezivulekileyo.
Umbhekisi phambili: Zhipu AI · Ilayisenisi: GLM-4 License
Zama kwakhona
IndexTTS-2 Standard
I-TTS engapheliyo ene-fine-grained emotional control kunye nokubonisa okuphezulu.
Umbhekisi phambili: Index Team · Ilayisenisi: Bilibili Model License
Zama kwakhona
Spark TTS Standard
Uklone lwelizwi le TTS ngeemvakalelo ezilawulwayo kunye nesitayile sokuthetha ngeempendulo.
Umbhekisi phambili: SparkAudio · Ilayisenisi: CC BY-NC-SA 4.0
Zama kwakhona
GPT-SoVITS Standard
Ilizwi elincinci-eliqhutywa lokuklonya i-TTS ephindayo nayiphi na ilizwi ukusuka kwimizuzu emihlanu kuphela yesandi.
Umbhekisi phambili: RVC-Boss · Ilayisenisi: MIT
Zama kwakhona
Orpheus Standard
Imodeli ye-TTS evakalelwa ngamandla enqanaba lomuntu eqeqeshwe kwi-100K yeeyure zedatha yokuthetha.
Umbhekisi phambili: Canopy Labs · Ilayisenisi: Llama 3.2 Community
Zama kwakhona
Qwen3 TTS Standard
I-Alibaba's multilingual TTS enesandi sokukrola, ilizwi elimiselweyo, kunye noyilo lwesandi ukusuka kumbhalo.
Umbhekisi phambili: Alibaba (Qwen) · Ilayisenisi: Apache 2.0
Zama kwakhona
CosyVoice 2
I-Alibaba's scalable streaming TTS ene-human-parity naturalness kunye ne-zero-near latency.
Iilwimi: en, zh, ja, ko, fr, de, it, es
Ilizwi lika-Clone
GLM-TTS
Ifumana umyinge womlinganiselo womonakalo wophawu olusezantsi phakathi kweemodeli ze-TTS ezivulekileyo.
Iilwimi: en, zh
Ilizwi lika-Clone
IndexTTS-2
I-TTS engapheliyo ene-fine-grained emotional control kunye nokubonisa okuphezulu.
Iilwimi: en, zh
Ilizwi lika-Clone
Spark TTS
Uklone lwelizwi le TTS ngeemvakalelo ezilawulwayo kunye nesitayile sokuthetha ngeempendulo.
Iilwimi: en, zh
Ilizwi lika-Clone
GPT-SoVITS
Ilizwi elincinci-eliqhutywa lokuklonya i-TTS ephindayo nayiphi na ilizwi ukusuka kwimizuzu emihlanu kuphela yesandi.
Iilwimi: en, zh, ja, ko
Ilizwi lika-Clone
Chatterbox
Uhlobo olutsha lwesandi esingena-nto esifana nesandi esilawulwa ngumnqweno ovela kwiResemble AI.
Iilwimi: en
Ilizwi lika-Clone
Tortoise TTS
Umbhalo-ukuthetha-ngezwi oluninzi olujolise kwixabiso kunye noyilo oluya ezantsi ngokuzenzekelayo.
Iilwimi: en
Ilizwi lika-Clone
OpenVoice
Uklonelo lwesandi olukhawulezayo nolawulo oluthe kratya kwindlela, imvakalelo, nesiqhelo.
Iilwimi: en, zh, ja, ko, fr, de, es, it
Ilizwi lika-Clone
Qwen3 TTS
I-Alibaba's multilingual TTS enesandi sokukrola, ilizwi elimiselweyo, kunye noyilo lwesandi ukusuka kumbhalo.
Iilwimi: en, zh, ja, ko, de, fr, ru, pt, es, it
Ilizwi lika-CloneUmbhekisi phambili-Okuqalayo
I-REST API ehambelana ne-OpenAI. Incopho enye yesiphelo, iimodeli ezingaphezu kwe-22. Inkxaso yosasazo lwezicelo zexesha elibonakalayo.
- Ifomati ehambelana ne-OpenAI
- Unikezelo lwe-TTS lweenkqubo zexesha elibonakalayo
- Uqhubekeko lweqela lomsebenzi omkhulu
- Isaziso se Webhook
pip install ttsai
npm install @ttsainpm/ttsai
from tts_ai import TTSClient
client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
text="Hello from TTS.ai!",
model="kokoro",
voice="af_bella",
)
client.save(audio, "output.mp3")
Ixabiso elilula, elicacileyo
Qala ngokukhululekileyo. Ubungakanani njengoko ukhula.
Ekhululekileyo
15,000 characters
- Kokoro, Piper, VITS, MeloTTS
- Umda we-500 char
- 3 gen/iyure (akukho akhawunti)
Isiqalisi
500,000 characters/month
- Zonke iimodeli ezingaphezu kwe-22
- 100,000 chars per generation
- I-Voice Cloning
I-Pro
2,000 iikhredithi/inyanga
- Yonke into kwisiqalisi
- Ufikelelo lwe-API
- Ukuqhubekeka okuphambili
Imisebenzi
10,000 iikhredithi/inyanga
- Yonke into kwi-Pro
- I-Bulk API
- Ufolo oluphambili
Imibuzo ebuzwa rhoqo
Qala Ukusebenzisa i-AI Voice Namhlanje
Dibanisa abavelisi, abaphuhlisi, kunye neenkampani usebenzisa i-TTS.ai