Free AI Text to Speech

31+ open-source models, 231+ voices, 34+ tools

8K+ creators
30K+ generations
31+ AI models
231+ voices
Free: up to 500 characters per generation. Sign up for 5,000 characters per generation.

Everything You Need in Voice AI

30+ tools powered by open-source AI models

31+ AI Voice Models

The complete collection of open-source TTS models on one platform

Kokoro (Free)

Kokoro is an 82-million-parameter text-to-speech model that performs well above its weight class. Despite its small size, it produces clear, natural-sounding speech. Kokoro supports multiple languages, including English, Japanese, Chinese, and Korean, with a range of distinct voice styles. It runs very fast, generating audio roughly 100x faster than real time on a GPU.

Best for: High-quality TTS with low latency, streaming applications

Try for free

Piper (Free)

Piper is a lightweight text-to-speech engine developed by Rhasspy that uses the VITS and Larynx architectures. It runs entirely on CPU, making it a good fit for edge devices, home automation, and applications that need offline TTS. With more than 100 voices across more than 30 languages, Piper delivers natural-sounding speech at real-time speed, even on a Raspberry Pi 4.

Best for: Fast, private synthesis, accessibility, and embedded systems

Try for free

VITS (Free)

VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end TTS approach that generates more natural-sounding audio than two-stage pipelines. It combines variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.

Best for: General-purpose text-to-speech with natural prosody

Try for free

MeloTTS (Free)

MeloTTS by MyShell.ai is a TTS library that supports English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is very fast, processing text at near real-time speed on CPU alone. MeloTTS is optimized for production use and supports both CPU and GPU inference.

Best for: Production applications that need fast, multilingual TTS

Try for free

OuteTTS (Free)

OuteTTS extends large language models with text-to-speech capabilities while preserving the original architecture. It supports multiple backends, including llama.cpp (CPU/GPU), Hugging Face Transformers, ExLlamaV2, vLLM, and in-browser inference via Transformers.js. It features zero-shot voice cloning with speaker profiles stored as JSON.

Best for: Edge deployment, browser-based TTS, low-resource environments

Try for free

Pocket TTS (Free)

Pocket TTS by Kyutai (the creators of Moshi) is a compact 100M-parameter text-to-speech model that punches above its weight. It runs well on CPU, supports zero-shot voice cloning from an audio sample, and produces natural-sounding speech. Its small model size makes it a good fit for edge deployment and low-resource environments.

Best for: Edge deployment, CPU-only environments, fast voice cloning

Try for free

Kitten TTS (Free)

Kitten TTS by KittenML is an ultra-lightweight text-to-speech model built on ONNX. With variants from 15M to 80M parameters (25-80 MB on disk), it delivers high-quality voice synthesis on CPU without requiring a GPU. Features 8 built-in voices, adjustable speech speed, and built-in text preprocessing for numbers, currencies, and units. Ideal for edge deployment and low-latency applications.

Best for: Fast lightweight TTS, edge deployment, low-latency applications

Try for free

Bark (Standard)

Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.

Developer: Suno · License: MIT

Try now

Bark Small (Standard)

Smaller version of Bark with faster inference and lower memory usage.

Developer: Suno · License: MIT

Try now

CosyVoice 2 (Standard)

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Developer: Alibaba (Tongyi Lab) · License: Apache 2.0

Try now

Dia TTS (Standard)

Multi-speaker dialogue generation model that produces natural conversation between speakers.

Developer: Nari Labs · License: Apache 2.0

Try now

Parler TTS (Standard)

Describe the voice you want in natural language and Parler generates matching speech.

Developer: Hugging Face · License: Apache 2.0

Try now

GLM-TTS (Standard)

Achieves the lowest character error rate among open-source TTS models.

Developer: Zhipu AI · License: GLM-4 License

Try now

IndexTTS-2 (Standard)

Zero-shot TTS with fine-grained emotional control and high expressiveness.

Developer: Index Team · License: Bilibili Model License

Try now

Spark TTS (Standard)

Voice cloning TTS with controllable emotion and speaking style through prompts.

Developer: SparkAudio · License: CC BY-NC-SA 4.0

Try now

GPT-SoVITS (Standard)

Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.

Developer: RVC-Boss · License: MIT

Try now

Orpheus (Standard)

TTS model with human-level emotional expressiveness, trained on 100K hours of speech data.

Developer: Canopy Labs · License: Llama 3.2 Community

Try now

Qwen3 TTS (Standard)

Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.

Developer: Alibaba (Qwen) · License: Apache 2.0

Try now

Chatterbox Turbo (Standard)

The fastest Chatterbox, with sub-200ms latency and paralinguistic tags for laughter, breathing, and more.

Developer: Resemble AI · License: MIT

Try now

Dia 2 (Standard)

Streaming-first conversational TTS with multi-speaker dialogue and paralinguistic cues.

Developer: Nari Labs · License: Apache 2.0

Try now

VoxCPM (Standard)

Tokenizer-free TTS that produces 44.1kHz audio with context-aware paragraph consistency.

Developer: OpenBMB · License: Apache 2.0

Try now

TADA (Standard)

Hallucination-free TTS with dual text-acoustic alignment, 5x faster than comparable LLM-based TTS.

Developer: Hume AI · License: MIT

Try now

VibeVoice (Standard)

Microsoft's model for multi-speaker content such as podcasts and audiobooks.

Developer: Microsoft · License: MIT

Try now

CosyVoice3 (Standard)

Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.

Developer: Alibaba (FunAudioLLM) · License: Apache 2.0

Try now

Chatterbox (Premium)

State-of-the-art zero-shot voice cloning with emotion control from Resemble AI.

Try now

Tortoise TTS (Premium)

Multi-voice text-to-speech focused on quality, built on an autoregressive architecture.

Try now

StyleTTS 2 (Premium)

Human-level text-to-speech through style diffusion and adversarial training.

Try now

OpenVoice (Premium)

Instant voice cloning with granular control over style, emotion, and accent.

Try now

Sesame CSM (Premium)

Conversational speech model that produces natural dialogue with appropriate timing and context awareness.

Try now

MOSS-TTS (Premium)

Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.

Try now

MegaTTS3 (Premium)

ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.

Try now

CosyVoice 2

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Languages: en, zh, ja, ko, fr, de, it, es

Clone voice

GLM-TTS

Achieves the lowest character error rate among open-source TTS models.

Languages: en, zh

Clone voice

IndexTTS-2

Zero-shot TTS with fine-grained emotional control and high expressiveness.

Languages: en, zh

Clone voice

Spark TTS

Voice cloning TTS with controllable emotion and speaking style through prompts.

Languages: en, zh

Clone voice

GPT-SoVITS

Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.

Languages: en, zh, ja, ko

Clone voice

Chatterbox

State-of-the-art zero-shot voice cloning with emotion control from Resemble AI.

Languages: en

Clone voice

Tortoise TTS

Multi-voice text-to-speech focused on quality, built on an autoregressive architecture.

Languages: en

Clone voice

OpenVoice

Instant voice cloning with granular control over style, emotion, and accent.

Languages: en, zh, ja, ko, fr, de, es, it

Clone voice

Qwen3 TTS

Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.

Languages: en, zh, ja, ko, de, fr, ru, pt, es, it

Clone voice

Chatterbox Turbo

The fastest Chatterbox, with sub-200ms latency and paralinguistic tags for laughter, breathing, and more.

Languages: en

Clone voice

VoxCPM

Tokenizer-free TTS that produces 44.1kHz audio with context-aware paragraph consistency.

Languages: en, zh

Clone voice

OuteTTS

LLM-based TTS that runs on CPU, GPU, or in the browser via llama.cpp and Transformers.js.

Languages: en

Clone voice

Pocket TTS

Kyutai's lightweight 100M-parameter model with voice cloning from a single reference sample.

Languages: en, fr

Clone voice

CosyVoice3

Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.

Languages: en, zh, ja, ko, de, es, fr, it, ru

Clone voice

MOSS-TTS

Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.

Languages: en, zh, de, es, fr, ja, it, hu, ko, ru, fa, ar, pl, pt, cs, da, sv, el, tr

Clone voice

MegaTTS3

ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.

Languages: en, zh

Clone voice

Developer-First API

OpenAI-compatible REST API. One endpoint, 22+ models. Streaming support for real-time applications.

  • OpenAI-compatible format
  • Streaming TTS for real-time applications
  • Batch processing for large workloads
  • Webhook notifications
View API Docs
pip install ttsai
npm install @ttsainpm/ttsai
Python
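from tts_ai import TTSClient

# Authenticate with your TTS.ai API key
client = TTSClient(api_key="sk-tts-xxx")

# Synthesize speech with the Kokoro model and the "af_bella" voice
audio = client.generate(
    text="Hello from TTS.ai!",
    model="kokoro",
    voice="af_bella",
)

# Write the generated audio to disk
client.save(audio, "output.mp3")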
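
Because the API is described as OpenAI-compatible, a request can presumably be assembled without the SDK as well. The sketch below builds such a request with the standard library but does not send it. The host `api.example.com`, the `/v1/audio/speech` path, and the `input` and `response_format` field names are assumptions borrowed from OpenAI's speech endpoint, not confirmed TTS.ai details, so check the API docs before relying on them.

```python
import json
import urllib.request

# Assumption: placeholder host. The real base URL is listed in the TTS.ai API docs.
BASE_URL = "https://api.example.com"


def build_speech_request(api_key, text, model="kokoro", voice="af_bella",
                         response_format="mp3"):
    """Build (without sending) an OpenAI-style speech synthesis request."""
    payload = {
        "model": model,
        "input": text,  # OpenAI's speech endpoint names the text field "input"
        "voice": voice,
        "response_format": response_format,
    }
    return urllib.request.Request(
        url=BASE_URL + "/v1/audio/speech",  # assumed path, mirroring OpenAI
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": "Bearer " + api_key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_speech_request("sk-tts-xxx", "Hello from TTS.ai!")
print(request.get_method(), request.full_url)
```

Passing the returned object to `urllib.request.urlopen` would send it. That step is left out here because the endpoint above is only a placeholder.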

Simple, Transparent Pricing

Start free. Scale as you grow.

Free

$0

15,000 characters

  • Kokoro, Piper, VITS, MeloTTS
  • 500-character limit
  • 3 generations/hour (no account)
Sign up free
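
Longer text has to be cut into pieces that fit the 500-character limit before it can be submitted on the Free tier. The following is a minimal sketch of one way to do that, breaking on whitespace. The function name `split_text` is illustrative and is not part of any TTS.ai SDK.

```python
def split_text(text, limit=500):
    """Split text into chunks of at most `limit` characters, breaking on whitespace."""
    chunks = []
    current = ""
    for word in text.split():
        # A single word longer than the limit has to be hard-split.
        while len(word) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(word[:limit])
            word = word[limit:]
        candidate = word if not current else current + " " + word
        if len(candidate) <= limit:
            current = candidate
        else:
            # The word does not fit: close the current chunk and start a new one.
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks


chunks = split_text("word " * 300)
print(len(chunks), [len(chunk) for chunk in chunks])
```

Each chunk counts as one generation, so without an account the limit of 3 generations per hour also caps how much text can be processed this way.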

Starter

$9/month

500,000 characters/month

  • All 22+ models
  • 100,000 characters per generation
  • Voice cloning
Get Started
Most Popular

Pro

$29/month

2,000 credits/month

  • Everything in Starter
  • API access
  • Priority processing
Get Pro

Enterprise

$99/month

10,000 credits/month

  • Everything in Pro
  • Bulk API
  • Priority queue
Get Enterprise

View all plans, including character packs →

Frequently Asked Questions

TTS.ai is a complete AI audio platform offering 22+ models for text-to-speech, voice cloning, speech-to-text, and audio tools. All models are open source, with no vendor lock-in.

Yes! TTS.ai offers free text-to-speech with the Kokoro, Piper, VITS, and MeloTTS models. No account is required. Sign up to get 15,000 free characters and access to all models. Paid plans start at $9/month.

For speed, use Kokoro or Piper. For quality, try CosyVoice 2 or StyleTTS 2. For voice cloning, use Chatterbox or GPT-SoVITS. For dialogue, use Dia TTS. Try several models on the same text to compare them.
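
For scripted use, the recommendations in this answer can be captured in a small lookup table. This is only a sketch: the priority keys (`speed`, `quality`, and so on) are illustrative labels, and the model names are display names from this page rather than confirmed API identifiers.

```python
# Recommendations taken from the answer above; the keys are illustrative labels.
RECOMMENDED_MODELS = {
    "speed": ["Kokoro", "Piper"],
    "quality": ["CosyVoice 2", "StyleTTS 2"],
    "voice_cloning": ["Chatterbox", "GPT-SoVITS"],
    "dialogue": ["Dia TTS"],
}


def recommend(priority):
    """Return the models suggested for a given priority."""
    if priority not in RECOMMENDED_MODELS:
        known = ", ".join(sorted(RECOMMENDED_MODELS))
        raise ValueError("unknown priority %r; expected one of: %s" % (priority, known))
    return RECOMMENDED_MODELS[priority]


print(recommend("speed"))
```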

Yes. An OpenAI-compatible REST API covers TTS, STT, voice cloning, and audio tools. It is available on the Pro ($29/mo) and Enterprise ($99/mo) plans. See the documentation at tts.ai/api/.

Audio quality varies by model. Top-tier models such as CosyVoice 2, StyleTTS 2, and Chatterbox produce near-human-quality speech with natural intonation and emotion. Free models such as Kokoro provide good quality for most use cases.

TTS.ai supports 30+ languages across its model library. English has the broadest model support, but models such as CosyVoice 2 cover Chinese, Japanese, and Korean; GPT-SoVITS handles Chinese, Japanese, Korean, and English; and MeloTTS supports English, Spanish, French, Chinese, Japanese, and Korean.

Yes. All processing runs on our dedicated GPU servers. We do not store your input text or generated audio after delivery. Audio samples uploaded for cloning are used only for the current session and are not retained. We do not share your data with third parties or use it to train models.

Yes. All audio generated on TTS.ai can be used commercially, including in YouTube videos, podcasts, audiobooks, apps, reminders, and products. Our models are open source under permissive licenses (MIT, Apache 2.0). No royalties or attribution are required.

TTS.ai generates audio in WAV format by default for the highest quality. You can convert to MP3, FLAC, OGG, or M4A using our free audio converter tool. The API supports specifying your preferred output format directly in the request.
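
Since WAV is the default output, a downloaded file can be sanity-checked with Python's standard `wave` module before any conversion. The sketch below measures the duration of WAV data held in memory. The helper `make_silent_wav` and the 24000 Hz mono 16-bit settings exist only to produce test input and are assumptions, not documented TTS.ai output parameters.

```python
import io
import wave


def wav_duration_seconds(wav_bytes):
    """Return the duration in seconds of WAV data held in memory."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def make_silent_wav(seconds, sample_rate=24000):
    """Create silent mono 16-bit WAV data (stand-in for real generated audio)."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(1)
        wav_file.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(b"\x00\x00" * int(seconds * sample_rate))
    return buffer.getvalue()


print(wav_duration_seconds(make_silent_wav(2)))
```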

Upload a short audio sample (as little as 5 seconds) of the voice you want to clone, then type any text to generate speech in that voice. Models such as Chatterbox, GPT-SoVITS, and CosyVoice 2 support voice cloning. The cloned voice captures the timbre, accent, and speaking style of the original.

Free models (Kokoro, Piper, VITS, MeloTTS) require no account and are billed per character at the base rate. Standard models (2,000 credits per 1K characters) include Bark, CosyVoice 2, F5-TTS, and Dia. Premium models (4,000 credits per 1K characters) include OpenVoice, Chatterbox, StyleTTS 2, and Tortoise. Paid models generally offer higher quality, more voices, and extra features such as voice cloning.

Yes. The API supports batch processing for converting large volumes of text to speech. Submit multiple requests and retrieve the results asynchronously using job UUIDs. Enterprise plans ($99/mo) include priority queue access for faster batch processing. It is well suited to audiobook production, program content, and large audio projects.
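
The submit-then-retrieve pattern described here can be sketched as a polling loop keyed by job UUID. Everything in the example is hypothetical: `FakeBatchService` is an in-memory stand-in, and its `submit` and `status` methods and the `state` and `audio` fields are invented for illustration rather than taken from the TTS.ai API.

```python
import uuid


class FakeBatchService:
    """In-memory stand-in for a batch TTS API (hypothetical, not the real client)."""

    def __init__(self):
        self._jobs = {}

    def submit(self, text):
        job_id = str(uuid.uuid4())
        # Pretend every job needs two status checks before it completes.
        self._jobs[job_id] = {"text": text, "polls_left": 2}
        return job_id

    def status(self, job_id):
        job = self._jobs[job_id]
        if job["polls_left"] > 0:
            job["polls_left"] -= 1
            return {"state": "pending"}
        return {"state": "done", "audio": ("audio for: " + job["text"]).encode("utf-8")}


def run_batch(service, texts, max_polls=10):
    """Submit all texts, then poll until every job is done or max_polls is reached."""
    job_ids = [service.submit(text) for text in texts]
    results = {}
    for _ in range(max_polls):
        for job_id in job_ids:
            if job_id in results:
                continue
            reply = service.status(job_id)
            if reply["state"] == "done":
                results[job_id] = reply["audio"]
        if len(results) == len(job_ids):
            break
        # A real client would sleep here between polling rounds.
    # Results come back in submission order; unfinished jobs yield None.
    return [results.get(job_id) for job_id in job_ids]


print(run_batch(FakeBatchService(), ["Chapter one.", "Chapter two."]))
```

With webhook notifications, which the API feature list also mentions, the polling loop would be replaced by a callback handler.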

Start Using AI Voice Today

Join the creators, developers, and companies using TTS.ai