Free AI Text to SpeechName
31+ open-source mamodheru, 231+ mashoko, 34+ No account required.
Zvese zvaunoda kuti uzive nezve Voice AI
30 + maturusi anotsigirwa neopen-source AI mamodheru
31+ AI Voice Models
The most kunyatsojeka unganidzwa we open-source TTS mamodheru muimwe platform
Kokoro Free
Kokoro imhando yemutauro unoshandura maparameter makumi maviri nemana ezviuru kuti uve mashoko, uye inowana kubudirira kwakanyanya kupfuura mamwe mapurojekiti emhando iyi. Pasinei nechidiki chayo, Kokoro inoshandura maparameter makumi maviri nemana ezviuru kuti ive mashoko, uye inogadzira mashoko anotaura zvakajeka. Kokoro inotsigira mitauro mizhinji, kusanganisira Chirungu, ChiJapanese, ChiChinese, neChiKorean, pamwe nemhando dzakasiyana dzemazwi anotaura.
Yakanaka kune: Yakakwira-mhando TTS neyakaderera latency, streaming applications
Kuedza kwemahara
Piper Free
Piper idiki, yakajeka, uye yakajeka-kutaura injini yakagadzirwa neRhasspy, iyo inoshandisa VITS uye larynx architectures. Inoshanda zvachose paCPU, ichiita kuti ive yakanaka kune edge devices, home automation, uye maapplication anoda offline TTS. Nekusvika pa100 mazwi mu30+ matauro, Piper inopa zvakajairika zvinonzwa kutaura panguva chaiyo, kunyange paRaspberry Pi 4.
Yakanaka kune: Zvimwe zvinongedzo zvezvirongwa, kuwanikwa, uye zvinongedzo zvezvirongwa
Kuedza kwemahara
VITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) imwecheteyo nzira yeTTS inoita kuti mashoko aite seasiri kunyorwa, asi asiri kunyorwa. Inoshandisa variational inference pamwe nekushandura ma flows kuita zvakajairika uye nekuita ma training processes asingatarisirwi, izvo zvinopa mhedzisiro yakanaka mukutaura.
Yakanaka kune: General-purpose text-to-speech with natural prosody
Kuedza kwemahara
MeloTTS Free
MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.
Yakanaka kune: Production maapplication anoda nekukurumidza, multilingual TTS
Kuedza kwemahara
OuteTTS Free
OuteTTS inowedzera mamodheru emhando yepamusoro dzemitauro nemhando dzakasiyana dzemabasa ekuti mashoko aite sei, uye ichichengeta hunhu hwayo hwekutanga. Inotsigira akawanda mabackends, kusanganisira llama.cpp (CPU/GPU), Hugging Face Transformers, ExLlamaV2, VLLM, uyewo browser inference kuburikidza neTransformers.js. Inosanganisira zero-shot voice cloning kuburikidza ne speaker profiles dzakachengetwa seJSON.
Yakanaka kune: Edge kumisikidza, browser-based TTS, low-resource mamiriro
Kuedza kwemahara
Pocket TTS Free
Pocket TTS by Kyutai (vagadziri veMoshi) ndeimwe compact 100M parameter text-to-speech model iyo inotamba zvakanaka kupfuura muviri wayo. Inoshanda zvakaomarara paCPU, inotsigira zero-shot voice cloning kubva kune imwe audio sample, uye inogadzira mashoko anonzwa sezvinoita muviri.
Yakanaka kune: Lightweight kugoverwa, CPU-only mamiriro, nyore kufona cloning
Kuedza kwemahara
Kitten TTS Free
Kitten TTS by KittenML is an ultra-lightweight text-to-speech model built on ONNX. With variants from 15M to 80M parameters (25-80 MB on disk), it delivers high-quality voice synthesis on CPU without requiring a GPU. Features 8 built-in voices, adjustable speech speed, and built-in text preprocessing for numbers, currencies, and units. Ideal for edge deployment and low-latency applications.
Yakanaka kune: Fast lightweight TTS, edge deployment, low-latency applications
Kuedza kwemahara
Bark Standard
Transformer-based text-to-audio model iyo inogadzira yakasarudzika mashoko, mimhanzi, uye mhedzisiro yezwi.
Developer: Suno · License: MIT
Tarisa
Bark Small Standard
Lighter vhezheni yeBark nekurumbidza inference uye pasi memory usage.
Developer: Suno · License: MIT
Tarisa
CosyVoice 2 Standard
Alibaba's scalable streaming TTS ine hunhu hwemunhu-parity uye latency yakati rebei.
Developer: Alibaba (Tongyi Lab) · License: Apache 2.0
Tarisa
Dia TTS Standard
Multi-mutaura dialog generation model iyo inogadzira zvakajairika mashoko pakati pemutaura.
Developer: Nari Labs · License: Apache 2.0
Tarisa
Parler TTS Standard
Kutaura mashoko aunoda mutauro wakanaka uye Parler ichagadzira mazita anoenderana.
Developer: Hugging Face · License: Apache 2.0
Tarisa
GLM-TTS Standard
Achieve the lowest character error rate among open-source TTS models.
Developer: Zhipu AI · License: GLM-4 License
Tarisa
IndexTTS-2 Standard
Zero-shot TTS neyakaomeswa kudzora kwepfungwa uye yakakura kuratidzika.
Developer: Index Team · License: Bilibili Model License
Tarisa
Spark TTS Standard
Voice cloning TTS ne controllable emotion uye kutaura pfungwa kuburikidza nemibvunzo.
Developer: SparkAudio · License: CC BY-NC-SA 4.0
Tarisa
GPT-SoVITS Standard
Few-shot voice cloning TTS iyo inoshandura chero mashoko kubva chete 5 masekondi eaudio.
Developer: RVC-Boss · License: MIT
Tarisa
Orpheus Standard
Human-level emotional TTS model yakadzidziswa pa 100K mazuva emashoko data.
Developer: Canopy Labs · License: Llama 3.2 Community
Tarisa
Qwen3 TTS Standard
Alibaba's multilingual TTS nezwi cloning, preset mashoko, uye mashoko dhizaini kubva muchinyorwa.
Developer: Alibaba (Qwen) · License: Apache 2.0
Tarisa
Chatterbox Turbo Standard
Faster Chatterbox nesub-200ms latency uye paralinguistic tags for laughs, kuora mwoyo, uye zvakawanda.
Developer: Resemble AI · License: MIT
Tarisa
Dia 2 Standard
Streaming-kutanga conversational TTS nemulti-mutaura musangano uye paralinguistic zviratidzo.
Developer: Nari Labs · License: Apache 2.0
Tarisa
VoxCPM Standard
Tokenizer-free TTS inogadzira 44.1kHz audio ne context-aware paragraph consistency.
Developer: OpenBMB · License: Apache 2.0
Tarisa
TADA Standard
Zero-hallucination TTS netext-acoustic dual alignment, 5x nekukurumidza kupfuura zvakaenzana LLM TTS.
Developer: Hume AI · License: MIT
Tarisa
VibeVoice Standard
Microsoft model for long-form multi-speaker content like podcasts and audiobooks.
Developer: Microsoft · License: MIT
Tarisa
CosyVoice3 Standard
Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.
Developer: Alibaba (FunAudioLLM) · License: Apache 2.0
Tarisa
CosyVoice 2
Alibaba's scalable streaming TTS ine hunhu hwemunhu-parity uye latency yakati rebei.
Matauro: en, zh, ja, ko, fr, de, it, es
Clone Voice
GLM-TTS
Achieve the lowest character error rate among open-source TTS models.
Matauro: en, zh
Clone Voice
IndexTTS-2
Zero-shot TTS neyakaomeswa kudzora kwepfungwa uye yakakura kuratidzika.
Matauro: en, zh
Clone Voice
Spark TTS
Voice cloning TTS ne controllable emotion uye kutaura pfungwa kuburikidza nemibvunzo.
Matauro: en, zh
Clone Voice
GPT-SoVITS
Few-shot voice cloning TTS iyo inoshandura chero mashoko kubva chete 5 masekondi eaudio.
Matauro: en, zh, ja, ko
Clone Voice
Chatterbox
State-of-the-art zero-shot voice cloning nepfungwa kudzora kubva Resemble AI.
Matauro: en
Clone Voice
Tortoise TTS
Multi-voice text-to-speech yakatarisana nemhando neautoregressive architecture.
Matauro: en
Clone Voice
OpenVoice
Instant voice cloning ne granular kudzora pamusoro style, emotions, uye accent.
Matauro: en, zh, ja, ko, fr, de, es, it
Clone Voice
Qwen3 TTS
Alibaba's multilingual TTS nezwi cloning, preset mashoko, uye mashoko dhizaini kubva muchinyorwa.
Matauro: en, zh, ja, ko, de, fr, ru, pt, es, it
Clone Voice
Chatterbox Turbo
Faster Chatterbox nesub-200ms latency uye paralinguistic tags for laughs, kuora mwoyo, uye zvakawanda.
Matauro: en
Clone Voice
VoxCPM
Tokenizer-free TTS inogadzira 44.1kHz audio ne context-aware paragraph consistency.
Matauro: en, zh
Clone Voice
OuteTTS
LLM-based TTS iyo inofamba pa CPU, GPU, kana browser kuburikidza llama.cpp uye Transformers.js.
Matauro: en
Clone Voice
Pocket TTS
Lightweight 100M parameter model by Kyutai ne voice cloning kubva kune imwe sample.
Matauro: en, fr
Clone Voice
CosyVoice3
Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.
Matauro: en, zh, ja, ko, de, es, fr, it, ru
Clone Voice
MOSS-TTS
Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.
Matauro: en, zh, de, es, fr, ja, it, hu, ko, ru, fa, ar, pl, pt, cs, da, sv, el, tr
Clone Voice
MegaTTS3
ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.
Matauro: en, zh
Clone VoiceDeveloper-First API
OpenAI-inowirirana REST API. One endpoint, 22+ mamodheru. Streaming rutsigiro rwe real-time applications.
- OpenAI-inowirirana fomati
- Streaming TTS for real-time apps
- Batch processing for large jobs
- Webhook notifications
pip install ttsai
npm install @ttsainpm/ttsai
from tts_ai import TTSClient
client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
text="Hello from TTS.ai!",
model="kokoro",
voice="af_bella",
)
client.save(audio, "output.mp3")
Simple, Transparent Pricing
Kutanga zvakasununguka. Scale sezvauri kukura.
Vakasununguka
15,000 characters
- Kokoro, Piper, VITS, MeloTTS
- 500 characters limit
- 3 gen / mwedzi (sina akaunti)
Starter
500 zvikwereti / mwedzi
- All 22+ mamodheru
- 100,000 chars per generation
- Voice Cloning
Pro
2,000,000 characters/mwedzi
- Zvese muStarter
- API kuwanikwa
- Priority processing
Mibvunzo Inobvunzwa Kazhinji
Chii chingatibatsira kuti tiite zvakanaka? Ruzivo rwako runogona kutibatsira kugadzirisa matambudziko.
Kutanga kushandisa AI Voice Today
Joina vagadziri, vagadziri, uye makambani shandisa TTS.ai