Free AI Text to SpeechName

31+ open-source mamodheru, 231+ mashoko, 34+ No account required.

8K+
Vagadziri
31K+
generations
31+
AI mamodeli
231+
mazwi
0/500 mavara · Sign up for 5,000 per generation → Vakasununguka
Love TTS.ai? Tiudza shamwari dzako!

Zvese zvaunoda kuti uzive nezve Voice AI

30 + maturusi anotsigirwa neopen-source AI mamodheru

31+ AI Voice Models

The most kunyatsojeka unganidzwa we open-source TTS mamodheru muimwe platform

KokoroKokoro Free

Kokoro imhando yemutauro unoshandura maparameter makumi maviri nemana ezviuru kuti uve mashoko, uye inowana kubudirira kwakanyanya kupfuura mamwe mapurojekiti emhando iyi. Pasinei nechidiki chayo, Kokoro inoshandura maparameter makumi maviri nemana ezviuru kuti ive mashoko, uye inogadzira mashoko anotaura zvakajeka. Kokoro inotsigira mitauro mizhinji, kusanganisira Chirungu, ChiJapanese, ChiChinese, neChiKorean, pamwe nemhando dzakasiyana dzemazwi anotaura.

Yakanaka kune: Yakakwira-mhando TTS neyakaderera latency, streaming applications

Kuedza kwemahara

PiperPiper Free

Piper idiki, yakajeka, uye yakajeka-kutaura injini yakagadzirwa neRhasspy, iyo inoshandisa VITS uye larynx architectures. Inoshanda zvachose paCPU, ichiita kuti ive yakanaka kune edge devices, home automation, uye maapplication anoda offline TTS. Nekusvika pa100 mazwi mu30+ matauro, Piper inopa zvakajairika zvinonzwa kutaura panguva chaiyo, kunyange paRaspberry Pi 4.

Yakanaka kune: Zvimwe zvinongedzo zvezvirongwa, kuwanikwa, uye zvinongedzo zvezvirongwa

Kuedza kwemahara

VITSVITS Free

VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) imwecheteyo nzira yeTTS inoita kuti mashoko aite seasiri kunyorwa, asi asiri kunyorwa. Inoshandisa variational inference pamwe nekushandura ma flows kuita zvakajairika uye nekuita ma training processes asingatarisirwi, izvo zvinopa mhedzisiro yakanaka mukutaura.

Yakanaka kune: General-purpose text-to-speech with natural prosody

Kuedza kwemahara

MeloTTSMeloTTS Free

MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.

Yakanaka kune: Production maapplication anoda nekukurumidza, multilingual TTS

Kuedza kwemahara

OuteTTSOuteTTS Free

OuteTTS inowedzera mamodheru emhando yepamusoro dzemitauro nemhando dzakasiyana dzemabasa ekuti mashoko aite sei, uye ichichengeta hunhu hwayo hwekutanga. Inotsigira akawanda mabackends, kusanganisira llama.cpp (CPU/GPU), Hugging Face Transformers, ExLlamaV2, VLLM, uyewo browser inference kuburikidza neTransformers.js. Inosanganisira zero-shot voice cloning kuburikidza ne speaker profiles dzakachengetwa seJSON.

Yakanaka kune: Edge kumisikidza, browser-based TTS, low-resource mamiriro

Kuedza kwemahara

Pocket TTSPocket TTS Free

Pocket TTS by Kyutai (vagadziri veMoshi) ndeimwe compact 100M parameter text-to-speech model iyo inotamba zvakanaka kupfuura muviri wayo. Inoshanda zvakaomarara paCPU, inotsigira zero-shot voice cloning kubva kune imwe audio sample, uye inogadzira mashoko anonzwa sezvinoita muviri.

Yakanaka kune: Lightweight kugoverwa, CPU-only mamiriro, nyore kufona cloning

Kuedza kwemahara

Kitten TTSKitten TTS Free

Kitten TTS by KittenML is an ultra-lightweight text-to-speech model built on ONNX. With variants from 15M to 80M parameters (25-80 MB on disk), it delivers high-quality voice synthesis on CPU without requiring a GPU. Features 8 built-in voices, adjustable speech speed, and built-in text preprocessing for numbers, currencies, and units. Ideal for edge deployment and low-latency applications.

Yakanaka kune: Fast lightweight TTS, edge deployment, low-latency applications

Kuedza kwemahara

BarkBark Standard

Transformer-based text-to-audio model iyo inogadzira yakasarudzika mashoko, mimhanzi, uye mhedzisiro yezwi.

Developer: Suno · License: MIT

Tarisa

Bark SmallBark Small Standard

Lighter vhezheni yeBark nekurumbidza inference uye pasi memory usage.

Developer: Suno · License: MIT

Tarisa

CosyVoice 2CosyVoice 2 Standard

Alibaba's scalable streaming TTS ine hunhu hwemunhu-parity uye latency yakati rebei.

Developer: Alibaba (Tongyi Lab) · License: Apache 2.0

Tarisa

Dia TTSDia TTS Standard

Multi-mutaura dialog generation model iyo inogadzira zvakajairika mashoko pakati pemutaura.

Developer: Nari Labs · License: Apache 2.0

Tarisa

Parler TTSParler TTS Standard

Kutaura mashoko aunoda mutauro wakanaka uye Parler ichagadzira mazita anoenderana.

Developer: Hugging Face · License: Apache 2.0

Tarisa

GLM-TTSGLM-TTS Standard

Achieve the lowest character error rate among open-source TTS models.

Developer: Zhipu AI · License: GLM-4 License

Tarisa

IndexTTS-2IndexTTS-2 Standard

Zero-shot TTS neyakaomeswa kudzora kwepfungwa uye yakakura kuratidzika.

Developer: Index Team · License: Bilibili Model License

Tarisa

Spark TTSSpark TTS Standard

Voice cloning TTS ne controllable emotion uye kutaura pfungwa kuburikidza nemibvunzo.

Developer: SparkAudio · License: CC BY-NC-SA 4.0

Tarisa

GPT-SoVITSGPT-SoVITS Standard

Few-shot voice cloning TTS iyo inoshandura chero mashoko kubva chete 5 masekondi eaudio.

Developer: RVC-Boss · License: MIT

Tarisa

OrpheusOrpheus Standard

Human-level emotional TTS model yakadzidziswa pa 100K mazuva emashoko data.

Developer: Canopy Labs · License: Llama 3.2 Community

Tarisa

Qwen3 TTSQwen3 TTS Standard

Alibaba's multilingual TTS nezwi cloning, preset mashoko, uye mashoko dhizaini kubva muchinyorwa.

Developer: Alibaba (Qwen) · License: Apache 2.0

Tarisa

Chatterbox TurboChatterbox Turbo Standard

Faster Chatterbox nesub-200ms latency uye paralinguistic tags for laughs, kuora mwoyo, uye zvakawanda.

Developer: Resemble AI · License: MIT

Tarisa

Dia 2Dia 2 Standard

Streaming-kutanga conversational TTS nemulti-mutaura musangano uye paralinguistic zviratidzo.

Developer: Nari Labs · License: Apache 2.0

Tarisa

VoxCPMVoxCPM Standard

Tokenizer-free TTS inogadzira 44.1kHz audio ne context-aware paragraph consistency.

Developer: OpenBMB · License: Apache 2.0

Tarisa

TADATADA Standard

Zero-hallucination TTS netext-acoustic dual alignment, 5x nekukurumidza kupfuura zvakaenzana LLM TTS.

Developer: Hume AI · License: MIT

Tarisa

VibeVoiceVibeVoice Standard

Microsoft model for long-form multi-speaker content like podcasts and audiobooks.

Developer: Microsoft · License: MIT

Tarisa

CosyVoice3CosyVoice3 Standard

Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.

Developer: Alibaba (FunAudioLLM) · License: Apache 2.0

Tarisa

ChatterboxChatterbox Premium

State-of-the-art zero-shot voice cloning nepfungwa kudzora kubva Resemble AI.

Quality:

Tarisa

Tortoise TTSTortoise TTS Premium

Multi-voice text-to-speech yakatarisana nemhando neautoregressive architecture.

Quality:

Tarisa

StyleTTS 2StyleTTS 2 Premium

Human-level text-to-speech kuburikidza style diffusion uye oponetsa kudzidziswa.

Quality:

Tarisa

OpenVoiceOpenVoice Premium

Instant voice cloning ne granular kudzora pamusoro style, emotions, uye accent.

Quality:

Tarisa

Sesame CSMSesame CSM Premium

Conversational mashoko model kuumba zvakatipoteredza musangano nenguva yakakodzera uye emotions.

Quality:

Tarisa

MOSS-TTSMOSS-TTS Premium

Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.

Quality:

Tarisa

MegaTTS3MegaTTS3 Premium

ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.

Quality:

Tarisa

CosyVoice 2CosyVoice 2

Alibaba's scalable streaming TTS ine hunhu hwemunhu-parity uye latency yakati rebei.

Matauro: en, zh, ja, ko, fr, de, it, es

Clone Voice

GLM-TTSGLM-TTS

Achieve the lowest character error rate among open-source TTS models.

Matauro: en, zh

Clone Voice

IndexTTS-2IndexTTS-2

Zero-shot TTS neyakaomeswa kudzora kwepfungwa uye yakakura kuratidzika.

Matauro: en, zh

Clone Voice

Spark TTSSpark TTS

Voice cloning TTS ne controllable emotion uye kutaura pfungwa kuburikidza nemibvunzo.

Matauro: en, zh

Clone Voice

GPT-SoVITSGPT-SoVITS

Few-shot voice cloning TTS iyo inoshandura chero mashoko kubva chete 5 masekondi eaudio.

Matauro: en, zh, ja, ko

Clone Voice

ChatterboxChatterbox

State-of-the-art zero-shot voice cloning nepfungwa kudzora kubva Resemble AI.

Matauro: en

Clone Voice

Tortoise TTSTortoise TTS

Multi-voice text-to-speech yakatarisana nemhando neautoregressive architecture.

Matauro: en

Clone Voice

OpenVoiceOpenVoice

Instant voice cloning ne granular kudzora pamusoro style, emotions, uye accent.

Matauro: en, zh, ja, ko, fr, de, es, it

Clone Voice

Qwen3 TTSQwen3 TTS

Alibaba's multilingual TTS nezwi cloning, preset mashoko, uye mashoko dhizaini kubva muchinyorwa.

Matauro: en, zh, ja, ko, de, fr, ru, pt, es, it

Clone Voice

Chatterbox TurboChatterbox Turbo

Faster Chatterbox nesub-200ms latency uye paralinguistic tags for laughs, kuora mwoyo, uye zvakawanda.

Matauro: en

Clone Voice

VoxCPMVoxCPM

Tokenizer-free TTS inogadzira 44.1kHz audio ne context-aware paragraph consistency.

Matauro: en, zh

Clone Voice

OuteTTSOuteTTS

LLM-based TTS iyo inofamba pa CPU, GPU, kana browser kuburikidza llama.cpp uye Transformers.js.

Matauro: en

Clone Voice

Pocket TTSPocket TTS

Lightweight 100M parameter model by Kyutai ne voice cloning kubva kune imwe sample.

Matauro: en, fr

Clone Voice

CosyVoice3CosyVoice3

Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.

Matauro: en, zh, ja, ko, de, es, fr, it, ru

Clone Voice

MOSS-TTSMOSS-TTS

Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.

Matauro: en, zh, de, es, fr, ja, it, hu, ko, ru, fa, ar, pl, pt, cs, da, sv, el, tr

Clone Voice

MegaTTS3MegaTTS3

ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.

Matauro: en, zh

Clone Voice

Developer-First API

OpenAI-inowirirana REST API. One endpoint, 22+ mamodheru. Streaming rutsigiro rwe real-time applications.

  • OpenAI-inowirirana fomati
  • Streaming TTS for real-time apps
  • Batch processing for large jobs
  • Webhook notifications
View API Docs
pip install ttsai npm install @ttsainpm/ttsai
Python
from tts_ai import TTSClient

client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
    text="Hello from TTS.ai!",
    model="kokoro",
    voice="af_bella",
)
client.save(audio, "output.mp3")

Simple, Transparent Pricing

Kutanga zvakasununguka. Scale sezvauri kukura.

Vakasununguka

$0

15,000 characters

  • Kokoro, Piper, VITS, MeloTTS
  • 500 characters limit
  • 3 gen / mwedzi (sina akaunti)
Sign Up Free

Starter

$9/mwedzi

500 zvikwereti / mwedzi

  • All 22+ mamodheru
  • 100,000 chars per generation
  • Voice Cloning
Kutanga
Yakanyanya Kuzivikanwa

Pro

$29/mwedzi

2,000,000 characters/mwedzi

  • Zvese muStarter
  • API kuwanikwa
  • Priority processing
Get Pro

Business

$99/mwedzi

10,000,000 characters/mwedzi

  • Zvese muPro
  • Bulk API
  • Priority queue
Get Business

Ona zvese zvirongwa kusanganisira mapakeji ezvinyorwa →

Mibvunzo Inobvunzwa Kazhinji

TTS.ai ndiyo yakanyanya kusanganisira AI voice platform, ichipa 22+ mamodheru emashoko-ku-mutauro, kudzokorora kwemashoko, kutaura-ku-mutauro, uye audio tools.All mamodheru anowanikwa pasina mutengesi we lock-in.

Yes! TTS.ai inopa yemahara yekunyora-ku-kutaura neKokoro, Piper, VITS, uye MeloTTS models. No account required. Sign up to get 15,000 free characters and access all models. Paid plans start at $9/month.

Kuti uite zviri nyore, shandisa Kokoro kana Piper. Kuti uwane kunaka, edza CosyVoice 2 kana StyleTTS 2. Kuti uite mashoko, shandisa Chatterbox kana GPT-SoVITS. Kuti uite mashoko, shandisa Dia TTS. edza mamodheru akasiyana pazita rimwe chete kuti uzvienzanise.

OpenAI-inowirirana REST API ye TTS, STT, kudhonza mashoko, uye audio zvinhu. Available on Pro ($29/mo) uye Enterprise ($99/mo) zvirongwa. Ona zvinyorwa pa tts.ai/api/.

Zvigadzirwa zvemhando yepamusoro seCosyVoice 2, StyleTTS 2, uye Chatterbox zvinopa mashoko akafanana neanotaurwa nemunhu, ane intonation uye emotions dzakajairika. Zvigadzirwa zvemhando yepamusoro seKokoro zvinopa mashoko emhando yepamusoro mumamiriro akawanda ekushandisa.

TTS.ai inotsigira 30+ matauro pasi pebhuku rayo remufananidzo. Chirungu chine kutsigira kwemufananidzo kwakawanda, asi mamodheru seCosyVoice 2 anobata ChiChinese, ChiJapanese, neChiKorean; GPT-SoVITS anobata ChiChinese, ChiJapanese, ChiKorean, neChirungu; uye MeloTTS anotsigira ChiSpanish, ChiFrench, ChiChinese, ChiJapanese, neChiKorean.

Yeah. All processing happens on our dedicated GPU servers. We don't store your text input or generated audio after delivery. Uploaded voice samples for cloning are used only for the current session and aren't retained. We never share your data with third parties or use it to train models.

Yeah. All audio yakagadzirwa pa TTS.ai ndeyako kuti uishandisa zvekutengesa, kusanganisira YouTube mavhidhiyo, podcasts, audiobooks, apps, matangazo, uye zvigadzirwa. Mamodeli edu anowanikwa pasi pezvibvumirano zvemutemo (MIT, Apache 2.0).

TTS.ai inogadzira vhidhiyo muWAV format ne default kuti uwane vhidhiyo ine yepamusoro mhando. Iwe unogona kushandura vhidhiyo yako kuita MP3, FLAC, OGG, kana M4A nekushandisa yedu yemahara Audio Converter tool. The API inobatsira kuisa yako yaunofarira vhidhiyo format mubvunzo.

Kuisa vhidhiyo yezwi rako.

Zvimwe zvirongwa zvinoda kuti munhu atenge account uye zvinoda kuti munhu abhadhare kuti akwanise kushandisa zvirongwa izvi. Zvirongwa zvemahara (Kokoro, Piper, VITS, MeloTTS) hazvidi kubhadhara mari yekushandisa uye zvinoda kuti munhu abhadhare kuti akwanise kushandisa zvirongwa izvi. Zvirongwa zvepakutanga (2,000 characters/1K input) zvinoti Bark, CosyVoice 2, F5-TTS, uye Dia. Zvirongwa zvepakutanga (4,000 characters/1K input) zvinoti OpenVoice, Chatterbox, StyleTTS 2, uye Tortoise.

Yes. The API supports batch processing for converting large volumes of text to speech. Send multiple requests and retrieve results asynchronously using job UUIDs. Enterprise plans ($99/mo) include priority queue access for faster batch processing. Ideal for audiobook production, course content, and large-scale voiceover projects.
4.1/5 (21)

Chii chingatibatsira kuti tiite zvakanaka? Ruzivo rwako runogona kutibatsira kugadzirisa matambudziko.

Kutanga kushandisa AI Voice Today

Joina vagadziri, vagadziri, uye makambani shandisa TTS.ai