Free AI Text to Speech

31+ open-source models, 231+ voices, 34+ languages. No account required.

8K+ creators
31K+ generations
31+ AI models
231+ voices
0/500 characters · Sign up for 5,000 per generation → Free

31+ AI Voice Models

The most comprehensive collection of open-source TTS models on a single platform

Kokoro · Free

Kokoro is an 82-million-parameter text-to-speech model that punches well above its weight class. Despite its small size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages, including English, Japanese, Chinese, and Korean, with a variety of expressive voices. It is also very fast, generating audio nearly 100x faster than real time on a GPU.

Best for: High-quality TTS with minimal latency, streaming applications

Try it free
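
The "nearly 100x faster than real time" figure for Kokoro can be read as a speed-up ratio: seconds of audio produced per second of compute. The sketch below shows only that arithmetic. The function name and the sample timings are illustrative assumptions, not measurements from TTS.ai or Kokoro.

```python
def speedup_over_real_time(audio_seconds: float, compute_seconds: float) -> float:
    """Return how many seconds of audio are produced per second of compute."""
    if compute_seconds <= 0:
        raise ValueError("compute_seconds must be positive")
    return audio_seconds / compute_seconds


# Hypothetical timing: 60 s of speech synthesized in 0.6 s of GPU time.
ratio = speedup_over_real_time(audio_seconds=60.0, compute_seconds=0.6)
print(f"{ratio:.0f}x faster than real time")
```

A ratio above 1 means the model keeps ahead of playback, which is the property streaming applications depend on.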

Piper · Free

Piper is a lightweight text-to-speech engine developed by Rhasspy that uses the VITS and Larynx architectures. It runs entirely on CPU, which makes it ideal for edge devices, home automation, and applications that need offline TTS. With over 100 voices in more than 30 languages, Piper delivers natural-sounding speech at real-time speed even on a Raspberry Pi 4.

Best for: Rapid previews, accessibility, and embedded applications

Try it free

VITS · Free

VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural-sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, yielding a significant improvement in naturalness.

Best for: General-purpose text-to-speech with natural prosody

Try it free

MeloTTS · Free

MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.

Best for: Production applications that need fast, multilingual TTS

Try it free

OuteTTS · Free

OuteTTS extends large language models with text-to-speech capabilities while preserving the original architecture. It supports multiple backends, including llama.cpp (CPU/GPU), Hugging Face Transformers, ExLlamaV2, vLLM, and even in-browser inference via Transformers.js. It features zero-shot voice cloning through speaker profiles saved as JSON.

Best for: Edge deployment, browser-based TTS, low-resource environments

Try it free

Pocket TTS · Free

Pocket TTS by Kyutai (creators of Moshi) is a 100M-parameter text-to-speech model that punches well above its weight. It runs efficiently on CPU, supports zero-shot voice cloning from a single audio sample, and produces natural-sounding speech. The model's small size makes it ideal for edge deployment and low-resource environments.

Best for: Lightweight deployment, CPU-only environments, rapid voice cloning

Try it free

Kitten TTS · Free

Kitten TTS by KittenML is an ultra-lightweight text-to-speech model built on ONNX. With variants from 15M to 80M parameters (25-80 MB on disk), it delivers high-quality voice synthesis on CPU without requiring a GPU. Features 8 built-in voices, adjustable speech speed, and built-in text preprocessing for numbers, currencies, and units. Ideal for edge deployment and low-latency applications.

Best for: Fast lightweight TTS, edge deployment, low-latency applications

Try it free

Bark · Standard

Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.

Developer: Suno · License: MIT

Check it out

Bark Small · Standard

A lighter version of Bark with faster inference and lower memory usage.

Developer: Suno · License: MIT

Check it out

CosyVoice 2 · Standard

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Developer: Alibaba (Tongyi Lab) · License: Apache 2.0

Check it out

Dia TTS · Standard

Multi-speaker dialogue generation model that creates natural conversations between speakers.

Developer: Nari Labs · License: Apache 2.0

Check it out

Parler TTS · Standard

Describe the voice you want in natural language and Parler generates the matching speech.

Developer: Hugging Face · License: Apache 2.0

Check it out

GLM-TTS · Standard

Lowest character error rate among open-source TTS models.

Developer: Zhipu AI · License: GLM-4 License

Check it out

IndexTTS-2 · Standard

Zero-shot TTS with fine-grained emotion control and high expressiveness.

Developer: Index Team · License: Bilibili Model License

Check it out

Spark TTS · Standard

Voice-cloning TTS with controllable emotion and speaking style via prompts.

Developer: SparkAudio · License: CC BY-NC-SA 4.0

Check it out

GPT-SoVITS · Standard

Few-shot voice-cloning TTS that replicates any voice from just 5 seconds of audio.

Developer: RVC-Boss · License: MIT

Check it out

Orpheus · Standard

Human-level emotional TTS model trained on 100,000 hours of speech data.

Developer: Canopy Labs · License: Llama 3.2 Community

Check it out

Qwen3 TTS · Standard

Alibaba's multilingual TTS with voice cloning, voice presets, and voice design from text.

Developer: Alibaba (Qwen) · License: Apache 2.0

Check it out

Chatterbox Turbo · Standard

A faster Chatterbox with sub-200 ms latency and paralinguistic tags for laughter, coughing, and more.

Developer: Resemble AI · License: MIT

Check it out

Dia 2 · Standard

Streaming conversational TTS with multi-speaker dialogue and paralinguistic cues.

Developer: Nari Labs · License: Apache 2.0

Check it out

VoxCPM · Standard

Tokenizer-free TTS that produces 44.1 kHz audio with context-aware paragraph consistency.

Developer: OpenBMB · License: Apache 2.0

Check it out

TADA · Standard

Zero-hallucination TTS with dual text-acoustic alignment, 5x faster than comparable LLM-based TTS.

Developer: Hume AI · License: MIT

Check it out

VibeVoice · Standard

Microsoft's model for long-form, multi-speaker content such as podcasts and audiobooks.

Developer: Microsoft · License: MIT

Check it out

CosyVoice3 · Standard

Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.

Developer: Alibaba (FunAudioLLM) · License: Apache 2.0

Check it out

Chatterbox · Premium

Voice cloning with emotion control, from Resemble AI.

Check it out

Tortoise TTS · Premium

Multi-voice TTS system that prioritizes realistic prosody and output quality over speed.

Check it out

StyleTTS 2 · Premium

Human-level TTS built on style diffusion and adversarial training with large speech language models.

Check it out

OpenVoice · Premium

Instant voice cloning with granular control over style, emotion, and accent.

Check it out

Sesame CSM · Premium

Conversational speech model that creates natural dialogue with appropriate timing and emotion.

Check it out

MOSS-TTS · Premium

Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.

Check it out

MegaTTS3 · Premium

ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.

Check it out

CosyVoice 2

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Languages: en, zh, ja, ko, fr, de, it, es

Clone voice

GLM-TTS

Lowest character error rate among open-source TTS models.

Languages: en, zh

Clone voice

IndexTTS-2

Zero-shot TTS with fine-grained emotion control and high expressiveness.

Languages: en, zh

Clone voice

Spark TTS

Voice-cloning TTS with controllable emotion and speaking style via prompts.

Languages: en, zh

Clone voice

GPT-SoVITS

Few-shot voice-cloning TTS that replicates any voice from just 5 seconds of audio.

Languages: en, zh, ja, ko

Clone voice

Chatterbox

Voice cloning with emotion control, from Resemble AI.

Languages: en

Clone voice

Tortoise TTS

Multi-voice TTS system that prioritizes realistic prosody and output quality over speed.

Languages: en

Clone voice

OpenVoice

Instant voice cloning with granular control over style, emotion, and accent.

Languages: en, zh, ja, ko, fr, de, es, it

Clone voice

Qwen3 TTS

Alibaba's multilingual TTS with voice cloning, voice presets, and voice design from text.

Languages: en, zh, ja, ko, de, fr, ru, pt, es, it

Clone voice

Chatterbox Turbo

A faster Chatterbox with sub-200 ms latency and paralinguistic tags for laughter, coughing, and more.

Languages: en

Clone voice

VoxCPM

Tokenizer-free TTS that produces 44.1 kHz audio with context-aware paragraph consistency.

Languages: en, zh

Clone voice

OuteTTS

LLM-based TTS that runs on CPU, GPU, or in the browser via llama.cpp and Transformers.js.

Languages: en

Clone voice

Pocket TTS

Lightweight 100M-parameter model by Kyutai with voice cloning from a single sample.

Languages: en, fr

Clone voice

CosyVoice3

Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.

Languages: en, zh, ja, ko, de, es, fr, it, ru

Clone voice

MOSS-TTS

Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.

Languages: en, zh, de, es, fr, ja, it, hu, ko, ru, fa, ar, pl, pt, cs, da, sv, el, tr

Clone voice

MegaTTS3

ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.

Languages: en, zh

Clone voice

Developer-First API

OpenAI-compatible REST API. One endpoint, 22+ models. Streaming support for real-time applications.

  • OpenAI-compatible format
  • Streaming TTS for real-time applications
  • Batch processing for large jobs
  • Webhook notifications
View API Docs
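pip install ttsai
npm install @ttsainpm/ttsai
Python
from tts_ai import TTSClient

# Authenticate with your API key (placeholder shown here).
client = TTSClient(api_key="sk-tts-xxx")
# Generate speech with the Kokoro model and the af_bella voice.
audio = client.generate(
    text="Hello from TTS.ai!",
    model="kokoro",
    voice="af_bella",
)
# Write the generated audio to disk.
client.save(audio, "output.mp3")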

Simple, transparent pricing

Start free. Scale as you grow.

Free

$0

50 credits

  • Kokoro, Piper, VITS, MeloTTS
  • 500-character limit
  • 3 generations/hour (no account)
Sign up for free

Starter

$9/mo

500 credits / month

  • All 22+ models
  • 100,000 characters per generation
  • Voice cloning
Get started
Most popular

Pro

$29/mo

2,000 credits / month

  • Everything in Starter
  • API access
  • Priority processing
Get Pro

Business

$99/mo

10,000 credits / month

  • Everything in Pro
  • Bulk API
  • Priority
Get Business

See all plans, including credit packs →
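
To compare the paid plans, it can help to work out the effective price of one credit from the monthly prices and credit allowances listed above. The sketch below does only that division. The dictionary layout and function name are illustrative, and it ignores credit packs and any discounts, which are not described on this page.

```python
# Monthly price (USD) and monthly credits per plan, as listed above.
PLANS = {
    "Starter": (9, 500),
    "Pro": (29, 2_000),
    "Business": (99, 10_000),
}


def price_per_credit(plan: str) -> float:
    """Return the effective USD cost of one credit on the given plan."""
    price, credits = PLANS[plan]
    return price / credits


for name in PLANS:
    print(f"{name}: ${price_per_credit(name):.4f} per credit")
```

On these numbers the per-credit price falls as the plan size grows, from about $0.018 on Starter to about $0.0099 on Business.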

Frequently asked questions

TTS.ai is the most comprehensive AI voice platform, offering more than 22 text-to-speech models, voice cloning, speech-to-text, and audio tools. All models are open source, with no vendor lock-in.

Yes! TTS.ai offers free text-to-speech with the Kokoro, Piper, VITS, and MeloTTS models. No account required. Sign up to get 15,000 free characters and access to all models. Paid plans start at $9/month.

For speed, use Kokoro or Piper. For quality, try CosyVoice 2 or StyleTTS 2. For voice cloning, use Chatterbox or GPT-SoVITS. For dialogue, use Dia TTS. Try several models on the same text to compare.
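
The recommendations in this answer amount to a small lookup from goal to candidate models. A minimal sketch of that mapping follows. The goal labels and the function name are illustrative assumptions, not identifiers used by TTS.ai.

```python
# Recommendations taken from the answer above; the keys are illustrative labels.
RECOMMENDED_MODELS = {
    "speed": ["Kokoro", "Piper"],
    "quality": ["CosyVoice 2", "StyleTTS 2"],
    "voice_cloning": ["Chatterbox", "GPT-SoVITS"],
    "dialogue": ["Dia TTS"],
}


def recommend(goal: str) -> list[str]:
    """Return the models suggested for a goal, or an empty list if the goal is unknown."""
    return RECOMMENDED_MODELS.get(goal, [])


print(recommend("speed"))
```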

Yes. An OpenAI-compatible REST API covers TTS, STT, voice cloning, and audio tools. It is available on the Pro ($29/month) and Enterprise ($99/month) plans.
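
"OpenAI-compatible" suggests the request shape of OpenAI's speech endpoint: a POST to `/audio/speech` with a JSON body containing `model`, `input`, and `voice`, plus a bearer token. The sketch below only builds such a request with the standard library and does not send it. The base URL is a placeholder, and whether TTS.ai accepts exactly this path and these field names is an assumption to check against the API docs.

```python
import json
import urllib.request

# Hypothetical base URL; replace it with the value from the TTS.ai API docs.
BASE_URL = "https://api.example.com/v1"


def build_speech_request(
    api_key: str, text: str, model: str, voice: str
) -> urllib.request.Request:
    """Build (but do not send) a POST request shaped like OpenAI's /audio/speech call."""
    payload = {"model": model, "input": text, "voice": voice}
    return urllib.request.Request(
        url=f"{BASE_URL}/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_speech_request("sk-tts-xxx", "Hello from TTS.ai!", "kokoro", "af_bella")
print(request.get_method(), request.full_url)
print(json.loads(request.data))
```

Passing the request to `urllib.request.urlopen` would send it. That step is left out here because the endpoint above is a placeholder.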

Voice quality varies by model. Premium models such as CosyVoice 2, StyleTTS 2, and Chatterbox produce human-quality speech with natural intonation and emotion. Free models such as Kokoro offer excellent quality for most use cases.

TTS.ai supports more than 30 languages across its model library. English has the broadest model support, but models such as CosyVoice 2 cover Chinese, Japanese, and Korean; GPT-SoVITS handles Chinese, Japanese, Korean, and English; and MeloTTS supports English, Spanish, French, Chinese, Japanese, and Korean.

Yes. All processing happens on our dedicated GPU servers. We do not store your text input or generated audio after delivery. Uploaded voice samples for cloning are used only for the current session and are not retained. We never share your data with third parties or use it to train models.

Yes. All audio generated on TTS.ai is yours to use for commercial purposes, including YouTube videos, podcasts, audiobooks, apps, advertising, and products. Our models are open source under permissive licenses (MIT, Apache 2.0).

TTS.ai generates audio in WAV format by default for the best quality. You can convert it to MP3, FLAC, OGG, or M4A using our free Audio Converter tool.
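
Because WAV files carry uncompressed PCM audio, their basic properties can be read with Python's standard `wave` module before any conversion. The sketch below builds a one-second tone in memory as a stand-in for a downloaded file and then reads its duration. The 24 kHz mono 16-bit settings are assumptions for illustration, not the documented output format of TTS.ai.

```python
import io
import math
import struct
import wave


def make_test_wav(seconds: float = 1.0, sample_rate: int = 24_000) -> bytes:
    """Create a mono 16-bit WAV containing a 440 Hz tone (a stand-in for real output)."""
    frames = bytearray()
    for n in range(int(seconds * sample_rate)):
        sample = int(0.3 * 32767 * math.sin(2 * math.pi * 440 * n / sample_rate))
        frames += struct.pack("<h", sample)
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wav.setframerate(sample_rate)
        wav.writeframes(bytes(frames))
    return buffer.getvalue()


def wav_duration(data: bytes) -> float:
    """Return the duration in seconds of WAV data held in memory."""
    with wave.open(io.BytesIO(data), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


wav_bytes = make_test_wav()
print(f"{wav_duration(wav_bytes):.2f} s")
```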

Upload a short audio sample (as little as 5 seconds) of the voice you want to clone, then type any text to generate speech in that voice. Models like Chatterbox, GPT-SoVITS, and CosyVoice 2 support voice cloning. The cloned voice captures tone, accent, and speaking style.

Free models (Kokoro, Piper, VITS, MeloTTS) require no account and cost zero credits. Standard models (2 credits per 1K characters) include Bark, CosyVoice 2, F5-TTS, and Dia. Premium models (4 credits per 1K characters) include OpenVoice, Chatterbox, StyleTTS 2, and Tortoise. Paid models generally offer higher quality, more voices, and extra features such as voice cloning.
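
The per-tier rates in this answer make it possible to estimate the credit cost of a generation in advance. The sketch below assumes the cost scales linearly with character count. How TTS.ai actually rounds partial blocks of 1,000 characters is not stated here, so treat the result as an estimate. The tier labels and function name are illustrative.

```python
# Credits per 1,000 characters by tier, from the answer above.
CREDITS_PER_1K = {"free": 0, "standard": 2, "premium": 4}


def estimate_credits(characters: int, tier: str) -> float:
    """Estimate credits for one generation, assuming cost scales linearly with length."""
    if characters < 0:
        raise ValueError("characters must be non-negative")
    return characters / 1000 * CREDITS_PER_1K[tier]


print(estimate_credits(2_500, "standard"))
print(estimate_credits(2_500, "premium"))
```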

Yes. The API supports batch processing for converting large amounts of text to speech. Submit multiple requests and retrieve the results asynchronously using job UUIDs. The Enterprise plan ($99/month) includes priority queue access for faster batch processing. It is ideal for audiobook production, course content, and large-scale voiceover projects.
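
The submit-then-retrieve pattern described in this answer can be illustrated without any network calls. The sketch below is a toy in-process queue: each submission returns a job UUID, and results are fetched later by that ID. `BatchQueue` and `fake_synthesize` are hypothetical names invented for this illustration and are not part of the TTS.ai API.

```python
import uuid
from concurrent.futures import Future, ThreadPoolExecutor


def fake_synthesize(text: str) -> bytes:
    """Stand-in for a real TTS call; returns placeholder bytes, not audio."""
    return f"<audio for: {text}>".encode("utf-8")


class BatchQueue:
    """Toy in-process queue that hands back a job UUID for each submission."""

    def __init__(self, workers: int = 2) -> None:
        self._pool = ThreadPoolExecutor(max_workers=workers)
        self._jobs: dict[str, Future] = {}

    def submit(self, text: str) -> str:
        """Queue one text for synthesis and return its job ID immediately."""
        job_id = str(uuid.uuid4())
        self._jobs[job_id] = self._pool.submit(fake_synthesize, text)
        return job_id

    def result(self, job_id: str, timeout: float = 5.0) -> bytes:
        """Wait for the job to finish (up to the timeout) and return its output."""
        return self._jobs[job_id].result(timeout=timeout)

    def close(self) -> None:
        self._pool.shutdown(wait=True)


batch = BatchQueue()
job_ids = [batch.submit(chapter) for chapter in ["Chapter 1", "Chapter 2"]]
outputs = [batch.result(job_id) for job_id in job_ids]
batch.close()
print(len(outputs), "jobs finished")
```

With a real service, `submit` would be an HTTP request and `result` a poll or webhook callback keyed by the same job ID.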

Start using AI Voice today

Join the creators, developers, and businesses that use TTS.ai