Free AI Text to SpeechGenericName
31+ open-source modèl, 231+ vwa, 34+ Pa gen kont mande.
Tout sa ou bezwen pou Voice AI
30+ zouti sipòte pa modèl AI open-source
31+ Modèl Vokal AI
Koleksyon ki pi konplè nan modèl TTS open-source nan yon sèl platfòm
Kokoro Free
Kokoro se yon 82 milyon paramèt tèks-a-parole modèl ki punches byen pi wo pase klas pwa li. Pandan ke gwosè li ti, li pwodwi pale remarkabman natirèl ak ekspresif. Kokoro sipòte plizyè lang ki gen ladan angle, Japonè, Chinwa, ak Koreyen ak yon varyete de vwa ekspresif. Li kouri incredibly vit — jenere son prèske 100x pi vit pase tan reyèl sou yon GPU.
Pi bon pou: TTS bon jan kalite segondè ak latency minimòm, aplikasyon streaming
Eseye gratis
Piper Free
Piper se yon motè tèks-a-parole limyè devlope pa Rhasspy ki itilize VITS ak larynx achitekti. Li kouri konplètman sou CPU, ki fè li ideyal pou aparèy edge, automatisation kay, ak aplikasyon ki mande TTS offline. Avèk plis pase 100 vwa nan plis pase 30 lang, Piper bay pale son natirèl nan vitès tan reyèl menm sou yon Raspberry Pi 4.
Pi bon pou: Previews rapid, accessibility, and embedded applications
Eseye gratis
VITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) se yon metòd TTS paralèl bout-a-bòd ki kreye yon son ki pi natirèl pase modèl aktyèl ki baze sou de etap. Li adopte inférence variational ki ogmante ak koule normalisation ak yon pwosesis antrenman adversarial, rive jwenn yon amelyorasyon siyifikatif nan natiralizasyon.
Pi bon pou: Text-to-speech pou rezon jeneral ak prozodi natirèlName
Eseye gratis
MeloTTS Free
MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.
Pi bon pou: Aplikasyon pwodiksyon ki bezwen TTS rapid, multilenguage
Eseye gratis
OuteTTS Free
OuteTTS pwolonje gwo modèl lang ak kapasite tèks-a-parole pandan l ap kenbe achitekti orijinal la. Li sipòte backends multiple ki gen ladan llama.cpp (CPU / GPU), Hugging Face Transformers, ExLlamaV2, VLLM, ak menm infèrans navigatè via Transformers.js. Features zero-shot voice cloning through speaker profiles saved as JSON.
Pi bon pou: Edge deployment, TTS ki baze sou navigatè, environnements ki ba-resous
Eseye gratis
Pocket TTS Free
Pocket TTS pa Kyutai (kreyatè Moshi) se yon modèl tèks-a-parole 100M paramèt ki koube byen pi wo pase pwa li. Li kouri efikasman sou CPU, sipòte klonaj vwa zero-shot soti nan yon sèl echantiyon son, epi pwodwi pale natirèl-son. Ti gwosè modèl la fè li ideyal pou deployment edge ak anviwònman ki ba-resous.
Pi bon pou: Deployman limyè, environnements CPU-only, klonaj vwa rapid
Eseye gratis
Kitten TTS Free
Kitten TTS by KittenML is an ultra-lightweight text-to-speech model built on ONNX. With variants from 15M to 80M parameters (25-80 MB on disk), it delivers high-quality voice synthesis on CPU without requiring a GPU. Features 8 built-in voices, adjustable speech speed, and built-in text preprocessing for numbers, currencies, and units. Ideal for edge deployment and low-latency applications.
Pi bon pou: Fast lightweight TTS, edge deployment, low-latency applications
Eseye gratis
Bark Standard
Modèl tèks-nan-son ki baze sou transformateur ki jenere pale, mizik, ak efè son realist.
Pwogramè: Suno · Lisans: MIT
Tcheke li
Bark Small Standard
Versiyon ki pi limyè nan Bark ak inférence pi vit ak itilize nan memwa ki pi ba.
Pwogramè: Suno · Lisans: MIT
Tcheke li
CosyVoice 2 Standard
Alibaba's scalable streaming TTS ak natiralizasyon parite imen ak latency prèske zewo.
Pwogramè: Alibaba (Tongyi Lab) · Lisans: Apache 2.0
Tcheke li
Dia TTS Standard
Modèl jenerasyon dyalòg multi-pale ki kreye konvèsasyon natirèl ant pale yo.
Pwogramè: Nari Labs · Lisans: Apache 2.0
Tcheke li
Parler TTS Standard
Descrivez la voix que vous voulez dans la langue naturelle et Parler génère la parole correspondante.
Pwogramè: Hugging Face · Lisans: Apache 2.0
Tcheke li
GLM-TTS Standard
Li gen pi ba pousantaj erè karaktè ant modèl TTS ki gen sous louvri.
Pwogramè: Zhipu AI · Lisans: GLM-4 License
Tcheke li
IndexTTS-2 Standard
Zero-shot TTS ak kontwòl emosyon fine-grained ak ekspresyon segondè.
Pwogramè: Index Team · Lisans: Bilibili Model License
Tcheke li
Spark TTS Standard
Voye klonaj TTS ak emosyon kontwole ak style pale via pwompts.
Pwogramè: SparkAudio · Lisans: CC BY-NC-SA 4.0
Tcheke li
GPT-SoVITS Standard
Few-shot klonaj vwa TTS ki replike nenpòt vwa soti nan jis 5 segonn nan son.
Pwogramè: RVC-Boss · Lisans: MIT
Tcheke li
Orpheus Standard
100,000 èdtan nan done pale yo te itilize pou fòme yon modèl TTS emosyonèl nivo imen.
Pwogramè: Canopy Labs · Lisans: Llama 3.2 Community
Tcheke li
Qwen3 TTS Standard
Alibaba's multilingual TTS ak klonaj vwa, preset vwa, ak konsepsyon vwa soti nan tèks.
Pwogramè: Alibaba (Qwen) · Lisans: Apache 2.0
Tcheke li
Chatterbox Turbo Standard
Chatterbox pi vit ak sub-200ms latency ak tags paralinguistik pou ri, touse, ak plis ankò.
Pwogramè: Resemble AI · Lisans: MIT
Tcheke li
Dia 2 Standard
Konvèsasyon TTS ki baze sou streaming ak dyalòg ant plizyè paleur ak sijesyon paralingwistik.
Pwogramè: Nari Labs · Lisans: Apache 2.0
Tcheke li
VoxCPM Standard
Tokenizer-gratis TTS ki pwodwi 44.1kHz odyo ak konstan paragraf kontexte-konsyan.
Pwogramè: OpenBMB · Lisans: Apache 2.0
Tcheke li
TADA Standard
Zero-hallucination TTS ak alineasyon doub tèks-acoustic, 5x pi vit pase LLM TTS konparab.
Pwogramè: Hume AI · Lisans: MIT
Tcheke li
VibeVoice Standard
Microsoft modèl pou fòm long multi-pale kontni tankou podcasts ak audiobooks.
Pwogramè: Microsoft · Lisans: MIT
Tcheke li
CosyVoice3 Standard
Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.
Pwogramè: Alibaba (FunAudioLLM) · Lisans: Apache 2.0
Tcheke li
CosyVoice 2
Alibaba's scalable streaming TTS ak natiralizasyon parite imen ak latency prèske zewo.
Lang: en, zh, ja, ko, fr, de, it, es
Klone Voy
IndexTTS-2
Zero-shot TTS ak kontwòl emosyon fine-grained ak ekspresyon segondè.
Lang: en, zh
Klone Voy
GPT-SoVITS
Few-shot klonaj vwa TTS ki replike nenpòt vwa soti nan jis 5 segonn nan son.
Lang: en, zh, ja, ko
Klone Voy
Chatterbox
Pwogram sa a gen ladan tou yon sistèm klonaj vwa ak kontwòl emosyonèl ki rele Resemble AI.
Lang: en
Klone Voy
OpenVoice
Instant klonaj vwa ak kontwòl granulaire sou style, emosyon, ak aksan.
Lang: en, zh, ja, ko, fr, de, es, it
Klone Voy
Qwen3 TTS
Alibaba's multilingual TTS ak klonaj vwa, preset vwa, ak konsepsyon vwa soti nan tèks.
Lang: en, zh, ja, ko, de, fr, ru, pt, es, it
Klone Voy
Chatterbox Turbo
Chatterbox pi vit ak sub-200ms latency ak tags paralinguistik pou ri, touse, ak plis ankò.
Lang: en
Klone Voy
VoxCPM
Tokenizer-gratis TTS ki pwodwi 44.1kHz odyo ak konstan paragraf kontexte-konsyan.
Lang: en, zh
Klone Voy
OuteTTS
LLM-ki baze sou TTS ki kouri sou CPU, GPU, oswa navigatè via llama.cpp ak Transformers.js.
Lang: en
Klone Voy
Pocket TTS
100M modèl paramèt limyè pa Kyutai ak klonaj vwa soti nan yon sèl echantiyon.
Lang: en, fr
Klone Voy
CosyVoice3
Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.
Lang: en, zh, ja, ko, de, es, fr, it, ru
Klone Voy
MOSS-TTS
Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.
Lang: en, zh, de, es, fr, ja, it, hu, ko, ru, fa, ar, pl, pt, cs, da, sv, el, tr
Klone Voy
MegaTTS3
ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.
Lang: en, zh
Klone VoyDeveloper-First API
OpenAI-kompatib REST API. One endpoint, 22 + modèl. Streaming sipò pou aplikasyon an tan reyèl.
- OpenAI-kompatib fòma
- Streaming TTS pou aplikasyon an tan reyèl
- Batch pwosesis pou gwo travay
- Notifikasyon Webhook
pip install ttsai
npm install @ttsainpm/ttsai
from tts_ai import TTSClient
client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
text="Hello from TTS.ai!",
model="kokoro",
voice="af_bella",
)
client.save(audio, "output.mp3")
Pri senp, transparan
Kòmanse gratis. Skale kòm ou grandi.
Gratis
50 kredi
- Kokoro, Piper, VITS, MeloTTS
- Limit 500 karaktè
- 3 gen/èdtan (pa gen kont)
Starter
500 kredi / mwa
- Tout 22+ modèl
- 100,000 karaktè pou chak jenerasyon
- Klonaj Vokal
Kesyon ki poze souvan
What could we improve? Your feedback helps us fix issues.
Kòmanse itilize AI Voice jodi a
Join kreyatè, devlopè, ak biznis ki itilize TTS.ai