Free AI Text to SpeechQuery
31+ open-source mafano, 231+ mawu, 34+ Palibe akaunti yofunika.
Zonse zomwe mukufuna kwa Voice AI
30 + zipangizo zopangidwa ndi mapangidwe a AI otsegulidwa
31+ AI Voice Models
Kusonkhanitsa kokwanira kwambiri kwa ma TTS aulere aulere m'modzi m'modzi
Kokoro Free
Kokoro ndi 82 miliyoni paramita malemba-ku-kulankhula chitsanzo chomwe punches bwino pamwamba pa khalidwe lake la thupi. Ngakhale ndi ochepa kukula, amatulutsa mawu owoneka bwino ndi owoneka bwino. Kokoro amathandiza mabungwe ambiri kuphatikizapo Chijeremani, Chijeremani, Chijeremani, ndi Korean ndi mitundu yosiyanasiyana ya mawu owoneka bwino.
Oyenera kwa: TTS yotsika mtengo ndi latency yotsika, machitidwe oyenda
Kuyesa kwaulere
Piper Free
Piper ndi makina otsika mtengo a mawu ochokera ku mawu omwe adapangidwa ndi Rhasspy omwe amagwiritsa ntchito VITS ndi larynx architectures. Imayenda kwathunthu pa CPU, zomwe zimapangitsa kuti ikhale yabwino kwa zida za edge, zowongolera zanyumba, ndi mapulogalamu omwe akufuna TTS osagwirizana. Ndi mawu oposa 100 m'zinenero 30 +, Piper imabweretsa mawu owoneka bwino panthawi ya real-time ngakhale pa Raspberry Pi 4.
Oyenera kwa: Kuwonetsa mofulumira, kupezeka, ndi mapulogalamu ophatikizidwa
Kuyesa kwaulere
VITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) ndi njira yofanana yoyambira kumapeto kwa TTS yomwe imapanga mawu owoneka bwino kwambiri kuposa mamodeli anthawi zonse awiri. Imagwiritsa ntchito kutengera kwa maonekedwe osiyanasiyana omwe amawonjezeredwa ndi kuwongolera kwa magazi ndi njira yophunzitsa yotsutsana, yomwe imakwaniritsa kuwonjezeka kwakukulu kwa chilengedwe.
Oyenera kwa: Kusintha mawu kukhala malemba ndi mawu
Kuyesa kwaulere
MeloTTS Free
MeloTTS ndi MyShell.ai ndi TTS library yokhala ndi mabuku ambiri omwe amathandizira Chijeremani (cha America, cha British, cha Indian, cha Australia), Chisipanishi, Chifalansa, Chijeremani, cha Japanese, ndi cha Korean. Ndiyabwino kwambiri, yopanga malemba panthawi yoyenera kwambiri pa CPU yokha. MeloTTS idapangidwa kuti igwiritse ntchito kupanga ndipo imathandizira kuzindikira kwa CPU ndi GPU.
Oyenera kwa: Ntchito zopanga zomwe zimafunikira TTS yofulumira komanso yosiyanasiyana
Kuyesa kwaulere
OuteTTS Free
OuteTTS imawonjezera mapangidwe amtundu waukulu ndi zofunikira za mawu-ku-mawu poteteza mapangidwe ake oyambirira. Imathandizira ma backends ambiri kuphatikiza llama.cpp (CPU / GPU), Hugging Face Transformers, ExLlamaV2, VLLM, komanso kuzindikira kwa msakatuli pogwiritsa ntchito Transformers.js.
Oyenera kwa: Kukhazikitsa kwa Edge, TTS yochokera pabrowser, malo otsika azinthu
Kuyesa kwaulere
Pocket TTS Free
Pocket TTS ya Kyutai (opanga Moshi) ndi mtundu wa 100M wa maparamita a masamba omwe amagwira bwino ntchito. Imagwira ntchito bwino pa CPU, imathandizira kujambula mawu osatha kuchokera pa sampling ya audio imodzi, ndipo imatulutsa mawu owoneka bwino.
Oyenera kwa: Kugwiritsa ntchito kosavuta, CPU-only environments, kujambula mawu mwachangu
Kuyesa kwaulere
Kitten TTS Free
Kitten TTS by KittenML is an ultra-lightweight text-to-speech model built on ONNX. With variants from 15M to 80M parameters (25-80 MB on disk), it delivers high-quality voice synthesis on CPU without requiring a GPU. Features 8 built-in voices, adjustable speech speed, and built-in text preprocessing for numbers, currencies, and units. Ideal for edge deployment and low-latency applications.
Oyenera kwa: Fast lightweight TTS, edge deployment, low-latency applications
Kuyesa kwaulere
Bark Standard
Model ya text-to-audio yokhala ndi transformer yomwe imapanga mawu, nyimbo, ndi zotsatira za mawu zowoneka bwino.
Wopanga: Suno · License: MIT
Yambitsani
Bark Small Standard
Lighter mtundu wa Bark ndi mofulumira kumvetsa ndi pansi kugwiritsa ntchito kumbukirani.
Wopanga: Suno · License: MIT
Yambitsani
CosyVoice 2 Standard
Alibaba's scalable streaming TTS ndi khalidwe la munthu-parity ndi kupitilira-zero latency.
Wopanga: Alibaba (Tongyi Lab) · License: Apache 2.0
Yambitsani
Dia TTS Standard
Multi-wokamba nkhani dialogue chiyambi cha mtundu womwe amaumba zokambirana zachilengedwe pakati pa okamba nkhani.
Wopanga: Nari Labs · License: Apache 2.0
Yambitsani
Parler TTS Standard
Kufotokozera mawu mukufuna mu chilankhulo chachilengedwe ndi Parler amapanga mawu ogwirizana.
Wopanga: Hugging Face · License: Apache 2.0
Yambitsani
GLM-TTS Standard
Achire pansi kwambiri chizindikiro cha vuto kuchuluka pakati open-source TTS mafano.
Wopanga: Zhipu AI · License: GLM-4 License
Yambitsani
IndexTTS-2 Standard
Zero-shot TTS ndi kuwongolera kwa maganizo olimba komanso kutanthauzira kwakukulu.
Wopanga: Index Team · License: Bilibili Model License
Yambitsani
Spark TTS Standard
Kulankhula kloning TTS ndi controlable chisoni ndi kulankhula mtundu mwa kufunsa.
Wopanga: SparkAudio · License: CC BY-NC-SA 4.0
Yambitsani
GPT-SoVITS Standard
Few-shot mawu kloning TTS kuti amachitanso chilichonse mawu kuchokera 5 masekondi a audio.
Wopanga: RVC-Boss · License: MIT
Yambitsani
Orpheus Standard
Model ya TTS yokhudzana ndi munthu yophunzitsa 100K maola a data ya mawu.
Wopanga: Canopy Labs · License: Llama 3.2 Community
Yambitsani
Qwen3 TTS Standard
TTS ya Alibaba ndi mawu osiyanasiyana, mawu osankhidwa, ndi mawu opangidwa kuchokera ku malemba.
Wopanga: Alibaba (Qwen) · License: Apache 2.0
Yambitsani
Chatterbox Turbo Standard
Faster Chatterbox ndi sub-200ms latency ndi paralinguistic tags kwa laughs, kupweteka, ndi zina zambiri.
Wopanga: Resemble AI · License: MIT
Yambitsani
Dia 2 Standard
Kutumiza-pyamba conversational TTS ndi multi-wokamba nkhani uthenga ndi paralinguistic cues.
Wopanga: Nari Labs · License: Apache 2.0
Yambitsani
VoxCPM Standard
TTS yopanda Tokenizer yomwe imapanga 44.1kHz audio ndi kugwirizana kwa masamba omvetsetsa.
Wopanga: OpenBMB · License: Apache 2.0
Yambitsani
TADA Standard
Zero-hallucination TTS ndi malemba-acoustic dual kugwirizana, 5x mofulumira kuposa kuyerekezera LLM TTS.
Wopanga: Hume AI · License: MIT
Yambitsani
VibeVoice Standard
Microsoft model for long-form multi-speaker content monga podcasts ndi audiobooks.
Wopanga: Microsoft · License: MIT
Yambitsani
CosyVoice3 Standard
Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.
Wopanga: Alibaba (FunAudioLLM) · License: Apache 2.0
Yambitsani
CosyVoice 2
Alibaba's scalable streaming TTS ndi khalidwe la munthu-parity ndi kupitilira-zero latency.
Zilankhulo: en, zh, ja, ko, fr, de, it, es
Kusintha mawu
GLM-TTS
Achire pansi kwambiri chizindikiro cha vuto kuchuluka pakati open-source TTS mafano.
Zilankhulo: en, zh
Kusintha mawu
IndexTTS-2
Zero-shot TTS ndi kuwongolera kwa maganizo olimba komanso kutanthauzira kwakukulu.
Zilankhulo: en, zh
Kusintha mawu
Spark TTS
Kulankhula kloning TTS ndi controlable chisoni ndi kulankhula mtundu mwa kufunsa.
Zilankhulo: en, zh
Kusintha mawu
GPT-SoVITS
Few-shot mawu kloning TTS kuti amachitanso chilichonse mawu kuchokera 5 masekondi a audio.
Zilankhulo: en, zh, ja, ko
Kusintha mawu
Chatterbox
State-of-the-art zero-shot kujambula mawu ndi kuwongolera maganizo kuchokera ku Resemble AI.
Zilankhulo: en
Kusintha mawu
Tortoise TTS
Multi-wolankhula malemba-ku-kulankhula kuganizira za katundu ndi autoregressive ukadaulo.
Zilankhulo: en
Kusintha mawu
OpenVoice
Instant mawu kloning ndi granular kuwongolera pa mtundu, chisoni, ndi accent.
Zilankhulo: en, zh, ja, ko, fr, de, es, it
Kusintha mawu
Qwen3 TTS
TTS ya Alibaba ndi mawu osiyanasiyana, mawu osankhidwa, ndi mawu opangidwa kuchokera ku malemba.
Zilankhulo: en, zh, ja, ko, de, fr, ru, pt, es, it
Kusintha mawu
Chatterbox Turbo
Faster Chatterbox ndi sub-200ms latency ndi paralinguistic tags kwa laughs, kupweteka, ndi zina zambiri.
Zilankhulo: en
Kusintha mawu
VoxCPM
TTS yopanda Tokenizer yomwe imapanga 44.1kHz audio ndi kugwirizana kwa masamba omvetsetsa.
Zilankhulo: en, zh
Kusintha mawu
OuteTTS
LLM-ogwirizana TTS kuti amayenda pa CPU, GPU, kapena browser kudzera llama.cpp ndi Transformers.js.
Zilankhulo: en
Kusintha mawu
Pocket TTS
Model ya 100M ya Kyutai ndi kulumikizana kwa mawu kuchokera ku satifiketi imodzi.
Zilankhulo: en, fr
Kusintha mawu
CosyVoice3
Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.
Zilankhulo: en, zh, ja, ko, de, es, fr, it, ru
Kusintha mawu
MOSS-TTS
Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.
Zilankhulo: en, zh, de, es, fr, ja, it, hu, ko, ru, fa, ar, pl, pt, cs, da, sv, el, tr
Kusintha mawu
MegaTTS3
ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.
Zilankhulo: en, zh
Kusintha mawuWopanga-Pyamba API
OpenAI-kugwirizana REST API. One endpoint, 22 + mafano. Streaming thandizo kwa real-time mapulogalamu.
- Format yogwirizana ndi OpenAI
- Streaming TTS kwa real-time mapulogalamu
- Batch processing kwa ntchito zazikulu
- Zidziwitso za Webhook
pip install ttsai
npm install @ttsainpm/ttsai
from tts_ai import TTSClient
client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
text="Hello from TTS.ai!",
model="kokoro",
voice="af_bella",
)
client.save(audio, "output.mp3")
Zosatheka, Zosawoneka bwino
Kuyamba kwaulere. Scale monga mukukula.
_Yaulere
15,000 characters
- Kokoro, Piper, VITS, MeloTTS
- 500 Character limit
- 3 gen / ola (opanda akaunti)
Woyamba
500,000 characters/mwezi
- onse 22+ zojambula
- 100,000 chars pa chiyambi
- Chizindikiro cha mawu
Pro
2,000,000 characters/mwezi
- Zonse mu Starter
- Kugwiritsa ntchito API
- Priority processing
Zamalonda
10,000,000 characters/mwezi
- Zonse mu Pro
- Mphamvu ya API
- Priority queue
Funso Lofunsidwa Kawirikawiri
Kodi tingachitire chiyani kuti tisinthe? Maganizo anu amatithandiza kuchotsa mavuto.
Kuyambiranso kugwiritsa ntchito AI Voice lero
Join opanga, opanga, ndi makampani pogwiritsa ntchito TTS.ai