Free AI Text to SpeechQuery
33+ open-source mafano, 273+ mawu, 33+ Palibe akaunti yofunika.
Zonse zomwe mukufuna kwa Voice AI
30 + zipangizo zopangidwa ndi mapangidwe a AI otsegulidwa
33+ AI Voice Models
Kusonkhanitsa kokwanira kwambiri kwa ma TTS aulere aulere m'modzi m'modzi
Kokoro Opanda ndalama
Kokoro ndi 82 miliyoni paramita malemba-ku-kulankhula chitsanzo chomwe punches bwino pamwamba pa khalidwe lake la thupi. Ngakhale ndi ochepa kukula, amatulutsa mawu owoneka bwino ndi owoneka bwino. Kokoro amathandiza mabungwe ambiri kuphatikizapo Chijeremani, Chijeremani, Chijeremani, ndi Korean ndi mitundu yosiyanasiyana ya mawu owoneka bwino.
Oyenera kwa: TTS yotsika mtengo ndi latency yotsika, machitidwe oyenda
Kuyesa kwaulere
Piper Opanda ndalama
Piper ndi makina otsika mtengo a mawu ochokera ku mawu omwe adapangidwa ndi Rhasspy omwe amagwiritsa ntchito VITS ndi larynx architectures. Imayenda kwathunthu pa CPU, zomwe zimapangitsa kuti ikhale yabwino kwa zida za edge, zowongolera zanyumba, ndi mapulogalamu omwe akufuna TTS osagwirizana. Ndi mawu oposa 100 m'zinenero 30 +, Piper imabweretsa mawu owoneka bwino panthawi ya real-time ngakhale pa Raspberry Pi 4.
Oyenera kwa: Kuwonetsa mofulumira, kupezeka, ndi mapulogalamu ophatikizidwa
Kuyesa kwaulere
VITS Opanda ndalama
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) ndi njira yofanana yoyambira kumapeto kwa TTS yomwe imapanga mawu owoneka bwino kwambiri kuposa mamodeli anthawi zonse awiri. Imagwiritsa ntchito kutengera kwa maonekedwe osiyanasiyana omwe amawonjezeredwa ndi kuwongolera kwa magazi ndi njira yophunzitsa yotsutsana, yomwe imakwaniritsa kuwonjezeka kwakukulu kwa chilengedwe.
Oyenera kwa: Kusintha mawu kukhala malemba ndi mawu
Kuyesa kwaulere
MeloTTS Opanda ndalama
MeloTTS ndi MyShell.ai ndi TTS library yokhala ndi mabuku ambiri omwe amathandizira Chijeremani (cha America, cha British, cha Indian, cha Australia), Chisipanishi, Chifalansa, Chijeremani, cha Japanese, ndi cha Korean. Ndiyabwino kwambiri, yopanga malemba panthawi yoyenera kwambiri pa CPU yokha. MeloTTS idapangidwa kuti igwiritse ntchito kupanga ndipo imathandizira kuzindikira kwa CPU ndi GPU.
Oyenera kwa: Ntchito zopanga zomwe zimafunikira TTS yofulumira komanso yosiyanasiyana
Kuyesa kwaulere
Kani TTS 2 Opanda ndalama
Kani-TTS-2 ndi NineNineSix ndi mtundu wa 400M wa 400M wopangidwa ndi Liquid AI LFM2 backbone ndi NVIDIA NanoCodec. Imagwira ntchito pa 3GB VRAM yokha ndipo imatulutsa ~ 10 masekondi a mawu mu ~ 2 masekondi pa A100 (RTF 0.2). Kutulutsa kwa anthu kwatsopano kumatumiza chitsimikizo cha `kani-tts-2-en` cha Chingerezi kokha ndipo sichikuwonetsa khomo lophatikizira lofunikira kwa kujambula mawu - kugwiritsa ntchito Chatterbox / IndexTTS2 / F5-TTS kwa kujambula, kapena Kokoro / MeloTTS kwa omwe salankhula Chingelezi.
Oyenera kwa: Fast English chiyambi pa low-VRAM zida, mawonedwe ofulumira
Kuyesa kwaulere
OuteTTS Opanda ndalama
OuteTTS imawonjezera mapangidwe amtundu waukulu ndi zofunikira za mawu-ku-mawu poteteza mapangidwe ake oyambirira. Imathandizira ma backends ambiri kuphatikiza llama.cpp (CPU / GPU), Hugging Face Transformers, ExLlamaV2, VLLM, komanso kuzindikira kwa msakatuli pogwiritsa ntchito Transformers.js.
Oyenera kwa: Kukhazikitsa kwa Edge, TTS yochokera pabrowser, malo otsika azinthu
Kuyesa kwaulere
Pocket TTS Opanda ndalama
Pocket TTS ya Kyutai (opanga Moshi) ndi mtundu wa 100M wa maparamita a masamba omwe amagwira bwino ntchito. Imagwira ntchito bwino pa CPU, imathandizira kujambula mawu osatha kuchokera pa sampling ya audio imodzi, ndipo imatulutsa mawu owoneka bwino.
Oyenera kwa: Kugwiritsa ntchito kosavuta, CPU-only environments, kujambula mawu mwachangu
Kuyesa kwaulere
Kitten TTS Opanda ndalama
Kitten TTS ndi KittenML ndi mtundu wa text-to-speech wokhala ndi ma parameters 15M mpaka 80M (25-80 MB pa diski), imapatsa mawu osiyanasiyana amtundu wamphamvu pa CPU popanda kufunikira GPU. Imaphatikizapo mawu 8 ophatikizidwa, kuthamanga kwa mawu osinthika, ndi kuphatikizidwa kwa masamba oyambirira a masamba, ndalama, ndi mayunitsi.
Oyenera kwa: Fast lightweight TTS, kukhazikitsa edge, ntchito zochepa
Kuyesa kwaulere
Ming-Omni TTS Opanda ndalama
Ming-omni-tts-0.5B ndi inclusionAI ndi mtundu wa mawu opanda kanthu omwe amapangidwa pa backbone ya BailingMM yolimba ndi decoder ya audio yogwirizana ndi Patch-by-Patch. Imabweretsa 44.1kHz (kuzungulira CD quality), imathandizira kujambula mawu opanda kanthu kuchokera ku 3 + yachiwiri, ndipo imaphatikizapo kuwongolera kwa emotion / dialect / BGM kudzera mu malangizo a JSON.
Oyenera kwa: High-fidelity bilingual narration, kuyankhulana kwa mawu oyang'anira, zinenero za Chisipanishi
Kuyesa kwaulere
MOSS-TTS Nano Opanda ndalama
MOSS-TTS-Nano-100M ndi mtundu wa 100M wa OpenMOSS wa banja la MOSS-TTS, lomwe limagawana chikhalidwe cha delay-transformer. Amagulitsa mtundu wa 8B wa 8B kwa ~ 80x ochepa komanso kutsitsa kwa VRAM, zomwe zimapangitsa kuti zikhale zoyenera kwa opanda chithandizo komanso opanga maphunziro.
Oyenera kwa: TTS yopanda malire, kupanga kwakukulu, kugwiritsa ntchito kolumikizana ndi kuchepa kwa latency
Kuyesa kwaulere
Bark Choyambirira
Model ya text-to-audio yokhala ndi transformer yomwe imapanga mawu, nyimbo, ndi zotsatira za mawu zowoneka bwino.
Wopanga: Suno · License: MIT
Yambitsani
Bark Small Choyambirira
Lighter mtundu wa Bark ndi mofulumira kumvetsa ndi pansi kugwiritsa ntchito kumbukirani.
Wopanga: Suno · License: MIT
Yambitsani
CosyVoice 2 Choyambirira
Alibaba's scalable streaming TTS ndi khalidwe la munthu-parity ndi kupitilira-zero latency.
Wopanga: Alibaba (Tongyi Lab) · License: Apache 2.0
Yambitsani
Dia TTS Choyambirira
Multi-wokamba nkhani dialogue chiyambi cha mtundu womwe amaumba zokambirana zachilengedwe pakati pa okamba nkhani.
Wopanga: Nari Labs · License: Apache 2.0
Yambitsani
Parler TTS Choyambirira
Kufotokozera mawu mukufuna mu chilankhulo chachilengedwe ndi Parler amapanga mawu ogwirizana.
Wopanga: Hugging Face · License: Apache 2.0
Yambitsani
IndexTTS-2 Choyambirira
Zero-shot TTS ndi kuwongolera kwa maganizo olimba komanso kutanthauzira kwakukulu.
Wopanga: Index Team · License: Bilibili Model License
Yambitsani
Spark TTS Choyambirira
Kulankhula kloning TTS ndi controlable chisoni ndi kulankhula mtundu mwa kufunsa.
Wopanga: SparkAudio · License: CC BY-NC-SA 4.0
Yambitsani
GPT-SoVITS Choyambirira
Few-shot mawu kloning TTS kuti amachitanso chilichonse mawu kuchokera 5 masekondi a audio.
Wopanga: RVC-Boss · License: MIT
Yambitsani
Orpheus Choyambirira
Model ya TTS yokhudzana ndi munthu yophunzitsa 100K maola a data ya mawu.
Wopanga: Canopy Labs · License: Llama 3.2 Community
Yambitsani
Qwen3 TTS Choyambirira
TTS ya Alibaba ndi mawu osankhidwa ndi mawu osankhidwa kuchokera ku malemba.
Wopanga: Alibaba (Qwen) · License: Apache 2.0
Yambitsani
VieNeu-TTS-v2 Choyambirira
Vietnamese + Chijeremani code-switching TTS ndi 7 preset mawu ndi zero-shot mawu kloning. CPU-khama, palibe GPU zofunika.
Wopanga: Phạm Nguyễn Ngọc Bảo · License: Apache 2.0
Yambitsani
Chatterbox Turbo Choyambirira
Faster Chatterbox ndi sub-200ms latency ndi paralinguistic tags kwa laughs, kupweteka, ndi zina zambiri.
Wopanga: Resemble AI · License: MIT
Yambitsani
VoxCPM Choyambirira
TTS yopanda Tokenizer yomwe imapanga 44.1kHz audio ndi kugwirizana kwa masamba omvetsetsa.
Wopanga: OpenBMB · License: Apache 2.0
Yambitsani
VibeVoice Choyambirira
Microsoft model for long-form multi-speaker content monga podcasts ndi audiobooks.
Wopanga: Microsoft · License: MIT
Yambitsani
CosyVoice3 Choyambirira
TTS yatsopano ya TTS ndi bi-streaming, kuwongolera maganizo, ndi kujambula mawu opanda kanthu.
Wopanga: Alibaba (FunAudioLLM) · License: Apache 2.0
Yambitsani
NAMAA Saudi TTS Choyambirira
Kuyamba kutsegulira Saudi-Arabic TTS. Native Saudi dialect ndi Chatterbox-quality voice cloning.
Wopanga: NAMAA Space · License: MIT
Yambitsani
Darwin TTS Choyambirira
Qwen3-TTS ndi mtundu wa Qwen3-1.7B, womwe umagwiritsa ntchito ma FFN kuti agwirizane ndi ma TTS ena.
Wopanga: FINAL-Bench · License: Apache 2.0
Yambitsani
MOSS-TTSD Choyambirira
Model yotsatira ya macheza a olankhula ambiri - kuyambitsa macheza a podcast-style ndi mpaka 5 olankhula ndi maola 60 a mawu ogwirizana.
Wopanga: OpenMOSS · License: Apache 2.0
Yambitsani
CosyVoice 2
Alibaba's scalable streaming TTS ndi khalidwe la munthu-parity ndi kupitilira-zero latency.
Zilankhulo: en, zh, ja, ko, fr, de, it, es
Kusintha mawu
IndexTTS-2
Zero-shot TTS ndi kuwongolera kwa maganizo olimba komanso kutanthauzira kwakukulu.
Zilankhulo: en, zh
Kusintha mawu
Spark TTS
Kulankhula kloning TTS ndi controlable chisoni ndi kulankhula mtundu mwa kufunsa.
Zilankhulo: en, zh
Kusintha mawu
GPT-SoVITS
Few-shot mawu kloning TTS kuti amachitanso chilichonse mawu kuchokera 5 masekondi a audio.
Zilankhulo: en, zh, ja, ko
Kusintha mawu
Chatterbox
State-of-the-art zero-shot kujambula mawu ndi kuwongolera maganizo kuchokera ku Resemble AI.
Zilankhulo: en
Kusintha mawu
Tortoise TTS
Multi-wolankhula malemba-ku-kulankhula kuganizira za katundu ndi autoregressive ukadaulo.
Zilankhulo: en
Kusintha mawu
OpenVoice
Instant mawu kloning ndi granular kuwongolera pa mtundu, chisoni, ndi accent.
Zilankhulo: en, zh, ja, ko, fr, es
Kusintha mawu
VieNeu-TTS-v2
Vietnamese + Chijeremani code-switching TTS ndi 7 preset mawu ndi zero-shot mawu kloning. CPU-khama, palibe GPU zofunika.
Zilankhulo: vi, en
Kusintha mawu
Chatterbox Turbo
Faster Chatterbox ndi sub-200ms latency ndi paralinguistic tags kwa laughs, kupweteka, ndi zina zambiri.
Zilankhulo: en
Kusintha mawu
VoxCPM
TTS yopanda Tokenizer yomwe imapanga 44.1kHz audio ndi kugwirizana kwa masamba omvetsetsa.
Zilankhulo: en, zh
Kusintha mawu
OuteTTS
LLM-ogwirizana TTS kuti amayenda pa CPU, GPU, kapena browser kudzera llama.cpp ndi Transformers.js.
Zilankhulo: en
Kusintha mawu
Pocket TTS
Model ya 100M ya Kyutai ndi kulumikizana kwa mawu kuchokera ku satifiketi imodzi.
Zilankhulo: en, fr
Kusintha mawu
CosyVoice3
TTS yatsopano ya TTS ndi bi-streaming, kuwongolera maganizo, ndi kujambula mawu opanda kanthu.
Zilankhulo: en, zh, ja, ko, de, es, fr, it, ru
Kusintha mawu
NAMAA Saudi TTS
Kuyamba kutsegulira Saudi-Arabic TTS. Native Saudi dialect ndi Chatterbox-quality voice cloning.
Zilankhulo: ar
Kusintha mawu
Darwin TTS
Qwen3-TTS ndi mtundu wa Qwen3-1.7B, womwe umagwiritsa ntchito ma FFN kuti agwirizane ndi ma TTS ena.
Zilankhulo: en, ko, ja, zh
Kusintha mawu
MOSS-TTSD
Model yotsatira ya macheza a olankhula ambiri - kuyambitsa macheza a podcast-style ndi mpaka 5 olankhula ndi maola 60 a mawu ogwirizana.
Zilankhulo: en, zh
Kusintha mawu
Ming-Omni TTS
Model ya 0.5B yokhala ndi mawu osiyanasiyana a inclusionAI ndi 44.1kHz yokhala ndi 44.1kHz yokhala ndi 44.1kHz ndi zero-shot voice cloning.
Zilankhulo: en, zh
Kusintha mawu
MOSS-TTS Nano
Tiny 100M MOSS-TTS mtundu - chimodzimodzi chikhalidwe, 80x ochepa, free-tier latency.
Zilankhulo: en, zh, de, es, fr, ja, it, ko, ru, ar, pt
Kusintha mawuWopanga-Pyamba API
OpenAI-kugwirizana REST API. One endpoint, 22 + mafano. Streaming thandizo kwa real-time mapulogalamu.
- Format yogwirizana ndi OpenAI
- Streaming TTS kwa real-time mapulogalamu
- Batch processing kwa ntchito zazikulu
- Zidziwitso za Webhook
pip install ttsai
npm install @ttsainpm/ttsai
from tts_ai import TTSClient
client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
text="Hello from TTS.ai!",
model="kokoro",
voice="af_bella",
)
client.save(audio, "output.mp3")
Zosatheka, Zosawoneka bwino
Kuyamba kwaulere. Scale monga mukukula.
Opanda pake
15,000 characters + 5,000/day
- 7 ufulu mafano kuphatikizapo Kokoro
- 5,000 chars per generation
- Kupeza kwa API kuphatikizidwa
Woyamba
500,000 characters/mwezi
- onse 22+ zojambula
- 100,000 chars pa chiyambi
- Chizindikiro cha mawu
Pro
2,000,000 characters/mwezi
- Zonse mu Starter
- Kugwiritsa ntchito API
- Priority processing
Zamalonda
10,000,000 characters/mwezi
- Zonse mu Pro
- Mphamvu ya API
- Priority queue
Funso Lofunsidwa Kawirikawiri
Kodi tingachitire chiyani kuti tisinthe? Maganizo anu amatithandiza kuchotsa mavuto.
Kuyambiranso kugwiritsa ntchito AI Voice lero
Join opanga, opanga, ndi makampani pogwiritsa ntchito TTS.ai