Free AI Qoraalka u beddel hadal
33+ Open-source qaabab, 273+ Codadka, 33+ luqado. Aan xisaab loo baahnaa.
Wax kasta oo aad u baahan tahay Voice AI
30+ aaladaha ku shaqeeya qaababka AI ee asalka furan
33+ Muuqaalka Codka
Sameynta ugu ballaaran ee qaababka TTS ee asalka furan ee platform-ka
Kokoro Bilaash
Kokoro waa 82 milyan oo parameter qoraal-to-hadalka oo qaab ah oo si fiican u saaran heerka miisaanka. In kasta oo ay yar tahay, waxay soo saartaa hadal aad u dabiici ah oo muujinaya. Kokoro waxay taageertaa luqado badan oo ay ku jiraan Ingiriisiga, Japan, Shiinaha, iyo Korean oo leh codyo muujinaya oo kala duwan. Waxay ku socotaa si aad u dhakhso badan - abuurista maqalka oo ku dhow 100x ka dhaqso badan waqtiga dhabta ah ee GPU.
Ugu Fiican: TTS tayo sare leh oo leh latentii ugu yar, codsiyada streaming
Raac bilaash ah
Piper Bilaash
Piper waa mashiin qoraal-u-hadalka ah oo fudud oo ay soo saartay Rhasspy oo isticmaalaya VITS iyo dhismayaasha larynx. Waxay ku socotaa oo dhan CPU, taasoo ka dhigaysa mid aad u fiican qalabka edge, otomaatiga guriga, iyo codsiyada u baahan TTS offline. Iyadoo ku saabsan 100 codadka oo ka socda 30 + luqadood, Piper wuxuu siiyaa hadalka dabiiciga ah ee ku jira xawaaraha waqtiga dhabta ah, xitaa Raspberry Pi 4.
Ugu Fiican: Soo-dhaweynta degdegga ah, helitaanka, iyo barnaamijyada ku jira
Raac bilaash ah
VITS Bilaash
VITS (Isbarbardhiga kala duwan ee la barashada adversarial ee dhamaadka-to-dhamaadka Text-to-Speech) waa isku mid ah dhamaadka-to-dhamaadka TTS habka oo soo saara audio maqal badan oo dabiici ah ka badan hadda laba-geesoodka ah. Waxay qaadataa isbarbardhiga kala duwan oo la kordhiyay la isku mid ah qulqulka iyo habka tababarka adversarial, gaarista horumar weyn oo dabiiciga ah.
Ugu Fiican: Qoraalka-u-hadalka-u-ujeedada guud oo leh hadalka dabiiciga ah
Raac bilaash ah
MeloTTS Bilaash
MeloTTS by MyShell.ai waa maktabad TTS oo luqado badan leh oo taageera Ingiriisiga (Amerika, Ingiriis, Hindi, Australia), Isbaanish, Faransiis, Shiinaha, Japan, iyo Korean. Waa mid aad u dhaqso badan, oo qoraalka u qaabeeya xawaare ku dhow waqtiga dhabta ah ee CPU keliya. MeloTTS waxaa loogu talagalay isticmaalka wax soo saarka wuxuuna taageeraa labadaba CPU iyo GPU.
Ugu Fiican: Barnaamijyada wax soo saarka oo u baahan TTS degdeg ah, luqado badan
Raac bilaash ah
Kani TTS 2 Bilaash
Kani-TTS-2 by NineNineSix waa mid aad u fudud 400M parameter qaab dhismeedka ku dhisan yahay Liquid AI LFM2 backbone la NVIDIA NanoCodec. Waxay ku socotaa kaliya 3GB VRAM iyo soo saartaa ~ 10 ilbiriqsi oo hadal ah ~ 2 ilbiriqsi oo ku saabsan A100 (RTF 0.2). Soo saarida guud ee hadda jirta waxay leedahay marin-ka-qaybgal ah oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya oo keliya
Ugu Fiican: Fast English dhalasho on low-VRAM hardware, hordhaca degdeg ah
Raac bilaash ah
OuteTTS Bilaash
OuteTTS wuxuu ballaariyaa moodooyinka afka oo leh awoodo qoraal-u-hadalka ah, isagoo ilaalinaya naqshadeynta asalka ah. Waxay taageertaa backends badan oo ay ku jiraan llama.cpp (CPU / GPU), Hugging Face Transformers, ExLlamaV2, VLLM, iyo xitaa soo jeedinta brauzer-ka ee Transformers.js.
Ugu Fiican: Isku-darka Edge, TTS-ka ku saleysan brauzer-ka, deegaanka la'aanta
Raac bilaash ah
Pocket TTS Bilaash
Pocket TTS by Kyutai (abuurayaasha Moshi) waa 100M parameter qoraalka-to-speech qaabka compact in ka badan miisaanka. Waxay ku socotaa si wax ku ool ah on CPU, taageertaa zero-shot codka ka soo baxa tusaale audio kaliya, iyo soo saarta hadalka dabiiciga ah-soo baxay.
Ugu Fiican: Isku-darka fudud, CPU-keliya, muuqaalka codka ee degdega ah
Raac bilaash ah
Kitten TTS Bilaash
Kitten TTS by KittenML waa mid aad u fudud oo qoraal-u-hadalka ah oo ku dhisan ONNX. Iyadoo ay jiraan noocyo kala duwan oo ka socda 15M ilaa 80M parameters (25-80 MB diskka), waxay ku siineysaa isku-darka codka tayo sare leh ee CPU iyada oo aan loo baahnayn GPU. Features 8 codadka ku jira, xawaaraha hadalka ee la hagaajin karo, iyo qoraalka ku jira ee lagu soo bandhigo tirada, lacagta, iyo unugyada.
Ugu Fiican: TTS fudud oo degdeg ah, soo bandhigida cidhifka, codsiyada hooseeya
Raac bilaash ah
Ming-Omni TTS Bilaash
Ming-omni-tts-0.5B by inclusionAI waa mid compact omni-modal hadalka qaab dhismeedka ku dhisan yahay BailingMM aasaaska adag oo leh Patch-by-Patch qulqulka-matching audio decoder. Soo saarka 44.1kHz (CD-ga dhow), taageerada zero-shot codka ka soo 3 + ka labaad ee soo jeedinta, iyo waxaa ka mid ah aasaaska dareenka / afka / BGM xakamaynta ka dib marka ay raacaan tilmaamaha JSON.
Ugu Fiican: High-fidelity laba-af leh sheeko, dareenka-ku-xiran codka ciyaarta, Chinese audiobook content
Raac bilaash ah
MOSS-TTS Nano Bilaash
MOSS-TTS-Nano-100M waa nooc ka mid ah 100M-parameter ee OpenMOSS ee qoyska MOSS-TTS, oo qayb ka ah naqshadeynta dib-u-dhaca-beddelka. Ganacsiga 8B ee heerka sare ee heerka sare ee ~ 80x miisaanka yar iyo VRAM-ka ugu hooseeya ee per-request, taas oo ka dhigaysa mid ku habboon isticmaalka bilaashka ah iyo isticmaalka sare.
Ugu Fiican: TTS bilaash ah, wax soo saar ballaaran, isticmaalka isgaarsiinta hooseeya
Raac bilaash ah
Bark Standard
Model-ka qoraalka-audio-ka ah ee ku saleysan isbeddelka oo soo saara hadal, muusig iyo saameynta dhawaaqa.
Soo-saarihii: Suno · Liisan: MIT
Samee
Bark Small Standard
Version Lighter of Bark la soo jeedinta degdeg ah iyo isticmaalka xusuusta hooseeya.
Soo-saarihii: Suno · Liisan: MIT
Samee
CosyVoice 2 Standard
Alibaba ee scaleable streaming TTS la dabiiciga ah ee aadanaha-parity iyo laantii ku dhow-zero.
Soo-saarihii: Alibaba (Tongyi Lab) · Liisan: Apache 2.0
Samee
Dia TTS Standard
Multi-hoosaad dialog dhalasho qaabka oo abuura wada hadalka dabiiciga ah ee dhexdhexaadiyaal.
Soo-saarihii: Nari Labs · Liisan: Apache 2.0
Samee
Parler TTS Standard
Tilmaam codka aad rabto in afka dabiiciga ah iyo Parler soo saarta hadalka la mid ah.
Soo-saarihii: Hugging Face · Liisan: Apache 2.0
Samee
IndexTTS-2 Standard
Zero-shot TTS la fine-grained xakamaynta dareenka iyo muujinta sare.
Soo-saarihii: Index Team · Liisan: Bilibili Model License
Samee
Spark TTS Standard
Codka kloning TTS la dareenka la xakamayn karo iyo qaabka hadalka ka soo baxa.
Soo-saarihii: SparkAudio · Liisan: CC BY-NC-SA 4.0
Samee
GPT-SoVITS Standard
Dhamaan-shot codka isku-dhafan TTS in isku-dhafan cod kasta oo ka mid ah kaliya 5 ilbiriqsi oo audio.
Soo-saarihii: RVC-Boss · Liisan: MIT
Samee
Orpheus Standard
TTS moodel maskaxda ah oo heerka aadanaha ah oo tababaran 100K saacadood oo xogta hadalka ah.
Soo-saarihii: Canopy Labs · Liisan: Llama 3.2 Community
Samee
Qwen3 TTS Standard
Alibaba ee TTS multilingual la codadka hore loo dhigay iyo qaabeynta codka ka qoraalka.
Soo-saarihii: Alibaba (Qwen) · Liisan: Apache 2.0
Samee
VieNeu-TTS-v2 Standard
Vietnamese + Ingiriisi code-shidma TTS la 7 codadka hore iyo zero-shot codka isku-dhafan. CPU-keliya, GPU ma loo baahan yahay.
Soo-saarihii: Phạm Nguyễn Ngọc Bảo · Liisan: Apache 2.0
Samee
Chatterbox Turbo Standard
Faster Chatterbox la sub-200ms latentity iyo tags paralinguistic u qoslo, qandho, iyo in ka badan.
Soo-saarihii: Resemble AI · Liisan: MIT
Samee
VoxCPM Standard
Tokenizer-free TTS soo saara 44.1kHz audio la context-aware qodobka isku mid ah.
Soo-saarihii: OpenBMB · Liisan: Apache 2.0
Samee
VibeVoice Standard
Microsoft tusaale u ah qaabka dheer multi-hoogaamiyaha content sida podcasts iyo audiobooks.
Soo-saarihii: Microsoft · Liisan: MIT
Samee
CosyVoice3 Standard
Next-generation TTS luqado badan leh bi-streaming, xakamaynta dareenka, iyo zero-shot codka isku-dhafan.
Soo-saarihii: Alibaba (FunAudioLLM) · Liisan: Apache 2.0
Samee
NAMAA Saudi TTS Standard
Ugu horeysay u furan Saudi-Carabi TTS. Native Arab la Chatterbox-tayada codka kloning.
Soo-saarihii: NAMAA Space · Liisan: MIT
Samee
Darwin TTS Standard
Cross-modal Qwen3-TTS nooc leh miisaanka FFN isku darka ka Qwen3-1.7B qaabka afka ah ee sharraxaad badan oo afka badan.
Soo-saarihii: FINAL-Bench · Liisan: Apache 2.0
Samee
MOSS-TTSD Standard
Multi-hoosaad wada hadalka sii wadidda qaabka - abuuro podcast-style wada hadalka leh ilaa 5 hadal jeediyayaashiisa iyo 60 daqiiqo oo ah audio isku xiran.
Soo-saarihii: OpenMOSS · Liisan: Apache 2.0
Samee
CosyVoice 2
Alibaba ee scaleable streaming TTS la dabiiciga ah ee aadanaha-parity iyo laantii ku dhow-zero.
Afaf: en, zh, ja, ko, fr, de, it, es
Dhagax-dhig
IndexTTS-2
Zero-shot TTS la fine-grained xakamaynta dareenka iyo muujinta sare.
Afaf: en, zh
Dhagax-dhig
Spark TTS
Codka kloning TTS la dareenka la xakamayn karo iyo qaabka hadalka ka soo baxa.
Afaf: en, zh
Dhagax-dhig
GPT-SoVITS
Dhamaan-shot codka isku-dhafan TTS in isku-dhafan cod kasta oo ka mid ah kaliya 5 ilbiriqsi oo audio.
Afaf: en, zh, ja, ko
Dhagax-dhig
Chatterbox
State-of-the-art zero-shot codka isku-dhafan la dareenka ka maamulo ka soo Resemble AI.
Afaf: en
Dhagax-dhig
Tortoise TTS
Text-to-speech codka badan oo ku saleysan tayada leh naqshadeynta autoregressive.
Afaf: en
Dhagax-dhig
OpenVoice
Codka instant kloning la xakamaynta granular ka style, dareenka, iyo afka.
Afaf: en, zh, ja, ko, fr, es
Dhagax-dhig
VieNeu-TTS-v2
Vietnamese + Ingiriisi code-shidma TTS la 7 codadka hore iyo zero-shot codka isku-dhafan. CPU-keliya, GPU ma loo baahan yahay.
Afaf: vi, en
Dhagax-dhig
Chatterbox Turbo
Faster Chatterbox la sub-200ms latentity iyo tags paralinguistic u qoslo, qandho, iyo in ka badan.
Afaf: en
Dhagax-dhig
VoxCPM
Tokenizer-free TTS soo saara 44.1kHz audio la context-aware qodobka isku mid ah.
Afaf: en, zh
Dhagax-dhig
OuteTTS
LLM-ku salaysan TTS oo ku socda CPU, GPU, ama browser via llama.cpp iyo Transformers.js.
Afaf: en
Dhagax-dhig
Pocket TTS
100M parameter fudud oo loogu talagalay Kyutai oo leh codka ka soo baxa tusaale kaliya.
Afaf: en, fr
Dhagax-dhig
CosyVoice3
Next-generation TTS luqado badan leh bi-streaming, xakamaynta dareenka, iyo zero-shot codka isku-dhafan.
Afaf: en, zh, ja, ko, de, es, fr, it, ru
Dhagax-dhig
NAMAA Saudi TTS
Ugu horeysay u furan Saudi-Carabi TTS. Native Arab la Chatterbox-tayada codka kloning.
Afaf: ar
Dhagax-dhig
Darwin TTS
Cross-modal Qwen3-TTS nooc leh miisaanka FFN isku darka ka Qwen3-1.7B qaabka afka ah ee sharraxaad badan oo afka badan.
Afaf: en, ko, ja, zh
Dhagax-dhig
MOSS-TTSD
Multi-hoosaad wada hadalka sii wadidda qaabka - abuuro podcast-style wada hadalka leh ilaa 5 hadal jeediyayaashiisa iyo 60 daqiiqo oo ah audio isku xiran.
Afaf: en, zh
Dhagax-dhig
Ming-Omni TTS
Compact 0.5B omni-modal hadalka tusaale ka inclusionAI la high-fidelity 44.1kHz soo saarka iyo zero-shot codka isku-dhafan.
Afaf: en, zh
Dhagax-dhig
MOSS-TTS Nano
Tiny 100M MOSS-TTS kala duwanaansho — dhismaha isku mid ah, 80x yar, free-tier latency.
Afaf: en, zh, de, es, fr, ja, it, ko, ru, ar, pt
Dhagax-dhigDeveloper-First API
OpenAI-ku habboon REST API. Mid ka mid ah dhamaadka, 22+ qaabab. Streaming taageero loogu talagalay codsiyada waqtiga dhabta ah.
- Nidaam la jaanqaada OpenAI
- Streaming TTS loogu talagalay barnaamijyada waqtiga dhabta ah
- Batch processing shaqada weyn
- Ogeysiisyada Webhook
pip install ttsai
npm install @ttsainpm/ttsai
from tts_ai import TTSClient
client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
text="Hello from TTS.ai!",
model="kokoro",
voice="af_bella",
)
client.save(audio, "output.mp3")
Sahlan, Shaandhaynta Shaandhaynta
Bilow bilaash ah. Scale sida aad u koraan.
Bilaash
15,000 xaraf + 5,000/maalmood
- 7 noocyo bilaash ah oo ay ku jiraan Kokoro
- 5,000 xarfo oo ku saabsan dhalasho kasta
- API access ku jira
Bilow
500,000 xaraf / bilood
- Dhammaan 22+ qaabab
- 100,000 xarfo oo ku saabsan dhalasho
- Duubista Codka
Pro
2,000,000 xaraf / bilood
- Wax kasta oo Starter ah
- API access
- Waxqabadka Horumarinta
Ka eeg qorshayaasha oo dhan oo ay ku jiraan noocyada xarfaha →
Su'aalaha badanaa la waydiiyo
Maxaa aan ku hagaajin karnaa? Jawaabtaada waxay naga caawisaa inaan xallino dhibaatooyinka.
Bilow isticmaalka AI Voice maanta
Ku biir abuurayaasha, horumariyeyaasha, iyo ganacsiyada isticmaalaya TTS.ai