I-Free AI Okubhaliweyo ukuya kuSpeechName
22+ open-source models, 100+ voices, 32+ iilwimi. Akukho akhawunti ifunekayo.
Yonke into oyifunayo kwi Voice AI
Izixhobo ezili-26 ezixhaswa ziimodeli ze-24+ open-source AI
Iimodeli zesandi ze-22+ AI
Uluhlu olupheleleyo lweemodeli ze-TTS ezinomthombo ovulekileyo kwi-platform enye
Kokoro Free
Kokoro is an 82 million parameter text-to-speech model that punches well above its weight class. Despite its tiny size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages including English, Japanese, Chinese, and Korean with a variety of expressive voices. It runs incredibly fast — generating audio nearly 100x faster than real-time on a GPU.
Elungileyo ku: High-quality TTS with minimal latency, streaming applications
Zama simahla
Piper Free
Piper is a lightweight text-to-speech engine developed by Rhasspy that uses VITS and larynx architectures. It runs entirely on CPU, making it ideal for edge devices, home automation, and applications requiring offline TTS. With over 100 voices across 30+ languages, Piper delivers natural-sounding speech at real-time speeds even on a Raspberry Pi 4.
Elungileyo ku: Quick previews, accessibility, and embedded applications
Zama simahla
VITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.
Elungileyo ku: General-purpose text-to-speech with natural prosody
Zama simahla
MeloTTS Free
MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.
Elungileyo ku: Iinkqubo zokuvelisa ezifuna i-TTS ekhawulezayo, eneelwimi ezininzi
Zama simahla
Bark Standard
Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.
Umbhekisi phambili: Suno · Ilayisenisi: MIT
Zama kwakhona
Bark Small Standard
Lighter version of Bark with faster inference and lower memory usage.
Umbhekisi phambili: Suno · Ilayisenisi: MIT
Zama kwakhona
CosyVoice 2 Standard
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Umbhekisi phambili: Alibaba (Tongyi Lab) · Ilayisenisi: Apache 2.0
Zama kwakhona
Dia TTS Standard
Imodeli yokuveliswa kwencoko yababini yesandi esininzi eyenza unxibelelwano oluqhelekileyo phakathi kwamasandi.
Umbhekisi phambili: Nari Labs · Ilayisenisi: Apache 2.0
Zama kwakhona
Parler TTS Standard
Describe the voice you want in natural language and Parler generates matching speech.
Umbhekisi phambili: Hugging Face · Ilayisenisi: Apache 2.0
Zama kwakhona
IndexTTS-2 Standard
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Umbhekisi phambili: Index Team · Ilayisenisi: Apache 2.0
Zama kwakhona
Spark TTS Standard
Voice cloning TTS with controllable emotion and speaking style via prompts.
Umbhekisi phambili: SparkAudio · Ilayisenisi: Apache 2.0
Zama kwakhona
GPT-SoVITS Standard
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Umbhekisi phambili: RVC-Boss · Ilayisenisi: MIT
Zama kwakhona
Orpheus Standard
Human-level emotional TTS model trained on 100K hours of speech data.
Umbhekisi phambili: Canopy Labs · Ilayisenisi: Llama 3.2 Community
Zama kwakhona
Qwen3 TTS Standard
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Umbhekisi phambili: Alibaba (Qwen) · Ilayisenisi: Apache 2.0
Zama kwakhona
CosyVoice 2
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Iilwimi: en, zh, ja, ko, fr, de, it, es
Ilizwi lika-Clone
IndexTTS-2
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Iilwimi: en, zh
Ilizwi lika-Clone
Spark TTS
Voice cloning TTS with controllable emotion and speaking style via prompts.
Iilwimi: en, zh
Ilizwi lika-Clone
GPT-SoVITS
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Iilwimi: en, zh, ja, ko
Ilizwi lika-Clone
Chatterbox
I-state-of-the-art zero-shot voice cloning ngolawulo lweemvakalelo ukusuka kwi-Resemble AI.
Iilwimi: en
Ilizwi lika-Clone
Tortoise TTS
Umbhalo-uku-thetha ngelizwi elininzi olujolise kumgangatho kunye noyilo oluya ezantsi ngokuzenzekelayo.
Iilwimi: en
Ilizwi lika-Clone
OpenVoice
Instant voice cloning with granular control over style, emotion, and accent.
Iilwimi: en, zh, ja, ko, fr, de, es, it
Ilizwi lika-Clone
Qwen3 TTS
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Iilwimi: en, zh, ja, ko, de, fr, ru, pt, es, it
Ilizwi lika-CloneUmbhekisi phambili-Okuqalayo
I-REST API ehambelana ne-OpenAI. Incopho enye yesiphelo, iimodeli ezingaphezu kwe-22. Inkxaso yosasazo lwezicelo zexesha elibonakalayo.
- Ifomati ehambelana ne-OpenAI
- Unikezelo lwe-TTS lweenkqubo zexesha elibonakalayo
- Uqhubekeko lweqela lomsebenzi omkhulu
- Isaziso se Webhook
import requests
response = requests.post(
"https://api.tts.ai/v1/tts/",
headers={"Authorization": "Bearer sk-tts-xxx"},
json={
"model": "kokoro",
"text": "Hello from TTS.ai!",
"voice": "af_bella",
}
)
with open("output.mp3", "wb") as f:
f.write(response.content)
Ixabiso elilula, elicacileyo
Qala ngokukhululekileyo. Ubungakanani njengoko ukhula.
Ekhululekileyo
50 credits
- Kokoro, Piper, VITS, MeloTTS
- Umda wophawu lwe 500
- 3 gen/iyure (akukho akhawunti)
Isiqalisi
500 credits/month
- Zonke iimodeli ezingama-22+
- 5,000 umda wophawu
- I-Voice Cloning
I-Pro
2,000 iikhredithi/inyanga
- Yonke into kwisiqalisi
- Ufikelelo lwe-API
- Ukuqhubekeka okuphambili
I-Entreprise
10,000 iikhredithi/inyanga
- Yonke into kwi-Pro
- I-Bulk API
- Ufolo oluphambili
Imibuzo ebuzwa rhoqo
Qala Ukusebenzisa i-AI Voice Namhlanje
Join creators, developers, and businesses using TTS.ai