Free AI Text to SpeechQuery
22+ open-source mafano, 100+ mawu, 32+ Palibe akaunti zofunika.
Zonse zomwe muyenera kudziwa za Voice AI
Zipangizo za 26 zomwe zimapangidwa ndi 24+ open-source AI models
22+ AI Models za mawu
Kusonkhanitsa kwakukulu kwambiri kwa ma TTS open-source models m'modzi m'modzi
Kokoro Free
Kokoro is an 82 million parameter text-to-speech model that punches well above its weight class. Despite its tiny size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages including English, Japanese, Chinese, and Korean with a variety of expressive voices. It runs incredibly fast — generating audio nearly 100x faster than real-time on a GPU.
Oyenera kwa: High-quality TTS with minimal latency, streaming applications
Phunzirani kwaulere
Piper Free
Piper is a lightweight text-to-speech engine developed by Rhasspy that uses VITS and larynx architectures. It runs entirely on CPU, making it ideal for edge devices, home automation, and applications requiring offline TTS. With over 100 voices across 30+ languages, Piper delivers natural-sounding speech at real-time speeds even on a Raspberry Pi 4.
Oyenera kwa: Quick previews, accessibility, and embedded applications
Phunzirani kwaulere
VITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.
Oyenera kwa: General-purpose text-to-speech with natural prosody
Phunzirani kwaulere
MeloTTS Free
MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.
Oyenera kwa: Ntchito zopanga zomwe zimafunikira TTS yofulumira komanso yosiyanasiyana
Phunzirani kwaulere
Bark Standard
Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.
Wopanga: Suno · License: MIT
Yambitsani
Bark Small Standard
Lighter version of Bark with faster inference and lower memory usage.
Wopanga: Suno · License: MIT
Yambitsani
CosyVoice 2 Standard
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Wopanga: Alibaba (Tongyi Lab) · License: Apache 2.0
Yambitsani
Dia TTS Standard
Multi-wokamba nkhani dialogue chitukuko chitsanzo chomwe chimaumba zokambirana zachilengedwe pakati pa wokamba nkhani.
Wopanga: Nari Labs · License: Apache 2.0
Yambitsani
Parler TTS Standard
Describe the voice you want in natural language and Parler generates matching speech.
Wopanga: Hugging Face · License: Apache 2.0
Yambitsani
IndexTTS-2 Standard
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Wopanga: Index Team · License: Apache 2.0
Yambitsani
Spark TTS Standard
Voice cloning TTS with controllable emotion and speaking style via prompts.
Wopanga: SparkAudio · License: Apache 2.0
Yambitsani
GPT-SoVITS Standard
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Wopanga: RVC-Boss · License: MIT
Yambitsani
Orpheus Standard
Human-level emotional TTS model trained on 100K hours of speech data.
Wopanga: Canopy Labs · License: Llama 3.2 Community
Yambitsani
Qwen3 TTS Standard
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Wopanga: Alibaba (Qwen) · License: Apache 2.0
Yambitsani
CosyVoice 2
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Zilankhulo: en, zh, ja, ko, fr, de, it, es
Clone Voice
IndexTTS-2
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Zilankhulo: en, zh
Clone Voice
Spark TTS
Voice cloning TTS with controllable emotion and speaking style via prompts.
Zilankhulo: en, zh
Clone Voice
GPT-SoVITS
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Zilankhulo: en, zh, ja, ko
Clone Voice
Chatterbox
State-of-the-art zero-shot voice cloning ndi kuwongolera maganizo kuchokera ku Resemble AI.
Zilankhulo: en
Clone Voice
Tortoise TTS
Multi-voice text-to-speech yodziyimira pawokha yodziyimira pawokha yodziyimira pawokha.
Zilankhulo: en
Clone Voice
OpenVoice
Instant voice cloning with granular control over style, emotion, and accent.
Zilankhulo: en, zh, ja, ko, fr, de, es, it
Clone Voice
Qwen3 TTS
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Zilankhulo: en, zh, ja, ko, de, fr, ru, pt, es, it
Clone VoiceDeveloper-First API
OpenAI-kugwirizana REST API. One endpoint, 22 + mafano. Streaming thandizo kwa real-time mapulogalamu.
- Format yogwirizana ndi OpenAI
- Streaming TTS kwa real-time mapulogalamu
- Batch processing kwa ntchito zazikulu
- Zidziwitso za Webhook
import requests
response = requests.post(
"https://api.tts.ai/v1/tts/",
headers={"Authorization": "Bearer sk-tts-xxx"},
json={
"model": "kokoro",
"text": "Hello from TTS.ai!",
"voice": "af_bella",
}
)
with open("output.mp3", "wb") as f:
f.write(response.content)
Zosavuta, Zowoneka bwino Zotsatsa
Kuyamba kwaulere. Scale monga mukukula.
_Yaulere
50 credits
- Kokoro, Piper, VITS, MeloTTS
- 500 chizindikiro malire
- 3 gen / ola (opanda akaunti)
Woyamba
500 credits / mwezi
- onse 22+ zojambula
- 5,000 characters limit
- Chizindikiro cha mawu
Pro
2,000 credits / mwezi
- Zonse mu Starter
- Kugwiritsa ntchito API
- Priority processing
Enterprise
10,000 credits / mwezi
- Zonse mu Pro
- Mphamvu ya API
- Priority queue
Funso Lofunsidwa Kawirikawiri
Kuyamba kugwiritsa ntchito AI Voice lero
Join opanga, opanga, ndi makampani pogwiritsa ntchito TTS.ai