Slobodna AL Tekst za govor
22+ modeli otvorenog izvora, 100+ glasova, 32+ jezici. Nije potreban račun.
Sve što trebaš za glasovnu inteligenciju
26 alata koji pokreću 24+ Open-source modeli AI
22+ AI Glasovni modeli
Najopsežnija kolekcija modela TTS otvorenog izvora u jednoj platformi
Kokoro Free
Kokoro is an 82 million parameter text-to-speech model that punches well above its weight class. Despite its tiny size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages including English, Japanese, Chinese, and Korean with a variety of expressive voices. It runs incredibly fast — generating audio nearly 100x faster than real-time on a GPU.
Najbolje za: High-quality TTS with minimal latency, streaming applications
Pokušaj slobodnoPiper Free
Piper is a lightweight text-to-speech engine developed by Rhasspy that uses VITS and larynx architectures. It runs entirely on CPU, making it ideal for edge devices, home automation, and applications requiring offline TTS. With over 100 voices across 30+ languages, Piper delivers natural-sounding speech at real-time speeds even on a Raspberry Pi 4.
Najbolje za: Quick previews, accessibility, and embedded applications
Pokušaj slobodnoVITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.
Najbolje za: General-purpose text-to-speech with natural prosody
Pokušaj slobodnoMeloTTS Free
MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.
Najbolje za: Proizvodnja zahtjeva za brzim, višejezičnim TTS-om
Pokušaj slobodnoBark Standard
Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.
Razvojnik: Suno · Dozvola: MIT
Probaj.Bark Small Standard
Lighter version of Bark with faster inference and lower memory usage.
Razvojnik: Suno · Dozvola: MIT
Probaj.CosyVoice 2 Standard
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Razvojnik: Alibaba (Tongyi Lab) · Dozvola: Apache 2.0
Probaj.Dia TTS Standard
Višezvučnički dijaloški model koji stvara prirodne razgovore između zvučnika.
Razvojnik: Nari Labs · Dozvola: Apache 2.0
Probaj.Parler TTS Standard
Describe the voice you want in natural language and Parler generates matching speech.
Razvojnik: Hugging Face · Dozvola: Apache 2.0
Probaj.IndexTTS-2 Standard
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Razvojnik: Index Team · Dozvola: Apache 2.0
Probaj.Spark TTS Standard
Voice cloning TTS with controllable emotion and speaking style via prompts.
Razvojnik: SparkAudio · Dozvola: Apache 2.0
Probaj.GPT-SoVITS Standard
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Razvojnik: RVC-Boss · Dozvola: MIT
Probaj.Orpheus Standard
Human-level emotional TTS model trained on 100K hours of speech data.
Razvojnik: Canopy Labs · Dozvola: Llama 3.2 Community
Probaj.Qwen3 TTS Standard
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Razvojnik: Alibaba (Qwen) · Dozvola: Apache 2.0
Probaj.CosyVoice 2
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Jezici: en, zh, ja, ko, fr, de, it, es
Kloniranje glasaIndexTTS-2
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Jezici: en, zh
Kloniranje glasaSpark TTS
Voice cloning TTS with controllable emotion and speaking style via prompts.
Jezici: en, zh
Kloniranje glasaGPT-SoVITS
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Jezici: en, zh, ja, ko
Kloniranje glasaChatterbox
Najmoderniji kloniranje glasa s kontrolom emocija iz Resemble AI-a.
Jezici: en
Kloniranje glasaTortoise TTS
Višeglasni tekst-na-speech fokusiran na kvalitetu s autoregresivnom arhitekturom.
Jezici: en
Kloniranje glasaOpenVoice
Instant voice cloning with granular control over style, emotion, and accent.
Jezici: en, zh, ja, ko, fr, de, es, it
Kloniranje glasaQwen3 TTS
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Jezici: en, zh, ja, ko, de, fr, ru, pt, es, it
Kloniranje glasaProgramer- prvi API
OpenAI kompatibilan REST API. Jedan ishod, 22+ modeli. Streaming support for real-time applications.
- OpenAI kompatibilan format
- Streaming TTS za aplikacije u realnom vremenu
- Paketska obrada za velike poslove
- Webhook obavijesti
import requests
response = requests.post(
"https://api.tts.ai/v1/tts/",
headers={"Authorization": "Bearer sk-tts-xxx"},
json={
"model": "kokoro",
"text": "Hello from TTS.ai!",
"voice": "af_bella",
}
)
with open("output.mp3", "wb") as f:
f.write(response.content)
Jednostavna, prozirna cijena
Počnite slobodno.
Slobodno
50 kredita
- Kokoro, Piper, VITS, MeloTTS
- Ograničenje znaka
- 3 gen/sat (bez računa)
Profesionalno
2.000 kredita/mjesečno
- Sve u Starteru
- API pristup
- Prioritetna obrada
Česta pitanja
Počnite koristiti AI glas danas
Pridružite se kreatorima, programerima i tvrtkama koristeći TTS.ai