Vry Kunsmatige inteligensie Teks vir spraak
22+ oop- seurce modelle, 100+-stemme, 32+ tale. Geen rekening benodig.
Alles wat jy nodig het vir stemKI
26 hulpmiddels wat deur 24+ Open-onse-KI-modelle aangedryf word
22+ KI-stemmodel's
Die omvattendste versameling van ope-onsorce TTS modelle in een platform
Kokoro Free
Kokoro is an 82 million parameter text-to-speech model that punches well above its weight class. Despite its tiny size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages including English, Japanese, Chinese, and Korean with a variety of expressive voices. It runs incredibly fast — generating audio nearly 100x faster than real-time on a GPU.
Beste vir: High-quality TTS with minimal latency, streaming applications
Probeer vry
Piper Free
Piper is a lightweight text-to-speech engine developed by Rhasspy that uses VITS and larynx architectures. It runs entirely on CPU, making it ideal for edge devices, home automation, and applications requiring offline TTS. With over 100 voices across 30+ languages, Piper delivers natural-sounding speech at real-time speeds even on a Raspberry Pi 4.
Beste vir: Quick previews, accessibility, and embedded applications
Probeer vry
VITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.
Beste vir: General-purpose text-to-speech with natural prosody
Probeer vry
MeloTTS Free
MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.
Beste vir: Produksietoepassings wat vinnige, veeltalige TTS nodig het
Probeer vry
Bark Standard
Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.
Ontwikkelaar: Suno · Lisensie: MIT
Probeer dit
Bark Small Standard
Lighter version of Bark with faster inference and lower memory usage.
Ontwikkelaar: Suno · Lisensie: MIT
Probeer dit
CosyVoice 2 Standard
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Ontwikkelaar: Alibaba (Tongyi Lab) · Lisensie: Apache 2.0
Probeer dit
Dia TTS Standard
Multi- Conder dialoog model wat skep natuurlike gesprekke tussen sprekers.
Ontwikkelaar: Nari Labs · Lisensie: Apache 2.0
Probeer dit
Parler TTS Standard
Describe the voice you want in natural language and Parler generates matching speech.
Ontwikkelaar: Hugging Face · Lisensie: Apache 2.0
Probeer dit
IndexTTS-2 Standard
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Ontwikkelaar: Index Team · Lisensie: Apache 2.0
Probeer dit
Spark TTS Standard
Voice cloning TTS with controllable emotion and speaking style via prompts.
Ontwikkelaar: SparkAudio · Lisensie: Apache 2.0
Probeer dit
GPT-SoVITS Standard
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Ontwikkelaar: RVC-Boss · Lisensie: MIT
Probeer dit
Orpheus Standard
Human-level emotional TTS model trained on 100K hours of speech data.
Ontwikkelaar: Canopy Labs · Lisensie: Llama 3.2 Community
Probeer dit
Qwen3 TTS Standard
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Ontwikkelaar: Alibaba (Qwen) · Lisensie: Apache 2.0
Probeer dit
CosyVoice 2
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Tale: en, zh, ja, ko, fr, de, it, es
Geveinsde stem
IndexTTS-2
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Tale: en, zh
Geveinsde stem
Spark TTS
Voice cloning TTS with controllable emotion and speaking style via prompts.
Tale: en, zh
Geveinsde stem
GPT-SoVITS
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Tale: en, zh, ja, ko
Geveinsde stem
Chatterbox
State-van-die-art nul-skoot stem kloning met emosie kontrole van Resemble-KI.
Tale: en
Geveinsde stem
Tortoise TTS
Multi- fax- to-sech gefokus op kwaliteit met outoregressiewe argitektuur.
Tale: en
Geveinsde stem
OpenVoice
Instant voice cloning with granular control over style, emotion, and accent.
Tale: en, zh, ja, ko, fr, de, es, it
Geveinsde stem
Qwen3 TTS
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Tale: en, zh, ja, ko, de, fr, ru, pt, es, it
Geveinsde stemontwikkelaar- First API
OpenAI- versoenbaar met REST API. Een punt, 22+ modelle. Stroom ondersteuning vir werklike programme.
- OpenAI- versoenbaarte formaat
- Stroom TTS vir regte tyd apps
- Moenie vir groot werk verwerk word nie
- WebwerweName
import requests
response = requests.post(
"https://api.tts.ai/v1/tts/",
headers={"Authorization": "Bearer sk-tts-xxx"},
json={
"model": "kokoro",
"text": "Hello from TTS.ai!",
"voice": "af_bella",
}
)
with open("output.mp3", "wb") as f:
f.write(response.content)
Eenvoudig, deurskynend
Begin vry. Skaal namate jy groei.
Beskikbaar
50 krediete
- Kokoro, Piper, VITS, MeloTTS
- 500 karakterbeperking
- 3 gen/hour (geen rekening)
Pro
2 000 krediete/month
- Alles in Beginler
- API-toegang
- Prioriteitverwerking
Onderneming
10 000 krediete/onth
- Alles in Procrect
- Grootmaat API
- Prioriteit wagtou
Vrae wat dikwels gevra word
Begin vandag met die gebruik van KI-stem
Sluit by skeppers, ontwikkelaars en sakeondernemings aan deur TTS.ai te gebruik