Free AI Text to Speech
22+ open-source models, 100+ voices, 32+ languages. No account required.
Everything you need for voice AI
26 tools powered by 24+ open-source AI models
22+ AI voice models
The largest collection of open-source TTS models on one platform
Kokoro Free
Kokoro is an 82 million parameter text-to-speech model that punches well above its weight class. Despite its tiny size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages including English, Japanese, Chinese, and Korean with a variety of expressive voices. It runs incredibly fast — generating audio nearly 100x faster than real-time on a GPU.
Best for: High-quality TTS with minimal latency, streaming applications
Try free
Piper Free
Piper is a lightweight text-to-speech engine developed by Rhasspy that uses VITS and larynx architectures. It runs entirely on CPU, making it ideal for edge devices, home automation, and applications requiring offline TTS. With over 100 voices across 30+ languages, Piper delivers natural-sounding speech at real-time speeds even on a Raspberry Pi 4.
Best for: Quick previews, accessibility, and embedded applications
Try free
VITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.
Best for: General-purpose text-to-speech with natural prosody
Try free
MeloTTS Free
MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.
Best for: Production applications that need fast, multilingual TTS
Try free
Bark Standard
Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.
Developer: Suno · Licence: MIT
Try it
Bark Small Standard
Lighter version of Bark with faster inference and lower memory usage.
Developer: Suno · Licence: MIT
Try it
CosyVoice 2 Standard
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Developer: Alibaba (Tongyi Lab) · Licence: Apache 2.0
Try it
Dia TTS Standard
Multi-speaker dialogue generation model that produces natural conversations between speakers.
Developer: Nari Labs · Licence: Apache 2.0
Try it
Parler TTS Standard
Describe the voice you want in natural language and Parler generates matching speech.
Developer: Hugging Face · Licence: Apache 2.0
Try it
IndexTTS-2 Standard
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Developer: Index Team · Licence: Apache 2.0
Try it
Spark TTS Standard
Voice cloning TTS with controllable emotion and speaking style via prompts.
Developer: SparkAudio · Licence: Apache 2.0
Try it
GPT-SoVITS Standard
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Developer: RVC-Boss · Licence: MIT
Try it
Orpheus Standard
Human-level emotional TTS model trained on 100K hours of speech data.
Developer: Canopy Labs · Licence: Llama 3.2 Community
Try it
Qwen3 TTS Standard
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Developer: Alibaba (Qwen) · Licence: Apache 2.0
Try it
CosyVoice 2
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Languages: en, zh, ja, ko, fr, de, it, es
Clone Voice
IndexTTS-2
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Languages: en, zh
Clone Voice
Spark TTS
Voice cloning TTS with controllable emotion and speaking style via prompts.
Languages: en, zh
Clone Voice
GPT-SoVITS
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Languages: en, zh, ja, ko
Clone Voice
Chatterbox
State-of-the-art zero-shot voice cloning with emotion control from Resemble AI.
Languages: en
Clone Voice
Tortoise TTS
Multi-voice text-to-speech focused on quality, built on an autoregressive architecture.
Languages: en
Clone Voice
OpenVoice
Instant voice cloning with granular control over style, emotion, and accent.
Languages: en, zh, ja, ko, fr, de, es, it
Clone Voice
Qwen3 TTS
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Languages: en, zh, ja, ko, de, fr, ru, pt, es, it
Clone Voice
Developer-first API
OpenAI-compatible REST API. One endpoint, 22+ models. Streaming support for real-time applications.
- OpenAI-compatible format
- Streaming TTS for real-time applications
- Batch processing for large workloads
- Webhook notifications
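import requests

# Request speech from the Kokoro model using the "af_bella" voice.
response = requests.post(
    "https://api.tts.ai/v1/tts/",
    headers={"Authorization": "Bearer sk-tts-xxx"},  # replace with your API key
    json={
        "model": "kokoro",
        "text": "Hello from TTS.ai!",
        "voice": "af_bella",
    },
    timeout=30,
)
response.raise_for_status()  # fail on an HTTP error instead of saving an error body

# Save the returned audio bytes to disk.
with open("output.mp3", "wb") as f:
    f.write(response.content)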
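For the streaming support mentioned above, here is a minimal sketch of writing the audio to disk as it arrives rather than buffering the whole response in memory. It reuses the endpoint, model, and voice from the example above and relies only on the Python standard library; the same pattern works with `requests` via `stream=True` and `iter_content`. Whether the endpoint streams by default or needs an extra request parameter is not stated on this page, so that part is an assumption. `iter_chunks`, `save_stream`, and `stream_tts` are hypothetical helper names, not part of the TTS.ai API.

```python
import json
import urllib.request
from typing import BinaryIO, Iterator

API_URL = "https://api.tts.ai/v1/tts/"  # endpoint taken from the example above


def iter_chunks(stream: BinaryIO, chunk_size: int = 4096) -> Iterator[bytes]:
    """Yield successive chunks from a file-like object until it is exhausted."""
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        yield chunk


def save_stream(stream: BinaryIO, path: str, chunk_size: int = 4096) -> int:
    """Copy `stream` to `path` chunk by chunk; return the number of bytes written."""
    total = 0
    with open(path, "wb") as f:
        for chunk in iter_chunks(stream, chunk_size):
            f.write(chunk)
            total += len(chunk)
    return total


def stream_tts(text: str, api_key: str, path: str = "output.mp3") -> int:
    """POST the text and write the audio to disk as it arrives.

    Assumption: the server sends the audio as the raw response body.
    """
    payload = {"model": "kokoro", "text": text, "voice": "af_bella"}
    request = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    # urlopen raises urllib.error.HTTPError on 4xx/5xx responses.
    with urllib.request.urlopen(request, timeout=60) as response:
        return save_stream(response, path)
```

Calling `stream_tts("Hello from TTS.ai!", "sk-tts-xxx")` would save the audio to `output.mp3` and return the number of bytes written. A real-time player would hand each chunk to an audio decoder instead of a file.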
Simple, transparent pricing
Start for free. Scale as you grow.
Free
50 credits
- Kokoro, Piper, VITS, MeloTTS
- 500-character limit
- 3 generations/hour (no account)
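Because the Free plan above caps each request at 500 characters, longer texts have to be split before they are sent. The following is a minimal sketch of one way to do that, breaking on sentence boundaries where possible. `split_text` is a hypothetical helper, not part of the TTS.ai API, and the platform may count characters differently than Python's `len` does.

```python
import re

FREE_TIER_LIMIT = 500  # characters per request, from the Free plan above


def split_text(text: str, limit: int = FREE_TIER_LIMIT) -> list[str]:
    """Split `text` into pieces of at most `limit` characters.

    Sentences are kept together where possible. A single sentence longer
    than `limit` is wrapped at a space, or cut mid-word as a last resort.
    """
    if limit < 1:
        raise ValueError("limit must be positive")
    chunks: list[str] = []
    current = ""
    for sentence in re.split(r"(?<=[.!?])\s+", text.strip()):
        if not sentence:
            continue
        # Hard-wrap any sentence that cannot fit in a single request.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            cut = sentence.rfind(" ", 0, limit + 1)
            if cut <= 0:
                cut = limit  # no space to break on: cut mid-word
            chunks.append(sentence[:cut].rstrip())
            sentence = sentence[cut:].lstrip()
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each returned piece can then be sent as the `text` field of its own request. Note that each request also counts against the hourly generation limit.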
Pro
2,000 credits/month
- Everything in Free
- API access
- Priority processing
Enterprise
10,000 credits/month
- Everything in Pro
- Unlimited API access
- Priority queue
Frequently asked questions
Start using AI voice today
Join the creators, developers, and businesses using TTS.ai