Free AI Test għal Diskors
22+ open-source mudelli, 100+ vuċijiet, 32+ L-ebda kont meħtieġ.
Dak kollu li għandek bżonn għall-vuċi AI
26 għodda mħaddma minn 24+ mudelli tal-AI b'sors miftuħ
22+ Mudelli tal-Vuċi AI
L-aktar kollezzjoni komprensiva ta' mudelli TTS b'sors miftuħ f'pjattaforma waħda
Kokoro Free
Kokoro is an 82 million parameter text-to-speech model that punches well above its weight class. Despite its tiny size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages including English, Japanese, Chinese, and Korean with a variety of expressive voices. It runs incredibly fast — generating audio nearly 100x faster than real-time on a GPU.
L-aħjar għal: High-quality TTS with minimal latency, streaming applications
Ipprova b'xejn
Piper Free
Piper is a lightweight text-to-speech engine developed by Rhasspy that uses VITS and larynx architectures. It runs entirely on CPU, making it ideal for edge devices, home automation, and applications requiring offline TTS. With over 100 voices across 30+ languages, Piper delivers natural-sounding speech at real-time speeds even on a Raspberry Pi 4.
L-aħjar għal: Quick previews, accessibility, and embedded applications
Ipprova b'xejn
VITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.
L-aħjar għal: General-purpose text-to-speech with natural prosody
Ipprova b'xejn
MeloTTS Free
MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.
L-aħjar għal: Applikazzjonijiet tal-produzzjoni li jeħtieġu veloċi, multilingwi TTS
Ipprova b'xejn
Bark Standard
Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.
Żviluppatur: Suno · Liċenzja: MIT
Ipprovaha
Bark Small Standard
Lighter version of Bark with faster inference and lower memory usage.
Żviluppatur: Suno · Liċenzja: MIT
Ipprovaha
CosyVoice 2 Standard
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Żviluppatur: Alibaba (Tongyi Lab) · Liċenzja: Apache 2.0
Ipprovaha
Dia TTS Standard
Mudell tal-ġenerazzjoni tad-djalogu b'ħafna kelliema li joħloq konversazzjonijiet naturali bejn kelliema.
Żviluppatur: Nari Labs · Liċenzja: Apache 2.0
Ipprovaha
Parler TTS Standard
Describe the voice you want in natural language and Parler generates matching speech.
Żviluppatur: Hugging Face · Liċenzja: Apache 2.0
Ipprovaha
IndexTTS-2 Standard
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Żviluppatur: Index Team · Liċenzja: Apache 2.0
Ipprovaha
Spark TTS Standard
Voice cloning TTS with controllable emotion and speaking style via prompts.
Żviluppatur: SparkAudio · Liċenzja: Apache 2.0
Ipprovaha
GPT-SoVITS Standard
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Żviluppatur: RVC-Boss · Liċenzja: MIT
Ipprovaha
Orpheus Standard
Human-level emotional TTS model trained on 100K hours of speech data.
Żviluppatur: Canopy Labs · Liċenzja: Llama 3.2 Community
Ipprovaha
Qwen3 TTS Standard
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Żviluppatur: Alibaba (Qwen) · Liċenzja: Apache 2.0
Ipprovaha
CosyVoice 2
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Lingwi: en, zh, ja, ko, fr, de, it, es
Il-vuċi tal-klonu
IndexTTS-2
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Lingwi: en, zh
Il-vuċi tal-klonu
Spark TTS
Voice cloning TTS with controllable emotion and speaking style via prompts.
Lingwi: en, zh
Il-vuċi tal-klonu
GPT-SoVITS
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Lingwi: en, zh, ja, ko
Il-vuċi tal-klonu
Chatterbox
State-of-the-art klonazzjoni vuċi żero-shot b'kontroll emozzjoni minn Resemble AI.
Lingwi: en
Il-vuċi tal-klonu
Tortoise TTS
Multi-vuċi test-to-diskors iffokat fuq il-kwalità bl-arkitettura autoregressive.
Lingwi: en
Il-vuċi tal-klonu
OpenVoice
Instant klonazzjoni vuċi b'kontroll granulari fuq l-istil, emozzjoni, u l-aċċent.
Lingwi: en, zh, ja, ko, fr, de, es, it
Il-vuċi tal-klonu
Qwen3 TTS
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Lingwi: en, zh, ja, ko, de, fr, ru, pt, es, it
Il-vuċi tal-klonuL-ewwel API tal-iżviluppatur
OpenAI-kompatibbli REST API. One endpoint, 22+ mudelli. Streaming appoġġ għall-applikazzjonijiet fil-ħin reali.
- Format kompatibbli ma’ OpenAI
- Streaming TTS għall-applikazzjonijiet fil-ħin reali
- Ipproċessar tal-lott għall-impjiegi kbar
- Notifiki tal-webhook
import requests
response = requests.post(
"https://api.tts.ai/v1/tts/",
headers={"Authorization": "Bearer sk-tts-xxx"},
json={
"model": "kokoro",
"text": "Hello from TTS.ai!",
"voice": "af_bella",
}
)
with open("output.mp3", "wb") as f:
f.write(response.content)
Sempliċi, prezzijiet trasparenti
Ibda b'xejn. Skala kif tikber.
Liberi
50 kreditu
- Kokoro, Piper, VITS, MeloTTS
- Limitu ta’ 500 karattru
- 3 gen/siegħa (l-ebda kont)
Starter
500 kreditu / xahar
- Kollha 22+ mudelli
- Limitu ta’ 5,000 karattru
- Klonazzjoni tal-vuċi
Għaliex
2,000 kreditu/xahar
- Kollox fi Starter
- Aċċess għall-API
- Ipproċessar ta’ prijorità
Intrapriża
10,000 kreditu/xahar
- Kollox fil-Pro
- API bl-ingrossa
- Kju ta’ prijorità
Mistoqsijiet Frekwenti (FAQ)
Ibda tuża AI Voice Illum
Ingħaqad kreaturi, żviluppaturi, u n-negozji li jużaw TTS.ai