Free AI Text to Speech
22+ open-source models, 100+ voices, 32+. No account required.
Everything you need for Voice AI
26 tools powered by 24+ open AI models
22+ AI voice models
A comprehensive collection of open-source TTS models on one platform
Kokoro Free
Kokoro is an 82 million parameter text-to-speech model that punches well above its weight class. Despite its tiny size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages including English, Japanese, Chinese, and Korean with a variety of expressive voices. It runs incredibly fast, generating audio nearly 100x faster than real time on a GPU.
Best for: High-quality TTS with minimal latency, streaming applications
Try it free
Piper Free
Piper is a lightweight text-to-speech engine developed by Rhasspy that uses VITS and larynx architectures. It runs entirely on CPU, making it ideal for edge devices, home automation, and applications requiring offline TTS. With over 100 voices across 30+ languages, Piper delivers natural-sounding speech at real-time speeds even on a Raspberry Pi 4.
Best for: Quick previews, accessibility, and embedded applications
Try it free
VITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.
Best for: General-purpose text-to-speech with natural prosody
Try it free
MeloTTS Free
MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.
Best for: Production applications that need fast, multilingual TTS
Try it free
Bark Standard
Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.
Developer: Suno · License: MIT
Try it
Bark Small Standard
Lighter version of Bark with faster inference and lower memory usage.
Developer: Suno · License: MIT
Try it
CosyVoice 2 Standard
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Developer: Alibaba (Tongyi Lab) · License: Apache 2.0
Try it
Dia TTS Standard
Multi-speaker dialogue generation model that creates natural conversations between speakers.
Developer: Nari Labs · License: Apache 2.0
Try it
Parler TTS Standard
Describe the voice you want in natural language and Parler generates matching speech.
Developer: Hugging Face · License: Apache 2.0
Try it
IndexTTS-2 Standard
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Developer: Index Team · License: Apache 2.0
Try it
Spark TTS Standard
Voice cloning TTS with controllable emotion and speaking style via prompts.
Developer: SparkAudio · License: Apache 2.0
Try it
GPT-SoVITS Standard
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Developer: RVC-Boss · License: MIT
Try it
Orpheus Standard
Human-level emotional TTS model trained on 100K hours of speech data.
Developer: Canopy Labs · License: Llama 3.2 Community
Try it
Qwen3 TTS Standard
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Developer: Alibaba (Qwen) · License: Apache 2.0
Try it
CosyVoice 2
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Languages: en, zh, ja, ko, fr, de, it, es
Clone voice
IndexTTS-2
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Languages: en, zh
Clone voice
Spark TTS
Voice cloning TTS with controllable emotion and speaking style via prompts.
Languages: en, zh
Clone voice
GPT-SoVITS
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Languages: en, zh, ja, ko
Clone voice
Chatterbox
State-of-the-art zero-shot voice cloning with emotion control from Resemble AI.
Languages: en
Clone voice
Tortoise TTS
Multi-voice text-to-speech focused on quality, with an autoregressive architecture.
Languages: en
Clone voice
OpenVoice
Instant voice cloning with granular control over style, emotion, and accent.
Languages: en, zh, ja, ko, fr, de, es, it
Clone voice
Qwen3 TTS
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Languages: en, zh, ja, ko, de, fr, ru, pt, es, it
Clone voice
Developer-First API
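When choosing a voice cloning model for a particular language, the language lists in the entries above can be encoded as a small lookup table. The sketch below is illustrative only: the dictionary keys are the display names from this page, not confirmed API model identifiers, and `models_for_language` is a hypothetical helper.

```python
# Language codes copied from the voice cloning entries on this page.
# Keys are display names; the API's actual model identifiers may differ.
VOICE_CLONING_LANGUAGES = {
    "CosyVoice 2": ["en", "zh", "ja", "ko", "fr", "de", "it", "es"],
    "IndexTTS-2": ["en", "zh"],
    "Spark TTS": ["en", "zh"],
    "GPT-SoVITS": ["en", "zh", "ja", "ko"],
    "Chatterbox": ["en"],
    "Tortoise TTS": ["en"],
    "OpenVoice": ["en", "zh", "ja", "ko", "fr", "de", "es", "it"],
    "Qwen3 TTS": ["en", "zh", "ja", "ko", "de", "fr", "ru", "pt", "es", "it"],
}


def models_for_language(code: str) -> list[str]:
    """Return the listed voice cloning models that support a language code."""
    code = code.strip().lower()
    return [
        name
        for name, languages in VOICE_CLONING_LANGUAGES.items()
        if code in languages
    ]


print(models_for_language("ru"))  # ['Qwen3 TTS']
print(models_for_language("ja"))
```

Based on the lists above, Russian and Portuguese appear only under Qwen3 TTS, while English is listed for every voice cloning model.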
OpenAI-compatible REST API. One endpoint, 22+ models. Streaming support for real-time applications.
- OpenAI-compatible format
- Streaming TTS for real-time applications
- Batch processing for large jobs
- Webhook notifications
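import requests

# Request speech from the TTS.ai endpoint.
response = requests.post(
    "https://api.tts.ai/v1/tts/",
    headers={"Authorization": "Bearer sk-tts-xxx"},  # replace with your API key
    json={
        "model": "kokoro",
        "text": "Hello from TTS.ai!",
        "voice": "af_bella",
    },
)
response.raise_for_status()  # fail early instead of saving an error body as audio

# Save the returned audio bytes.
with open("output.mp3", "wb") as f:
    f.write(response.content)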
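For real-time use, the same request can be read incrementally instead of waiting for the full response. The following is a minimal sketch that reuses the endpoint and payload from the example above and relies on the `requests` library's client-side `stream=True` and `iter_content`. Whether the server needs an extra parameter to enable streamed output is not stated on this page, so treat that as an open assumption. `write_audio_chunks` and `stream_tts` are hypothetical helper names.

```python
from typing import Iterable


def write_audio_chunks(chunks: Iterable[bytes], path: str) -> int:
    """Write audio chunks to a file as they arrive; return total bytes written."""
    total = 0
    with open(path, "wb") as f:
        for chunk in chunks:
            if chunk:  # skip empty keep-alive chunks
                f.write(chunk)
                total += len(chunk)
    return total


def stream_tts(text: str, api_key: str, path: str = "output.mp3",
               model: str = "kokoro", voice: str = "af_bella") -> int:
    """Request speech and save the body chunk by chunk (assumed streamable)."""
    import requests  # third-party; imported here so the helper above needs only the stdlib

    with requests.post(
        "https://api.tts.ai/v1/tts/",
        headers={"Authorization": f"Bearer {api_key}"},
        json={"model": model, "text": text, "voice": voice},
        stream=True,  # do not download the whole body up front
        timeout=30,
    ) as response:
        response.raise_for_status()
        return write_audio_chunks(response.iter_content(chunk_size=8192), path)
```

Calling `stream_tts("Hello from TTS.ai!", "sk-tts-xxx")` would write `output.mp3` as data arrives. A playback application could pass the chunks to an audio player instead of a file.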
Simple, transparent pricing
Start free. Scale as you grow.
Free
50 credits
- Kokoro, Piper, VITS, MeloTTS
- 500-character limit
- 3 generations/hour (no account)
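Longer texts have to be split to fit under the Free plan's 500-character limit. The sketch below shows one way to do that, breaking at sentence ends where possible. It assumes the limit applies per request and counts Python string characters, neither of which this page specifies. `split_for_limit` is a hypothetical helper, not part of any TTS.ai SDK.

```python
import re

FREE_TIER_CHAR_LIMIT = 500  # taken from the Free plan listed above


def split_for_limit(text: str, limit: int = FREE_TIER_CHAR_LIMIT) -> list[str]:
    """Split text into pieces of at most `limit` characters, preferring sentence ends."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # Hard-wrap any single sentence that is itself longer than the limit.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= limit:
            current = candidate  # sentence still fits in the current piece
        else:
            chunks.append(current)  # close the current piece and start a new one
            current = sentence
    if current:
        chunks.append(current)
    return chunks


print(split_for_limit("One. Two. Three.", limit=10))  # ['One. Two.', 'Three.']
```

Each piece can then be sent as its own request. Note that the Free plan's limit of 3 generations per hour without an account also caps how many pieces can be processed.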
Pro
2,000 credits / month
- Everything in Starter
- API access
- Priority processing
Enterprise
10,000 credits / month
- Everything in Pro
- Bulk API
- Priority queue
Frequently Asked Questions (FAQ)
Start using AI Voice today
Join the creators, developers, and businesses using TTS.ai