22+ open-source models, 100+ voices, 32+ languages. No account required.

The widest selection of open-source TTS models on a single platform

Kokoro (Free)

Kokoro is an 82 million parameter text-to-speech model that punches well above its weight class. Despite its tiny size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages including English, Japanese, Chinese, and Korean with a variety of expressive voices. It runs incredibly fast — generating audio nearly 100x faster than real-time on a GPU.

Best for: High-quality TTS with minimal latency, streaming applications

Piper (Free)

Piper is a lightweight text-to-speech engine developed by Rhasspy that uses VITS and larynx architectures. It runs entirely on CPU, making it ideal for edge devices, home automation, and applications requiring offline TTS. With over 100 voices across 30+ languages, Piper delivers natural-sounding speech at real-time speeds even on a Raspberry Pi 4.

Best for: Quick previews, accessibility, and embedded applications

VITS (Free)

VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.

Best for: General-purpose text-to-speech with natural prosody

MeloTTS (Free)

MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.

Best for: Production applications that need fast, multilingual TTS

Bark (Standard)

Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.

Author: Suno · License: MIT

Bark Small (Standard)

Lighter version of Bark with faster inference and lower memory usage.

Author: Suno · License: MIT

CosyVoice 2 (Standard)

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Author: Alibaba (Tongyi Lab) · License: Apache 2.0

Dia TTS (Standard)

Multi-speaker dialogue generation model that produces natural conversations between speakers.

Author: Nari Labs · License: Apache 2.0

Parler TTS (Standard)

Describe the voice you want in natural language and Parler generates matching speech.

Author: Hugging Face · License: Apache 2.0

IndexTTS-2 (Standard)

Zero-shot TTS with fine-grained emotion control and high expressiveness.

Author: Index Team · License: Apache 2.0

Spark TTS (Standard)

Voice cloning TTS with controllable emotion and speaking style via prompts.

Author: SparkAudio · License: Apache 2.0

GPT-SoVITS (Standard)

Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.

Author: RVC-Boss · License: MIT

Orpheus (Standard)

Human-level emotional TTS model trained on 100K hours of speech data.

Author: Canopy Labs · License: Llama 3.2 Community

Qwen3 TTS (Standard)

Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.

Author: Alibaba (Qwen) · License: Apache 2.0

Chatterbox (Premium)

State-of-the-art zero-shot voice cloning with emotion control from Resemble AI.

Tortoise TTS (Premium)

StyleTTS 2 (Premium)

Human-level text-to-speech through style diffusion and adversarial training.

OpenVoice (Premium)

CosyVoice 2

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Languages: en, zh, ja, ko, fr, de, it, es

IndexTTS-2

Zero-shot TTS with fine-grained emotion control and high expressiveness.

Languages: en, zh

Spark TTS

Voice cloning TTS with controllable emotion and speaking style via prompts.

Languages: en, zh

GPT-SoVITS

Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.

Languages: en, zh, ja, ko

Chatterbox

State-of-the-art zero-shot voice cloning with emotion control from Resemble AI.

Languages: en

Tortoise TTS

Languages: en

OpenVoice

Languages: en, zh, ja, ko, fr, de, es, it

Qwen3 TTS

Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.

Languages: en, zh, ja, ko, de, fr, ru, pt, es, it
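The language lists in this section can be turned into a small lookup for picking a model by language. The sketch below copies the codes from the listing; `MODEL_LANGUAGES` and `models_for` are illustrative names and not part of any TTS.ai API.

```python
# Language codes copied from the model listing in this section.
# MODEL_LANGUAGES and models_for() are illustrative names, not a TTS.ai API.
MODEL_LANGUAGES = {
    "CosyVoice 2": ["en", "zh", "ja", "ko", "fr", "de", "it", "es"],
    "IndexTTS-2": ["en", "zh"],
    "Spark TTS": ["en", "zh"],
    "GPT-SoVITS": ["en", "zh", "ja", "ko"],
    "Chatterbox": ["en"],
    "Tortoise TTS": ["en"],
    "OpenVoice": ["en", "zh", "ja", "ko", "fr", "de", "es", "it"],
    "Qwen3 TTS": ["en", "zh", "ja", "ko", "de", "fr", "ru", "pt", "es", "it"],
}


def models_for(language_code):
    """Return the listed models that support the given language code."""
    code = language_code.strip().lower()
    return [name for name, langs in MODEL_LANGUAGES.items() if code in langs]


print(models_for("ko"))
```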

Developer-First API

OpenAI-compatible REST API. One endpoint, 22+ models. Streaming support for real-time applications.

  • Streaming TTS for real-time applications
Python
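import requests

# Request speech from the Kokoro model; replace sk-tts-xxx with your API key.
response = requests.post(
    "https://api.tts.ai/v1/tts/",
    headers={"Authorization": "Bearer sk-tts-xxx"},
    json={
        "model": "kokoro",
        "text": "Hello from TTS.ai!",
        "voice": "af_bella",
    },
    timeout=60,
)
# Fail early on HTTP errors instead of saving an error body as audio.
response.raise_for_status()

# Save the returned audio bytes to disk.
with open("output.mp3", "wb") as f:
    f.write(response.content)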
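For longer clips, writing the response to disk in chunks avoids holding the whole file in memory. The sketch below uses only the standard library. `save_audio_stream` is a hypothetical helper, and an in-memory buffer stands in for the HTTP response body so the example runs offline. How the endpoint delivers streamed audio is not described on this page, so treat the usage as an assumption.

```python
import io
import os
import tempfile


def save_audio_stream(source, path, chunk_size=8192):
    """Copy a binary file-like object to `path` in chunks; return bytes written."""
    total = 0
    with open(path, "wb") as out:
        while True:
            chunk = source.read(chunk_size)
            if not chunk:
                break
            out.write(chunk)
            total += len(chunk)
    return total


# Stand-in for an HTTP response body so the sketch runs offline.
fake_body = io.BytesIO(b"\x00" * 20000)
target = os.path.join(tempfile.mkdtemp(), "output.bin")
written = save_audio_stream(fake_body, target)
print(written)
```

With `requests`, passing `stream=True` to `requests.post` and iterating `response.iter_content(chunk_size=8192)` gives the same chunked behavior.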

Start free. Scale as you grow.

Free

$0

credits

  • Kokoro, Piper, VITS, MeloTTS
  • 500-character limit
  • 3 generations/hour (no account)

Starter

$9/mo

credits/month

  • 5,000-character limit

Pro

$29/mo

2,000 credits/month

  • Everything in Starter
  • API access

Enterprise

$99/mo

10,000 credits/month

  • Everything in Pro

View all plans including credit packs →

Frequently Asked Questions

TTS.ai is the largest AI audio platform, offering 22+ text-to-speech models, voice cloning, speech-to-text, and audio tools. All models are open source, with no vendor lock-in.

TTS.ai offers free text-to-speech with the Kokoro, Piper, VITS, and MeloTTS models. No account is required. Sign up to get 50 free credits and access to all models.

For speed, use Kokoro or Piper. For quality, use CosyVoice 2 or StyleTTS 2. For voice cloning, use Chatterbox or GPT-SoVITS. For dialogue, use Dia TTS. Try several models on the same text to compare them.

Yes. OpenAI-compatible REST API for TTS, STT, voice cloning, and audio tools. Available on Pro ($29/mo) and Enterprise ($99/mo) plans. View documentation at tts.ai/api/.

Voice quality varies by model. Paid models like CosyVoice 2, StyleTTS 2, and Chatterbox produce near-human speech with natural intonation and emotion. Free models like Kokoro offer excellent quality for most use cases.

TTS.ai supports 30+ languages across its model library. English has the broadest model support, but models like CosyVoice 2 cover Chinese, Japanese, and Korean; GPT-SoVITS handles Chinese, Japanese, Korean, and English; and MeloTTS supports English, Spanish, French, Chinese, Japanese, and Korean.

Yes. All processing happens on our GPU servers. We do not store your text input or the generated audio after delivery. Voice samples uploaded for cloning are used only for the current session and are not retained. We do not share your data with third parties or use it to train models.

Yes. All audio generated on TTS.ai is yours to use commercially, including for YouTube videos, podcasts, audiobooks, apps, advertisements, and products. Our models are open source under permissive licenses (MIT, Apache 2.0). No royalties or attribution required.

TTS.ai generates audio in WAV format by default for the best quality. You can convert it to MP3, FLAC, OGG, or M4A with our free Audio Converter tool. The API supports specifying your preferred output format directly in the request.
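Because the output may be WAV or another format depending on the request, a client can check the first bytes of the returned audio before choosing a file extension. The sketch below relies on the standard container signatures for the formats named above. `guess_audio_extension` is an illustrative helper, not something the API provides, and the check is a heuristic.

```python
def guess_audio_extension(data):
    """Guess a file extension from the first bytes of an audio payload."""
    if data[:4] == b"RIFF" and data[8:12] == b"WAVE":
        return ".wav"
    if data[:4] == b"fLaC":
        return ".flac"
    if data[:4] == b"OggS":
        return ".ogg"
    if data[:3] == b"ID3":
        # MP3 files often begin with an ID3v2 metadata tag.
        return ".mp3"
    # MPEG audio frames start with an 11-bit sync word: 0xFF, then 0b111xxxxx.
    if len(data) >= 2 and data[0] == 0xFF and (data[1] & 0xE0) == 0xE0:
        return ".mp3"
    # MP4/M4A files carry an "ftyp" box type at byte offset 4.
    if data[4:8] == b"ftyp":
        return ".m4a"
    # Unknown payload: fall back to a neutral extension.
    return ".bin"


print(guess_audio_extension(b"RIFF\x24\x00\x00\x00WAVEfmt "))
```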

Upload a short audio sample (as little as 5 seconds) of the voice you want to clone, then type any text to generate speech in that voice. Models like Chatterbox, GPT-SoVITS, and CosyVoice 2 support voice cloning. The cloned voice captures tone, accent, and speaking style.
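Before uploading a sample, it can help to confirm that it meets the 5-second minimum mentioned above. The sketch below measures an uncompressed WAV file with Python's standard `wave` module. The helper names are illustrative, `make_silence` exists only to produce demo input, and whether the service itself rejects shorter samples is an assumption.

```python
import io
import wave


def wav_duration_seconds(wav_bytes):
    """Return the duration of an uncompressed WAV payload in seconds."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_long_enough(wav_bytes, minimum_seconds=5.0):
    """Check a sample against the 5-second minimum mentioned above."""
    return wav_duration_seconds(wav_bytes) >= minimum_seconds


def make_silence(seconds, rate=16000):
    """Build a mono 16-bit silent WAV in memory, used here only as demo input."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))
    return buffer.getvalue()


print(is_long_enough(make_silence(6)))
```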

Free models (Kokoro, Piper, VITS, MeloTTS) require no account and cost zero credits. Standard models (2 credits/1K characters) include Bark, CosyVoice 2, F5-TTS, and Dia. Premium models (4 credits/1K characters) include OpenVoice, Chatterbox, StyleTTS 2, and Tortoise. Paid models generally offer higher quality, more voices, and additional features like voice cloning.
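The per-tier rates above make it easy to estimate cost before generating. The sketch below assumes credits are charged pro rata and rounded up to the next whole credit. That rounding rule is an assumption, not something stated here, and `estimate_credits` is an illustrative helper.

```python
# Rates from the tier description above, in credits per 1,000 characters.
CREDITS_PER_1K = {"free": 0, "standard": 2, "premium": 4}


def estimate_credits(text, tier):
    """Estimate credits for `text`, assuming pro-rata billing rounded up."""
    rate = CREDITS_PER_1K[tier]
    # Integer ceiling division avoids floating-point rounding surprises.
    return -(-len(text) * rate // 1000)


print(estimate_credits("a" * 1500, "standard"))
```

Under that assumption, a 2,500-character script costs 5 credits on a Standard model and 10 on a Premium model.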

No. The API supports batch processing for converting large volumes of text to speech. Submit multiple requests and receive the results asynchronously using job UUIDs. Enterprise plans ($99/mo) include priority access for faster batch processing. Ideal for audiobook production, course content, and large-scale voiceover projects.
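For batch work, long documents usually need to be split into request-sized pieces first. The sketch below splits text at sentence boundaries and falls back to a hard cut for oversized sentences. The 5,000-character default mirrors the limit in the plan listing, but applying it per API request is an assumption, and `split_text` is an illustrative helper.

```python
import re


def split_text(text, max_chars=5000):
    """Split text into chunks of at most max_chars, preferring sentence boundaries."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        # A single sentence longer than the limit is hard-split.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            # The next sentence does not fit, so close the current chunk.
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


print(split_text("One. Two. Three.", max_chars=9))
```

Each chunk can then be submitted as its own request, with the resulting audio joined in order.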

Join the creators, developers, and businesses using TTS.ai