Free AI @ action

31+ open-source models, 231+ Waɗanda suka kãfirta, 34+ Babu bukata ga asusun.

8K+
@ title: window
30K+
generations
31+
QPrintPreviewDialog
231+
preview-size
0/500 @ action · Sign up for 5,000 per generation → QDialogButtonBox
Yaushe kake son TTS.ai? Ka gaya wa abokanka!

31+ @ item Spelling dictionary

Babban rukunin kayan aiki na TTS mai ma'ana a cikin dandamali guda

KokoroKokoro Free

Kokoro wani nau'in rubutu zuwa magana mai paramita miliyan 82 ne wanda ke dauke da nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nauyin nau

Mafi kyawun ga: TTS mai inganci mai kyau tare da ƙarancin lokaci, shirye-shiryen tashar watsawa

QDialogButtonBox

PiperPiper Free

Piper wani mai sarrafa rubutu zuwa magana ne mai sauƙi wanda Rhasspy ya kirkiro wanda ke amfani da VITS da larynx architectures. Yana tafiyar da shi gaba ɗaya akan CPU, yana sanya shi mafi kyau ga na'urorin gefe, aikace-aikacen gida, da kuma aikace-aikacen da ke buƙatar TTS na waje. Tare da fiye da 100 na sauti a cikin harsuna 30 +, Piper yana bayar da magana mai sauti na halitta a cikin saurin lokaci na gaskiya har ma a kan Raspberry Pi 4.

Mafi kyawun ga: Previews quick, accessibility, and embedded applications

QDialogButtonBox

VITSVITS Free

@ info: shell

Mafi kyawun ga: KCharselect unicode block name

QDialogButtonBox

MeloTTSMeloTTS Free

MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.

Mafi kyawun ga: Shiryoyin ayuka na samarwa suna buƙatar TTS mai sauri, da ya ƙunshi yarukan da dama

QDialogButtonBox

OuteTTSOuteTTS Free

OuteTTS na faɗaɗa manyan nau'ikan harshe tare da damar rubutu zuwa magana yayin kiyaye tsarin asali. Yana goyon bayan wasu masu goyon baya ciki har da llama.cpp (CPU / GPU), Hugging Face Transformers, ExLlamaV2, VLLM, da har ma da nazarin mai bincike ta hanyar Transformers.js.

Mafi kyawun ga: @ info: status

QDialogButtonBox

Pocket TTSPocket TTS Free

Pocket TTS daga Kyutai (mawallafa na Moshi) wani ma'aunin rubutu zuwa magana mai girman 100M ne wanda ke dauke da nauyinsa. Yana aiki da kyau a kan CPU, yana goyon bayan ƙirƙirar sauti mai zafi daga misalin sauti guda, kuma yana samar da magana mai sauti na halitta. Girman ma'aunin ƙarami yana sa shi ya dace da amfani da gefe da wuraren da ke da albarkatu masu yawa.

Mafi kyawun ga: @ info: status

QDialogButtonBox

Kitten TTSKitten TTS Free

Kitten TTS by KittenML is an ultra-lightweight text-to-speech model built on ONNX. With variants from 15M to 80M parameters (25-80 MB on disk), it delivers high-quality voice synthesis on CPU without requiring a GPU. Features 8 built-in voices, adjustable speech speed, and built-in text preprocessing for numbers, currencies, and units. Ideal for edge deployment and low-latency applications.

Mafi kyawun ga: Fast lightweight TTS, edge deployment, low-latency applications

QDialogButtonBox

BarkBark Standard

Wani nau'in rubutu zuwa sauti wanda yake da tushe a kan mai canzawa wanda ke samar da magana mai gaskiya, kiɗa, da kuma sakamako na sauti.

Mawallafi: Suno · Lasisi: MIT

@ action

Bark SmallBark Small Standard

Wani nau'in Bark mai sauƙi da sauri da amfani da ƙwaƙwalwa mai ƙaranci.

Mawallafi: Suno · Lasisi: MIT

@ action

CosyVoice 2CosyVoice 2 Standard

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Mawallafi: Alibaba (Tongyi Lab) · Lasisi: Apache 2.0

@ action

Dia TTSDia TTS Standard

@ item: inlistbox

Mawallafi: Nari Labs · Lasisi: Apache 2.0

@ action

Parler TTSParler TTS Standard

Ka bayyana maganar da kake so cikin harshe na halitta kuma Parler zai samar da maganar da ta dace.

Mawallafi: Hugging Face · Lasisi: Apache 2.0

@ action

GLM-TTSGLM-TTS Standard

QDialogButtonBox

Mawallafi: Zhipu AI · Lasisi: GLM-4 License

@ action

IndexTTS-2IndexTTS-2 Standard

Zero-shot TTS tare da fine-grained damuwa kula da kuma high bayyanawa.

Mawallafi: Index Team · Lasisi: Bilibili Model License

@ action

Spark TTSSpark TTS Standard

TTS na ƙãga halittar magana tare da jin daɗin da ake iya kulawa da kuma salon magana ta hanyar tambayoyi.

Mawallafi: SparkAudio · Lasisi: CC BY-NC-SA 4.0

@ action

GPT-SoVITSGPT-SoVITS Standard

TTS mai kwaikwayon sauti mai yawa wanda ke sakewa duk wani sauti daga sakan 5 na sauti kawai.

Mawallafi: RVC-Boss · Lasisi: MIT

@ action

OrpheusOrpheus Standard

@ item: inlistbox

Mawallafi: Canopy Labs · Lasisi: Llama 3.2 Community

@ action

Qwen3 TTSQwen3 TTS Standard

TTS na Alibaba mai yarukan da dama tare da ƙirƙirar sauti, saita sauti, da ƙirar sauti daga rubutu.

Mawallafi: Alibaba (Qwen) · Lasisi: Apache 2.0

@ action

Chatterbox TurboChatterbox Turbo Standard

Mai sauri Chatterbox tare da ƙarin-200ms da paralinguistic tags don murmushi, zazzabi, da kuma sauran.

Mawallafi: Resemble AI · Lasisi: MIT

@ action

Dia 2Dia 2 Standard

TTS mai magana da sauti mai gudu-na farko tare da tattaunawa da masu magana da yawa da kuma paralinguistic cues.

Mawallafi: Nari Labs · Lasisi: Apache 2.0

@ action

VoxCPMVoxCPM Standard

Tokenizer-free TTS producing 44.1kHz audio with context-aware paragraph consistency.

Mawallafi: OpenBMB · Lasisi: Apache 2.0

@ action

TADATADA Standard

Zero-hallucination TTS tare da rubutu-acoustic dual aligning, 5x sauri fiye da kwatanta LLM TTS.

Mawallafi: Hume AI · Lasisi: MIT

@ action

VibeVoiceVibeVoice Standard

Microsoft model for long-form multi-speaker content like podcasts and audiobooks.

Mawallafi: Microsoft · Lasisi: MIT

@ action

CosyVoice3CosyVoice3 Standard

Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.

Mawallafi: Alibaba (FunAudioLLM) · Lasisi: Apache 2.0

@ action

ChatterboxChatterbox Premium

State-of-the-art zero-shot voice cloning tare da kula da jin dadi daga Resemble AI.

QPrintPreviewDialog

@ action

Tortoise TTSTortoise TTS Premium

@ info: status

QPrintPreviewDialog

@ action

StyleTTS 2StyleTTS 2 Premium

Man-level text-to-speech ta hanyar style diffusion da kuma training.

QPrintPreviewDialog

@ action

OpenVoiceOpenVoice Premium

@ action

QPrintPreviewDialog

@ action

Sesame CSMSesame CSM Premium

Tsarin maganar maganar da ke samar da tattaunawa ta halitta tare da lokacin da ya dace da kuma jin dadi.

QPrintPreviewDialog

@ action

MOSS-TTSMOSS-TTS Premium

Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.

QPrintPreviewDialog

@ action

MegaTTS3MegaTTS3 Premium

ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.

QPrintPreviewDialog

@ action

CosyVoice 2CosyVoice 2

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Yare: en, zh, ja, ko, fr, de, it, es

@ action

GLM-TTSGLM-TTS

QDialogButtonBox

Yare: en, zh

@ action

IndexTTS-2IndexTTS-2

Zero-shot TTS tare da fine-grained damuwa kula da kuma high bayyanawa.

Yare: en, zh

@ action

Spark TTSSpark TTS

TTS na ƙãga halittar magana tare da jin daɗin da ake iya kulawa da kuma salon magana ta hanyar tambayoyi.

Yare: en, zh

@ action

GPT-SoVITSGPT-SoVITS

TTS mai kwaikwayon sauti mai yawa wanda ke sakewa duk wani sauti daga sakan 5 na sauti kawai.

Yare: en, zh, ja, ko

@ action

ChatterboxChatterbox

State-of-the-art zero-shot voice cloning tare da kula da jin dadi daga Resemble AI.

Yare: en

@ action

Tortoise TTSTortoise TTS

@ info: status

Yare: en

@ action

OpenVoiceOpenVoice

@ action

Yare: en, zh, ja, ko, fr, de, es, it

@ action

Qwen3 TTSQwen3 TTS

TTS na Alibaba mai yarukan da dama tare da ƙirƙirar sauti, saita sauti, da ƙirar sauti daga rubutu.

Yare: en, zh, ja, ko, de, fr, ru, pt, es, it

@ action

Chatterbox TurboChatterbox Turbo

Mai sauri Chatterbox tare da ƙarin-200ms da paralinguistic tags don murmushi, zazzabi, da kuma sauran.

Yare: en

@ action

VoxCPMVoxCPM

Tokenizer-free TTS producing 44.1kHz audio with context-aware paragraph consistency.

Yare: en, zh

@ action

OuteTTSOuteTTS

LLM-da aka dogara TTS da ke tafiya a kan CPU, GPU, ko mai bincike ta hanyar llama.cpp da Transformers.js.

Yare: en

@ action

Pocket TTSPocket TTS

@ item: inlistbox

Yare: en, fr

@ action

CosyVoice3CosyVoice3

Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.

Yare: en, zh, ja, ko, de, es, fr, it, ru

@ action

MOSS-TTSMOSS-TTS

Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.

Yare: en, zh, de, es, fr, ja, it, hu, ko, ru, fa, ar, pl, pt, cs, da, sv, el, tr

@ action

MegaTTS3MegaTTS3

ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.

Yare: en, zh

@ action

Developer-First API

OpenAI-compatible REST API. One endpoint, 22+ models. Streaming support for real-time applications.

  • QPrintPreviewDialog
  • Streaming TTS ga shiryoyin ayuka na lokaci na gaskiya
  • Preview-size
  • QDialogButtonBox
Nuna takardun API
pip install ttsai npm install @ttsainpm/ttsai
Python
from tts_ai import TTSClient

client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
    text="Hello from TTS.ai!",
    model="kokoro",
    voice="af_bella",
)
client.save(audio, "output.mp3")

QPrintPreviewDialog

Ka fara kyauta. Ka girma kamar yadda kake girma.

QDialogButtonBox

$0

@ action

  • Kokoro, Piper, VITS, MeloTTS
  • 500 haƙƙin haɗewa
  • 3 gen/hour (ba a da asusun)
Yi rijista

@ action

$9/MB

500,000 characters/month

  • @ label: textbox
  • @ item font
  • QShortcut
@ action
QShortcut

QShortcut

$29/MB

2,000,000 characters/month

  • Duk abin da ke cikin Mai Farawa
  • Aika API
  • QDialogButtonBox
@ action

QShortcut

$99/MB

10,000,000 characters/month

  • All in Pro
  • QDialogButtonBox
  • QFileDialog
QDialogButtonBox

Nuna duk shirye-shiryen ciki har da shirye-shiryen alamomin →

Tambayar da ake yi da yawa

TTS.ai shine mafi yawan dandamalin magana na AI, yana ba da 22 + zane-zane zuwa magana, ƙirƙirar sauti, magana zuwa rubutu, da kayan aikin sauti. Dukkanin zane-zane suna da ma'ana mai budewa ba tare da mai sayarwa ba.

Ya! TTS.ai yana ba da kyautar rubutu zuwa magana tare da Kokoro, Piper, VITS, da MeloTTS. Babu lissafi da ake buƙata. Yi rajista don samun 15,000 na kyauta da damar duk samfuran. Ayyukan da aka biya sun fara a $ 9 / watan.

Don sauri, yi amfani da Kokoro ko Piper. Don inganci, yi amfani da CosyVoice 2 ko kuma StyleTTS 2. Don ƙirƙirar sauti, yi amfani da Chatterbox ko kuma GPT-SoVITS. Don zauren muhawara, yi amfani da Dia TTS. Yi amfani da nau'ikan da yawa a kan rubutun guda don yin kwatanta.

Na'am. OpenAI-compatible REST API ga TTS, STT, sauti cloning, da audio kayan aiki. Available a kan Pro ($29/mo) da Enterprise ($99/mo) plans. View documentation at tts.ai/api/.

Quality of voice varies by model. Premium models like CosyVoice 2, StyleTTS 2, and Chatterbox produce near-human quality speech with natural intonation and emotions. Free models like Kokoro offer excellent quality for most use cases.

TTS.ai yana goyon bayan harsuna 30+ a cikin ɗakin karatun nau'insa. Ingilishi yana da goyon bayan nau'in da ya fi faɗi, amma nau'in kamar CosyVoice 2 yana rufe Sin, Jamus, da Korea; GPT-SoVITS yana kula da Sin, Jamus, Korea, da Ingilishi; da MeloTTS yana goyon bayan Ingilishi, Sifanci, Faransanci, Sin, Jamus, da Korea.

Na'am. Duk dabarun da ake yi suna faruwa a kan masu kula da GPU na musamman. Ba mu adana shigarwar rubutunka ko sauti da aka samar ba bayan an aika su. An yi amfani da misalin maganar da aka tattara don kwaikwayo kawai ga zaman shawara na yanzu kuma ba a riƙe su ba. Ba mu raba bayananka da wasu ba ko kuma amfani da su wajen koyar da kwamfutoci.

Na'am. Duk sauti da aka samar akan TTS.ai na gare ka ka yi amfani da shi a cikin kasuwanci, ciki har da bidiyo na YouTube, podcasts, littattafai na sauti, aikace-aikace, tallace-tallace, da kayayyakin aiki. Manhajojinmu suna da tushe mai budewa a karkashin lasisi masu yarda (MIT, Apache 2.0). Babu bukatar biyan kudin mallaka ko bayar da shaida.

TTS.ai yana samar da sauti cikin sifar WAV bisa diff don mafi kyawun inganci. Za ka iya canjawa zuwa MP3, FLAC, OGG, ko M4A ta amfani da kayan aikinmu na kyauta na Mai Sauya Sauti. API na goyon bayan bayyana sifar fitarwa da kake so kai tsaye cikin tambaya.

Ka shigar da misalin sauti mai gajeren lokaci (kimanin sakan 5) na maganar da kake so ka kwafa, sa'an nan ka rubuta duk wani rubutu don ƙirƙirar magana cikin wannan maganar. Nau'o'i kamar Chatterbox, GPT-SoVITS, da CosyVoice 2 suna goyon bayan kwafawar magana. Zauren da aka kwafa yana riƙe da zaren magana, harshen magana, da salon magana.

Free models (Kokoro, Piper, VITS, MeloTTS) require no account and cost zero characters. Standard models (2,000 characters/1K input) include Bark, CosyVoice 2, F5-TTS, and Dia. Premium models (4,000 characters/1K input) include OpenVoice, Chatterbox, StyleTTS 2, and Tortoise. Paid models generally offer higher quality, more voices, and additional features like voice cloning.

A'a. API na goyon bayan aiwatar da bangare-bangare don canja girman adadin rubutu zuwa magana. Sanya tambayoyi da yawa kuma ka karɓi sakamakon asynchronously ta amfani da aiki UUIDs. Enterprise plans ($99/mo) sun ƙunshi dama mai fifiko don aiwatar da bangare-bangare mai sauri. Ideal for audiobook production, course content, and large-scale voiceover projects.
4.1/5 (21)

@ info

Fara Amfani da Sauti na AI yau

Haɗa masu ƙirƙira, masu haɓakawa, da kasuwancin da ke amfani da TTS.ai