Free AI Text to SpeechQuery

22+ open-source mafano, 100+ mawu, 32+ Palibe akaunti zofunika.

0/500 maonekedwe _Yaulere
Palibe khadi la ngongole 50 ufulu malipiro 32+ Zilankhulo Kugwiritsa ntchito kwamalonda OK
0:00 / 0:00
Download Audio Kugwirizana kumatha mu 24h
Mumakonda TTS.ai? udzauza anzanu!

Zonse zomwe muyenera kudziwa za Voice AI

Zipangizo za 26 zomwe zimapangidwa ndi 24+ open-source AI models

22+ AI Models za mawu

Kusonkhanitsa kwakukulu kwambiri kwa ma TTS open-source models m'modzi m'modzi

KokoroKokoro Free

Kokoro is an 82 million parameter text-to-speech model that punches well above its weight class. Despite its tiny size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages including English, Japanese, Chinese, and Korean with a variety of expressive voices. It runs incredibly fast — generating audio nearly 100x faster than real-time on a GPU.

Oyenera kwa: High-quality TTS with minimal latency, streaming applications

Phunzirani kwaulere

PiperPiper Free

Piper is a lightweight text-to-speech engine developed by Rhasspy that uses VITS and larynx architectures. It runs entirely on CPU, making it ideal for edge devices, home automation, and applications requiring offline TTS. With over 100 voices across 30+ languages, Piper delivers natural-sounding speech at real-time speeds even on a Raspberry Pi 4.

Oyenera kwa: Quick previews, accessibility, and embedded applications

Phunzirani kwaulere

VITSVITS Free

VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.

Oyenera kwa: General-purpose text-to-speech with natural prosody

Phunzirani kwaulere

MeloTTSMeloTTS Free

MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.

Oyenera kwa: Ntchito zopanga zomwe zimafunikira TTS yofulumira komanso yosiyanasiyana

Phunzirani kwaulere

BarkBark Standard

Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.

Wopanga: Suno · License: MIT

Yambitsani

Bark SmallBark Small Standard

Lighter version of Bark with faster inference and lower memory usage.

Wopanga: Suno · License: MIT

Yambitsani

CosyVoice 2CosyVoice 2 Standard

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Wopanga: Alibaba (Tongyi Lab) · License: Apache 2.0

Yambitsani

Dia TTSDia TTS Standard

Multi-wokamba nkhani dialogue chitukuko chitsanzo chomwe chimaumba zokambirana zachilengedwe pakati pa wokamba nkhani.

Wopanga: Nari Labs · License: Apache 2.0

Yambitsani

Parler TTSParler TTS Standard

Describe the voice you want in natural language and Parler generates matching speech.

Wopanga: Hugging Face · License: Apache 2.0

Yambitsani

IndexTTS-2IndexTTS-2 Standard

Zero-shot TTS with fine-grained emotion control and high expressiveness.

Wopanga: Index Team · License: Apache 2.0

Yambitsani

Spark TTSSpark TTS Standard

Voice cloning TTS with controllable emotion and speaking style via prompts.

Wopanga: SparkAudio · License: Apache 2.0

Yambitsani

GPT-SoVITSGPT-SoVITS Standard

Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.

Wopanga: RVC-Boss · License: MIT

Yambitsani

OrpheusOrpheus Standard

Human-level emotional TTS model trained on 100K hours of speech data.

Wopanga: Canopy Labs · License: Llama 3.2 Community

Yambitsani

Qwen3 TTSQwen3 TTS Standard

Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.

Wopanga: Alibaba (Qwen) · License: Apache 2.0

Yambitsani

ChatterboxChatterbox Premium

State-of-the-art zero-shot voice cloning ndi kuwongolera maganizo kuchokera ku Resemble AI.

Ubwino:

Yambitsani

Tortoise TTSTortoise TTS Premium

Multi-voice text-to-speech yodziyimira pawokha yodziyimira pawokha yodziyimira pawokha.

Ubwino:

Yambitsani

StyleTTS 2StyleTTS 2 Premium

Human-level text-to-speech through style diffusion and adversarial training.

Ubwino:

Yambitsani

OpenVoiceOpenVoice Premium

Instant voice cloning with granular control over style, emotion, and accent.

Ubwino:

Yambitsani

CosyVoice 2CosyVoice 2

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Zilankhulo: en, zh, ja, ko, fr, de, it, es

Clone Voice

IndexTTS-2IndexTTS-2

Zero-shot TTS with fine-grained emotion control and high expressiveness.

Zilankhulo: en, zh

Clone Voice

Spark TTSSpark TTS

Voice cloning TTS with controllable emotion and speaking style via prompts.

Zilankhulo: en, zh

Clone Voice

GPT-SoVITSGPT-SoVITS

Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.

Zilankhulo: en, zh, ja, ko

Clone Voice

ChatterboxChatterbox

State-of-the-art zero-shot voice cloning ndi kuwongolera maganizo kuchokera ku Resemble AI.

Zilankhulo: en

Clone Voice

Tortoise TTSTortoise TTS

Multi-voice text-to-speech yodziyimira pawokha yodziyimira pawokha yodziyimira pawokha.

Zilankhulo: en

Clone Voice

OpenVoiceOpenVoice

Instant voice cloning with granular control over style, emotion, and accent.

Zilankhulo: en, zh, ja, ko, fr, de, es, it

Clone Voice

Qwen3 TTSQwen3 TTS

Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.

Zilankhulo: en, zh, ja, ko, de, fr, ru, pt, es, it

Clone Voice

Developer-First API

OpenAI-kugwirizana REST API. One endpoint, 22 + mafano. Streaming thandizo kwa real-time mapulogalamu.

  • Format yogwirizana ndi OpenAI
  • Streaming TTS kwa real-time mapulogalamu
  • Batch processing kwa ntchito zazikulu
  • Zidziwitso za Webhook
Pangani API Docs
Python
import requests

response = requests.post(
    "https://api.tts.ai/v1/tts/",
    headers={"Authorization": "Bearer sk-tts-xxx"},
    json={
        "model": "kokoro",
        "text": "Hello from TTS.ai!",
        "voice": "af_bella",
    }
)

with open("output.mp3", "wb") as f:
    f.write(response.content)

Zosavuta, Zowoneka bwino Zotsatsa

Kuyamba kwaulere. Scale monga mukukula.

_Yaulere

$0

50 credits

  • Kokoro, Piper, VITS, MeloTTS
  • 500 chizindikiro malire
  • 3 gen / ola (opanda akaunti)
Kulembetsa kwaulere

Woyamba

$9/mphindi

500 credits / mwezi

  • onse 22+ zojambula
  • 5,000 characters limit
  • Chizindikiro cha mawu
Kuyamba
Otchuka kwambiri

Pro

$29/mphindi

2,000 credits / mwezi

  • Zonse mu Starter
  • Kugwiritsa ntchito API
  • Priority processing
Kupeza Pro

Enterprise

$99/mphindi

10,000 credits / mwezi

  • Zonse mu Pro
  • Mphamvu ya API
  • Priority queue
Kulumikizana ndi Kugulitsa

View all plans including credit packs →

Funso Lofunsidwa Kawirikawiri

TTS.ai ndi imodzi mwamapulogalamu apamwamba kwambiri a AI, yomwe imapatsa 22+ mapangidwe a mawu, mawu, mawu ndi mawu, ndi zida za audio.Zosefera zonse ndi zaulere ndipo sizikugwirizana ndi wogulitsa.

Yes! TTS.ai amapereka ufulu text-to-speech ndi Kokoro, Piper, VITS, ndi MeloTTS mafano. No akaunti zofunika. Sign up kuti mudziwe 50 ufulu ngongole ndi kulowa onse mafano.

Kuti muchepetse nthawi,gwiritsani ntchito Kokoro kapena Piper. Kuti muchepetse mtengo,gwiritsani ntchito CosyVoice 2 kapena StyleTTS 2. Kuti muchepetse mawu,gwiritsani ntchito Chatterbox kapena GPT-SoVITS. Kuti muchepetse mawu,gwiritsani ntchito Dia TTS.

Yai. OpenAI-kugwirizana REST API kwa TTS, STT, mawu kloning, ndi audio zipangizo. Available pa Pro ($ 29 / mo) ndi Enterprise ($ 99 / mo) miyezo.

Kuwala kwa mawu kumasiyana malinga ndi mtundu wa foni. Mafoni a premium monga CosyVoice 2, StyleTTS 2, ndi Chatterbox amatulutsa mawu ofanana ndi mawu a munthu, ndi mawu owoneka bwino. Mafoni aulere monga Kokoro amapatsa mawu abwino kwambiri pogwiritsa ntchito foni.

TTS.ai amathandiza 30 + zilankhulo m'mabuku ake a model.English ali ndi chithandizo chabwino kwambiri cha model, koma mamodeli monga CosyVoice 2 amaphatikiza Chisipanishi, Chijapanizi, ndi Chikoreya; GPT-SoVITS amasamalira Chisipanishi, Chijapanizi, Chikoreya, ndi Chingelezi; ndi MeloTTS amathandizira Chisipanishi, Chisipanishi, Chijeremani, Chisipanishi, Chijapanizi, ndi Chikoreya.

Yai. Kuchita zonse kumachitika pa ma seva athu a GPU. Tisasunga malemba anu kapena ma audio omwe amapangidwa pambuyo pa kutumiza. Zolemba za mawu zomwe zatulutsidwa kuti zikhale zofanana zimagwiritsidwa ntchito pokhapokha pa seshoni yatsopano ndipo sizingachitike. Tisagawana deta yanu ndi anthu ena kapena kuzigwiritsa ntchito kuphunzitsa ma modeli.

Yes. All audio generated on TTS.ai is yours to use commercially, including for YouTube videos, podcasts, audiobooks, apps, advertisements, and products. Our models are open source under permissive licenses (MIT, Apache 2.0). No royalties or attribution required.

TTS.ai amapanga audio mu WAV mtundu mwa default kwa khalidwe lalikulu. Mukhoza kusintha kuti MP3, FLAC, OGG, kapena M4A pogwiritsa ntchito wathu ufulu Audio Converter chida.

Upload a short audio sample (as little as 5 seconds) of the voice you want to clone, then type any text to generate speech in that voice. Models like Chatterbox, GPT-SoVITS, and CosyVoice 2 support voice cloning. The cloned voice captures tone, accent, and speaking style.

Zithunzi zaulere (Kokoro, Piper, VITS, MeloTTS) sizikufunikira akaunti ndipo zimawononga ndalama zopanda phindu. Zithunzi za Standard (2 credits / 1K characters) zimaphatikizapo Bark, CosyVoice 2, F5-TTS, ndi Dia. Zithunzi za Premium (4 credits / 1K characters) zimaphatikizapo OpenVoice, Chatterbox, StyleTTS 2, ndi Tortoise.

Yes. The API supports batch processing for converting large volumes of text to speech. Submit multiple requests and retrieve results asynchronously using job UUIDs. Enterprise plans ($99/mo) include priority queue access for faster batch processing.
5.0/5 (1)

Kuyamba kugwiritsa ntchito AI Voice lero

Join opanga, opanga, ndi makampani pogwiritsa ntchito TTS.ai