Free AI Text to Speech

20+ open-source models, 107+ voices, 32+ languages. No account required.

1K+
Creators
2K+
Generations
20+
AI Models
107+
Voices

Everything You Need for Voice AI

30+ tools powered by open-source AI models

20+ AI Voice Models

The most comprehensive collection of free, open-source TTS models in one place

Kokoro (Free)

Kokoro is an 82-million-parameter text-to-speech model that punches well above its weight class. Despite its small size, it consistently produces impressive, natural-sounding speech. Kokoro supports multiple languages, including English and Korean, with a variety of voice styles.

Best for: High-quality TTS with low latency, streaming applications

Try for free

Piper (Free)

Piper is a lightweight text-to-speech system developed by Rhasspy that uses the VITS and Larynx architectures. It runs entirely on CPU, which makes it a good fit for edge devices, home automation, and applications that need offline TTS. With more than 100 voices in 30+ languages, Piper delivers natural-sounding speech in real time, even on a Raspberry Pi 4.

Best for: Fast inference, accessibility, and embedded applications

Try for free

VITS (Free)

VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural-sounding audio than conventional two-stage models. It uses variational inference augmented with normalizing flows and an adversarial training process, which yields a significant improvement in naturalness.

Best for: General-purpose text-to-speech with natural prosody

Try for free

MeloTTS (Free)

MeloTTS by MyShell.ai is a multilingual TTS library that supports English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean.

Best for: Production workloads that need fast, multilingual TTS

Try for free

Bark (Standard)

A transformer-based text-to-audio model that generates realistic speech, music, and sound effects.

Developer: Suno · License: MIT

Try it

Bark Small (Standard)

A lighter version of Bark with faster inference and lower memory usage.

Developer: Suno · License: MIT

Try it

CosyVoice 2 (Standard)

Alibaba's scalable streaming TTS with human-parity quality and minimal latency.

Developer: Alibaba (Tongyi Lab) · License: Apache 2.0

Try it

Dia TTS (Standard)

A multi-speaker dialogue generation model that produces natural conversations between speakers.

Developer: Nari Labs · License: Apache 2.0

Try it

Parler TTS (Standard)

Describe the voice you want in natural language and Parler generates matching speech.

Developer: Hugging Face · License: Apache 2.0

Try it

GLM-TTS (Standard)

Achieves the lowest character error rate among open-source TTS models.

Developer: Zhipu AI · License: GLM-4 License

Try it

IndexTTS-2 (Standard)

Zero-shot TTS with strong emotion control and high expressiveness.

Developer: Index Team · License: Bilibili Model License

Try it

Spark TTS (Standard)

Voice cloning TTS with controllable emotion and speaking style via prompting.

Developer: SparkAudio · License: CC BY-NC-SA 4.0

Try it

GPT-SoVITS (Standard)

Few-shot voice cloning TTS that reproduces any voice from 5 seconds of audio.

Developer: RVC-Boss · License: MIT

Try it

Orpheus (Standard)

A human-like TTS model trained on 100K hours of speech data.

Developer: Canopy Labs · License: Llama 3.2 Community

Try it

Qwen3 TTS (Standard)

Alibaba's TTS with multiple voices, voice cloning, and voice design from text.

Developer: Alibaba (Qwen) · License: Apache 2.0

Try it

Chatterbox (Premium)

State-of-the-art zero-shot voice cloning with emotion control, from Resemble AI.

Try it

Tortoise TTS (Premium)

Multi-voice text-to-speech focused on quality, built on an autoregressive architecture.

Try it

StyleTTS 2 (Premium)

Human-level text-to-speech through style diffusion and adversarial training.

Try it

OpenVoice (Premium)

Instant voice cloning with granular control over style, emotion, and accent.

Try it

Sesame CSM (Premium)

A conversational speech model that produces contextually appropriate speech with natural timing and emotion.

Try it

CosyVoice 2

Alibaba's scalable streaming TTS with human-parity quality and minimal latency.

Languages: en, zh, ja, ko, fr, de, it, es

Clone voice

GLM-TTS

Achieves the lowest character error rate among open-source TTS models.

Languages: en, zh

Clone voice

IndexTTS-2

Zero-shot TTS with strong emotion control and high expressiveness.

Languages: en, zh

Clone voice

Spark TTS

Voice cloning TTS with controllable emotion and speaking style via prompting.

Languages: en, zh

Clone voice

GPT-SoVITS

Few-shot voice cloning TTS that reproduces any voice from 5 seconds of audio.

Languages: en, zh, ja, ko

Clone voice

Chatterbox

State-of-the-art zero-shot voice cloning with emotion control, from Resemble AI.

Languages: en

Clone voice

Tortoise TTS

Multi-voice text-to-speech focused on quality, built on an autoregressive architecture.

Languages: en

Clone voice

OpenVoice

Instant voice cloning with granular control over style, emotion, and accent.

Languages: en, zh, ja, ko, fr, de, es, it

Clone voice

Qwen3 TTS

Alibaba's TTS with multiple voices, voice cloning, and voice design from text.

Languages: en, zh, ja, ko, de, fr, ru, pt, es, it

Clone voice

Developer-First API

OpenAI-compatible REST API. One endpoint, 22+ models. Streaming support for real-time applications.

  • OpenAI-compatible format
  • Streaming TTS for real-time applications
  • Batch processing for large workloads
  • Webhook notifications
View API Docs
pip install ttsai
npm install @ttsainpm/ttsai
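Python
from tts_ai import TTSClient

# Authenticate with your API key (placeholder shown).
client = TTSClient(api_key="sk-tts-xxx")
# Synthesize speech with the Kokoro model and the af_bella voice.
audio = client.generate(
    text="Hello from TTS.ai!",
    model="kokoro",
    voice="af_bella",
)
# Write the generated audio to disk.
client.save(audio, "output.mp3")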
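
Because the API is described as OpenAI-compatible, a request can probably also be assembled without the client library. The sketch below only builds the request and does not send it. The host name is a placeholder, and the `/v1/audio/speech` route and the `model` / `input` / `voice` fields are assumptions borrowed from OpenAI's speech endpoint convention, so check the TTS.ai API docs for the real values.

```python
import json
import urllib.request

# Assumption: placeholder host; the route mirrors OpenAI's speech endpoint
# and has not been confirmed against the TTS.ai API docs.
BASE_URL = "https://api.example.com"
SPEECH_ROUTE = "/v1/audio/speech"


def build_speech_request(api_key: str, text: str, model: str, voice: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style text-to-speech request."""
    payload = {"model": model, "input": text, "voice": voice}
    return urllib.request.Request(
        BASE_URL + SPEECH_ROUTE,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_speech_request("sk-tts-xxx", "Hello from TTS.ai!", "kokoro", "af_bella")
    print(req.get_method(), req.full_url)
    print(json.loads(req.data))
```

Passing the request to `urllib.request.urlopen` would send it. Against a real OpenAI-style endpoint, the response body is the audio itself.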

Simple, Transparent Pricing

Start free. Scale as you grow.

Free

$0

15,000 characters

  • Kokoro, Piper, VITS, MeloTTS
  • 500-character limit
  • 3 generations / hour (without an account)
Sign up free

Starter

$9/mo

500,000 characters/month

  • All 22+ models
  • 100,000 chars per generation
  • Voice cloning
Get started
Most popular

Pro

$29/mo

2,000,000 characters/month

  • Everything in Starter
  • API access
  • Priority processing
Get Pro

Enterprise

$99/mo

10,000,000 characters/month

  • Everything in Pro
  • Increased API capacity
  • Priority queue
Get Enterprise

See all plans, including character packs →

Frequently Asked Questions

TTS.ai is one of the most advanced AI voice platforms, offering 22+ models for text-to-speech, speech-to-text, voice cloning, and audio tools. All models are open source, with no vendor lock-in.

Yes! TTS.ai offers free text-to-speech with the Kokoro, Piper, VITS, and MeloTTS models. No account required. Sign up to get 15,000 free characters and access to all models.
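
The free tier in the pricing section lists a 500-character limit, so longer text has to be split before it is submitted. A minimal sketch using Python's standard `textwrap` module follows. The helper name `split_for_free_tier` is hypothetical, and the sketch assumes the limit applies per generation and that splitting on whitespace is acceptable for your text.

```python
import textwrap

# Assumption: the 500-character limit applies to each generation.
FREE_TIER_CHAR_LIMIT = 500


def split_for_free_tier(text: str, limit: int = FREE_TIER_CHAR_LIMIT) -> list[str]:
    """Split text on whitespace into chunks of at most `limit` characters.

    Runs of whitespace are collapsed, and a single word longer than
    `limit` is broken across chunks.
    """
    return textwrap.wrap(text, width=limit, break_long_words=True)


if __name__ == "__main__":
    sample = "word " * 300  # 1,500 characters of input
    for i, chunk in enumerate(split_for_free_tier(sample), start=1):
        print(i, len(chunk))
```

Each chunk counts as its own generation, so the listed cap of 3 generations per hour without an account still applies.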

For speed, try Kokoro or Piper. For quality, try CosyVoice 2 or StyleTTS 2. For voice cloning, try Chatterbox or GPT-SoVITS. For dialogue, try Dia TTS. Run several models on the same text to compare them.

Yes. An OpenAI-compatible REST API covers TTS, STT, voice cloning, and audio tools. It is available on the Pro ($29/mo) and Enterprise ($99/mo) plans.

Voice quality varies by model. Premium models such as CosyVoice 2, StyleTTS 2, and Chatterbox produce speech that is close to a human voice, with natural intonation. Free models such as Kokoro also deliver very good speech quality.

TTS.ai supports 30+ languages across its model library. English has the best model coverage, but models such as CosyVoice 2 cover Chinese, Japanese, and Korean; GPT-SoVITS handles Chinese, Japanese, Korean, and English; and MeloTTS supports English, Spanish, French, Chinese, Japanese, and Korean.

Yes. All processing runs on our own dedicated GPU servers. We do not store the text you enter or the audio you generate after delivery. Voice samples you upload for cloning are used only for the duration of your session and are not retained. We never share your data with third parties or use it to train models.

Yes. All audio generated on TTS.ai is yours to use commercially, including in YouTube videos, podcasts, audiobooks, apps, advertisements, and products. Our models are open source under permissive licenses (MIT, Apache 2.0).

TTS.ai generates audio in WAV format by default for maximum quality. You can convert to MP3, FLAC, OGG, or M4A using our free Audio Converter tool.

Upload a short voice sample (as little as 5 seconds) of the voice you want to clone, then enter text to generate speech in that voice. Models such as Chatterbox, GPT-SoVITS, and CosyVoice 2 support voice cloning.

Free models (Kokoro, Piper, VITS, MeloTTS) require no account and cost zero characters. Standard models (2,000 characters / 1K input) include Bark, CosyVoice 2, F5-TTS, and Dia. Premium models (4,000 characters / 1K input) include OpenVoice, Chatterbox, StyleTTS 2, and Tortoise. Paid models generally offer higher quality, more voices, and extra features such as voice cloning.

Yes. The API supports batch processing for converting large volumes of text to speech. Submit multiple requests and retrieve results asynchronously using job UUIDs. Enterprise plans ($99/mo) include priority queue access for faster batch processing.
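
The submit-then-retrieve flow described above can be sketched as a polling loop. Everything in this example is hypothetical: `FakeBatchBackend` is an in-memory stand-in so the code runs on its own, and its `submit` / `status` methods and reply fields are illustrative names, not the TTS.ai API.

```python
import uuid
from dataclasses import dataclass, field


@dataclass
class FakeBatchBackend:
    """In-memory stand-in for a batch TTS service (hypothetical, not the real API)."""
    jobs: dict = field(default_factory=dict)

    def submit(self, text: str) -> str:
        # Each submission is tracked by a freshly generated job UUID.
        job_id = str(uuid.uuid4())
        self.jobs[job_id] = {"text": text, "polls": 0}
        return job_id

    def status(self, job_id: str) -> dict:
        job = self.jobs[job_id]
        job["polls"] += 1
        # Pretend every job finishes on its second poll.
        if job["polls"] < 2:
            return {"state": "pending"}
        return {"state": "done", "audio": f"<audio for {job['text']!r}>".encode()}


def run_batch(backend, texts, max_rounds=10):
    """Submit all texts, then poll until every job is done or rounds run out."""
    pending = {backend.submit(t): t for t in texts}
    results = {}
    for _ in range(max_rounds):
        if not pending:
            break
        for job_id in list(pending):
            reply = backend.status(job_id)
            if reply["state"] == "done":
                results[pending.pop(job_id)] = reply["audio"]
        # A real client would sleep between rounds instead of polling in a tight loop.
    return results


if __name__ == "__main__":
    out = run_batch(FakeBatchBackend(), ["Chapter one.", "Chapter two."])
    print(sorted(out))
```

Results are keyed by input text here for brevity, so duplicate texts would overwrite each other. Keying by job UUID avoids that. The feature list also mentions webhook notifications, which would remove the need to poll at all.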

Start using AI Voice today

Join creators, developers, and businesses using TTS.ai