Free AI Text to SpeechQuery

31+ open-source mafano, 231+ mawu, 34+ Palibe akaunti yofunika.

8K+
Opanga
30K+
chiyambi
31+
Models a AI
231+
mawu
0/500 maonekedwe · Sign up for 5,000 per generation → _Yaulere
Kukonda TTS.ai? udzauza anzanu!

Zonse zomwe mukufuna kwa Voice AI

30 + zipangizo zopangidwa ndi mapangidwe a AI otsegulidwa

31+ AI Voice Models

Kusonkhanitsa kokwanira kwambiri kwa ma TTS aulere aulere m'modzi m'modzi

KokoroKokoro Free

Kokoro ndi 82 miliyoni paramita malemba-ku-kulankhula chitsanzo chomwe punches bwino pamwamba pa khalidwe lake la thupi. Ngakhale ndi ochepa kukula, amatulutsa mawu owoneka bwino ndi owoneka bwino. Kokoro amathandiza mabungwe ambiri kuphatikizapo Chijeremani, Chijeremani, Chijeremani, ndi Korean ndi mitundu yosiyanasiyana ya mawu owoneka bwino.

Oyenera kwa: TTS yotsika mtengo ndi latency yotsika, machitidwe oyenda

Kuyesa kwaulere

PiperPiper Free

Piper ndi makina otsika mtengo a mawu ochokera ku mawu omwe adapangidwa ndi Rhasspy omwe amagwiritsa ntchito VITS ndi larynx architectures. Imayenda kwathunthu pa CPU, zomwe zimapangitsa kuti ikhale yabwino kwa zida za edge, zowongolera zanyumba, ndi mapulogalamu omwe akufuna TTS osagwirizana. Ndi mawu oposa 100 m'zinenero 30 +, Piper imabweretsa mawu owoneka bwino panthawi ya real-time ngakhale pa Raspberry Pi 4.

Oyenera kwa: Kuwonetsa mofulumira, kupezeka, ndi mapulogalamu ophatikizidwa

Kuyesa kwaulere

VITSVITS Free

VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) ndi njira yofanana yoyambira kumapeto kwa TTS yomwe imapanga mawu owoneka bwino kwambiri kuposa mamodeli anthawi zonse awiri. Imagwiritsa ntchito kutengera kwa maonekedwe osiyanasiyana omwe amawonjezeredwa ndi kuwongolera kwa magazi ndi njira yophunzitsa yotsutsana, yomwe imakwaniritsa kuwonjezeka kwakukulu kwa chilengedwe.

Oyenera kwa: Kusintha mawu kukhala malemba ndi mawu

Kuyesa kwaulere

MeloTTSMeloTTS Free

MeloTTS ndi MyShell.ai ndi TTS library yokhala ndi mabuku ambiri omwe amathandizira Chijeremani (cha America, cha British, cha Indian, cha Australia), Chisipanishi, Chifalansa, Chijeremani, cha Japanese, ndi cha Korean. Ndiyabwino kwambiri, yopanga malemba panthawi yoyenera kwambiri pa CPU yokha. MeloTTS idapangidwa kuti igwiritse ntchito kupanga ndipo imathandizira kuzindikira kwa CPU ndi GPU.

Oyenera kwa: Ntchito zopanga zomwe zimafunikira TTS yofulumira komanso yosiyanasiyana

Kuyesa kwaulere

OuteTTSOuteTTS Free

OuteTTS imawonjezera mapangidwe amtundu waukulu ndi zofunikira za mawu-ku-mawu poteteza mapangidwe ake oyambirira. Imathandizira ma backends ambiri kuphatikiza llama.cpp (CPU / GPU), Hugging Face Transformers, ExLlamaV2, VLLM, komanso kuzindikira kwa msakatuli pogwiritsa ntchito Transformers.js.

Oyenera kwa: Kukhazikitsa kwa Edge, TTS yochokera pabrowser, malo otsika azinthu

Kuyesa kwaulere

Pocket TTSPocket TTS Free

Pocket TTS ya Kyutai (opanga Moshi) ndi mtundu wa 100M wa maparamita a masamba omwe amagwira bwino ntchito. Imagwira ntchito bwino pa CPU, imathandizira kujambula mawu osatha kuchokera pa sampling ya audio imodzi, ndipo imatulutsa mawu owoneka bwino.

Oyenera kwa: Kugwiritsa ntchito kosavuta, CPU-only environments, kujambula mawu mwachangu

Kuyesa kwaulere

Kitten TTSKitten TTS Free

Kitten TTS by KittenML is an ultra-lightweight text-to-speech model built on ONNX. With variants from 15M to 80M parameters (25-80 MB on disk), it delivers high-quality voice synthesis on CPU without requiring a GPU. Features 8 built-in voices, adjustable speech speed, and built-in text preprocessing for numbers, currencies, and units. Ideal for edge deployment and low-latency applications.

Oyenera kwa: Fast lightweight TTS, edge deployment, low-latency applications

Kuyesa kwaulere

BarkBark Standard

Model ya text-to-audio yokhala ndi transformer yomwe imapanga mawu, nyimbo, ndi zotsatira za mawu zowoneka bwino.

Wopanga: Suno · License: MIT

Yambitsani

Bark SmallBark Small Standard

Lighter mtundu wa Bark ndi mofulumira kumvetsa ndi pansi kugwiritsa ntchito kumbukirani.

Wopanga: Suno · License: MIT

Yambitsani

CosyVoice 2CosyVoice 2 Standard

Alibaba's scalable streaming TTS ndi khalidwe la munthu-parity ndi kupitilira-zero latency.

Wopanga: Alibaba (Tongyi Lab) · License: Apache 2.0

Yambitsani

Dia TTSDia TTS Standard

Multi-wokamba nkhani dialogue chiyambi cha mtundu womwe amaumba zokambirana zachilengedwe pakati pa okamba nkhani.

Wopanga: Nari Labs · License: Apache 2.0

Yambitsani

Parler TTSParler TTS Standard

Kufotokozera mawu mukufuna mu chilankhulo chachilengedwe ndi Parler amapanga mawu ogwirizana.

Wopanga: Hugging Face · License: Apache 2.0

Yambitsani

GLM-TTSGLM-TTS Standard

Achire pansi kwambiri chizindikiro cha vuto kuchuluka pakati open-source TTS mafano.

Wopanga: Zhipu AI · License: GLM-4 License

Yambitsani

IndexTTS-2IndexTTS-2 Standard

Zero-shot TTS ndi kuwongolera kwa maganizo olimba komanso kutanthauzira kwakukulu.

Wopanga: Index Team · License: Bilibili Model License

Yambitsani

Spark TTSSpark TTS Standard

Kulankhula kloning TTS ndi controlable chisoni ndi kulankhula mtundu mwa kufunsa.

Wopanga: SparkAudio · License: CC BY-NC-SA 4.0

Yambitsani

GPT-SoVITSGPT-SoVITS Standard

Few-shot mawu kloning TTS kuti amachitanso chilichonse mawu kuchokera 5 masekondi a audio.

Wopanga: RVC-Boss · License: MIT

Yambitsani

OrpheusOrpheus Standard

Model ya TTS yokhudzana ndi munthu yophunzitsa 100K maola a data ya mawu.

Wopanga: Canopy Labs · License: Llama 3.2 Community

Yambitsani

Qwen3 TTSQwen3 TTS Standard

TTS ya Alibaba ndi mawu osiyanasiyana, mawu osankhidwa, ndi mawu opangidwa kuchokera ku malemba.

Wopanga: Alibaba (Qwen) · License: Apache 2.0

Yambitsani

Chatterbox TurboChatterbox Turbo Standard

Faster Chatterbox ndi sub-200ms latency ndi paralinguistic tags kwa laughs, kupweteka, ndi zina zambiri.

Wopanga: Resemble AI · License: MIT

Yambitsani

Dia 2Dia 2 Standard

Kutumiza-pyamba conversational TTS ndi multi-wokamba nkhani uthenga ndi paralinguistic cues.

Wopanga: Nari Labs · License: Apache 2.0

Yambitsani

VoxCPMVoxCPM Standard

TTS yopanda Tokenizer yomwe imapanga 44.1kHz audio ndi kugwirizana kwa masamba omvetsetsa.

Wopanga: OpenBMB · License: Apache 2.0

Yambitsani

TADATADA Standard

Zero-hallucination TTS ndi malemba-acoustic dual kugwirizana, 5x mofulumira kuposa kuyerekezera LLM TTS.

Wopanga: Hume AI · License: MIT

Yambitsani

VibeVoiceVibeVoice Standard

Microsoft model for long-form multi-speaker content monga podcasts ndi audiobooks.

Wopanga: Microsoft · License: MIT

Yambitsani

CosyVoice3CosyVoice3 Standard

Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.

Wopanga: Alibaba (FunAudioLLM) · License: Apache 2.0

Yambitsani

ChatterboxChatterbox Premium

State-of-the-art zero-shot kujambula mawu ndi kuwongolera maganizo kuchokera ku Resemble AI.

Ubwino:

Yambitsani

Tortoise TTSTortoise TTS Premium

Multi-wolankhula malemba-ku-kulankhula kuganizira za katundu ndi autoregressive ukadaulo.

Ubwino:

Yambitsani

StyleTTS 2StyleTTS 2 Premium

Kusintha kwa mawu kukhala mawu pamalingaliro a munthu pogwiritsa ntchito kufalitsa kwa mtundu ndi kuphunzitsa motsutsana.

Ubwino:

Yambitsani

OpenVoiceOpenVoice Premium

Instant mawu kloning ndi granular kuwongolera pa mtundu, chisoni, ndi accent.

Ubwino:

Yambitsani

Sesame CSMSesame CSM Premium

Kulankhulana kwa mawu kumabweretsa uthenga woyenera ndi nthawi yoyenera ndi maganizo.

Ubwino:

Yambitsani

MOSS-TTSMOSS-TTS Premium

Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.

Ubwino:

Yambitsani

MegaTTS3MegaTTS3 Premium

ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.

Ubwino:

Yambitsani

CosyVoice 2CosyVoice 2

Alibaba's scalable streaming TTS ndi khalidwe la munthu-parity ndi kupitilira-zero latency.

Zilankhulo: en, zh, ja, ko, fr, de, it, es

Kusintha mawu

GLM-TTSGLM-TTS

Achire pansi kwambiri chizindikiro cha vuto kuchuluka pakati open-source TTS mafano.

Zilankhulo: en, zh

Kusintha mawu

IndexTTS-2IndexTTS-2

Zero-shot TTS ndi kuwongolera kwa maganizo olimba komanso kutanthauzira kwakukulu.

Zilankhulo: en, zh

Kusintha mawu

Spark TTSSpark TTS

Kulankhula kloning TTS ndi controlable chisoni ndi kulankhula mtundu mwa kufunsa.

Zilankhulo: en, zh

Kusintha mawu

GPT-SoVITSGPT-SoVITS

Few-shot mawu kloning TTS kuti amachitanso chilichonse mawu kuchokera 5 masekondi a audio.

Zilankhulo: en, zh, ja, ko

Kusintha mawu

ChatterboxChatterbox

State-of-the-art zero-shot kujambula mawu ndi kuwongolera maganizo kuchokera ku Resemble AI.

Zilankhulo: en

Kusintha mawu

Tortoise TTSTortoise TTS

Multi-wolankhula malemba-ku-kulankhula kuganizira za katundu ndi autoregressive ukadaulo.

Zilankhulo: en

Kusintha mawu

OpenVoiceOpenVoice

Instant mawu kloning ndi granular kuwongolera pa mtundu, chisoni, ndi accent.

Zilankhulo: en, zh, ja, ko, fr, de, es, it

Kusintha mawu

Qwen3 TTSQwen3 TTS

TTS ya Alibaba ndi mawu osiyanasiyana, mawu osankhidwa, ndi mawu opangidwa kuchokera ku malemba.

Zilankhulo: en, zh, ja, ko, de, fr, ru, pt, es, it

Kusintha mawu

Chatterbox TurboChatterbox Turbo

Faster Chatterbox ndi sub-200ms latency ndi paralinguistic tags kwa laughs, kupweteka, ndi zina zambiri.

Zilankhulo: en

Kusintha mawu

VoxCPMVoxCPM

TTS yopanda Tokenizer yomwe imapanga 44.1kHz audio ndi kugwirizana kwa masamba omvetsetsa.

Zilankhulo: en, zh

Kusintha mawu

OuteTTSOuteTTS

LLM-ogwirizana TTS kuti amayenda pa CPU, GPU, kapena browser kudzera llama.cpp ndi Transformers.js.

Zilankhulo: en

Kusintha mawu

Pocket TTSPocket TTS

Model ya 100M ya Kyutai ndi kulumikizana kwa mawu kuchokera ku satifiketi imodzi.

Zilankhulo: en, fr

Kusintha mawu

CosyVoice3CosyVoice3

Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.

Zilankhulo: en, zh, ja, ko, de, es, fr, it, ru

Kusintha mawu

MOSS-TTSMOSS-TTS

Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.

Zilankhulo: en, zh, de, es, fr, ja, it, hu, ko, ru, fa, ar, pl, pt, cs, da, sv, el, tr

Kusintha mawu

MegaTTS3MegaTTS3

ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.

Zilankhulo: en, zh

Kusintha mawu

Wopanga-Pyamba API

OpenAI-kugwirizana REST API. One endpoint, 22 + mafano. Streaming thandizo kwa real-time mapulogalamu.

  • Format yogwirizana ndi OpenAI
  • Streaming TTS kwa real-time mapulogalamu
  • Batch processing kwa ntchito zazikulu
  • Zidziwitso za Webhook
Kuonera API Docs
pip install ttsai npm install @ttsainpm/ttsai
Python
from tts_ai import TTSClient

client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
    text="Hello from TTS.ai!",
    model="kokoro",
    voice="af_bella",
)
client.save(audio, "output.mp3")

Zosatheka, Zosawoneka bwino

Kuyamba kwaulere. Scale monga mukukula.

_Yaulere

$0

15,000 characters

  • Kokoro, Piper, VITS, MeloTTS
  • 500 Character limit
  • 3 gen / ola (opanda akaunti)
Kulembetsa kwaulere

Woyamba

$9/mphindi

500,000 characters/mwezi

  • onse 22+ zojambula
  • 100,000 chars pa chiyambi
  • Chizindikiro cha mawu
Kuyamba
Otchuka kwambiri

Pro

$29/mphindi

2,000,000 characters/mwezi

  • Zonse mu Starter
  • Kugwiritsa ntchito API
  • Priority processing
Kupeza Pro

Zamalonda

$99/mphindi

10,000,000 characters/mwezi

  • Zonse mu Pro
  • Mphamvu ya API
  • Priority queue
Kupeza bizinesi

Onani zonse zowonjezera kuphatikizapo mapaketi azithunzi →

Funso Lofunsidwa Kawirikawiri

TTS.ai ndi njira yokhala ndi mawu okwanira kwambiri a AI, yomwe imapatsa 22 + mapangidwe a mawu, kulumikizana kwa mawu, mawu olemba mawu, ndi zida za audio.

Yes! TTS.ai amapereka ufulu malemba-ku-kulankhula ndi Kokoro, Piper, VITS, ndi MeloTTS mafano. No akaunti zofunika. Sign up kuti mudziwe 15,000 ufulu mafano ndi kulowa onse mafano.

Kuti muchepetse nthawi, yesani Kokoro kapena Piper. Kuti muchepetse mtengo, yesani CosyVoice 2 kapena StyleTTS 2. Kuti muchepetse mawu, yesani Chatterbox kapena GPT-SoVITS. Kuti muchepetse mauthenga, yesani Dia TTS. Yesani mamodeli ambiri pamutu umodzi kuti muwayerekezere.

OpenAI-kugwirizana REST API kwa TTS, STT, kulankhula kloning, ndi audio zipangizo. Zili pa Pro ($ 29 / mo) ndi Enterprise ($ 99 / mo) miyezo.

Kuwala kwa mawu kumasiyana malinga ndi mtundu wa foni. Mafoni a premium monga CosyVoice 2, StyleTTS 2, ndi Chatterbox amatulutsa mawu ofanana ndi mawu a munthu, ndi mawu owoneka bwino. Mafoni aulere monga Kokoro amapatsa mawu abwino kwambiri pogwiritsa ntchito foni.

TTS.ai amathandiza 30 + mabungwe a zamakono m'mabuku ake. Chingelezi ali ndi chithandizo chabwino kwambiri, koma maphunziro monga CosyVoice 2 amaphatikiza Chisipanishi, Chijapanizi, ndi Chikoreya; GPT-SoVITS amagwira Chisipanishi, Chijapanizi, Chikoreya, ndi Chingelezi; ndi MeloTTS amathandizira Chisipanishi, Chisipanishi, Chijeremani, Chisipanishi, Chijapanizi, ndi Chikoreya.

Ndikofunika. Kugwiritsa ntchito zonse kumachitika pa seva yathu yokhayo ya GPU. Tisasunga mawu anu omwe mwalemba kapena mawu omwe mwapanga pambuyo potumiza. Zolemba za mawu zomwe mwatsitsa kuti muzigwiritsa ntchito zimagwiritsa ntchito nthawi yokhayo yomwe mumagwiritsa ntchito ndipo sizingachitike. Sitingagawana deta yanu ndi anthu ena kapena kuzigwiritsa ntchito kuti tiziphunzitsa mamodeli.

Yai. Zonse zomvetsera zomwe zimapangidwa pa TTS.ai ndi zanu kuti muzigwiritsa ntchito kwachuma, kuphatikizapo mavidiyo a YouTube, podcasts, audiobooks, mapulogalamu, zotsatsa, ndi zinthu. Mamodeli athu ndi otsegulira masamba pansi pa malamulo ovomerezeka (MIT, Apache 2.0).

TTS.ai amapanga audio mu WAV mtundu mwa kuipa kwa khalidwe lalikulu. Mukhoza kusintha kuti MP3, FLAC, OGG, kapena M4A pogwiritsa ntchito wathu ufulu Audio Converter chida.

Lowani chitsanzo chafupi cha mawu (masiku 5 okha) cha mawu omwe mukufuna kukulitsa, kenako pezani mawu kuti mupange mawu m'mawu amenewo. Models monga Chatterbox, GPT-SoVITS, ndi CosyVoice 2 amathandizira kukulitsa mawu.

Mapangidwe aulere (Kokoro, Piper, VITS, MeloTTS) safunikira akaunti ndipo amawononga maonekedwe a zero. Mapangidwe a standard (maonekedwe 2,000 / 1K input) amaphatikizapo Bark, CosyVoice 2, F5-TTS, ndi Dia. Mapangidwe a premium (maonekedwe 4,000 / 1K input) amaphatikizapo OpenVoice, Chatterbox, StyleTTS 2, ndi Tortoise. Mapangidwe olipira nthawi zambiri amapatsa mtundu wabwino kwambiri, mawu ambiri, komanso zinthu zina monga kujambula mawu.

Yes. The API supports batch processing for converting large volumes of text to speech. Submit multiple requests and retrieve results asynchronously using job UUIDs. Enterprise plans ($99/mo) include priority queue access for faster batch processing.
4.1/5 (21)

Kodi tingachitire chiyani kuti tisinthe? Maganizo anu amatithandiza kuchotsa mavuto.

Kuyambiranso kugwiritsa ntchito AI Voice lero

Join opanga, opanga, ndi makampani pogwiritsa ntchito TTS.ai