Free AI Text to SpeechQuery

33+ open-source mafano, 273+ mawu, 33+ Palibe akaunti yofunika.

17K+
Opanga
70K+
chiyambi
33+
Models a AI
273+
mawu
0/500 maonekedwe · Kulembetsa kwa 5,000 pa chiyambi → Opanda pake
Kukonda TTS.ai? udzauza anzanu!

Zonse zomwe mukufuna kwa Voice AI

30 + zipangizo zopangidwa ndi mapangidwe a AI otsegulidwa

33+ AI Voice Models

Kusonkhanitsa kokwanira kwambiri kwa ma TTS aulere aulere m'modzi m'modzi

KokoroKokoro Opanda ndalama

Kokoro ndi 82 miliyoni paramita malemba-ku-kulankhula chitsanzo chomwe punches bwino pamwamba pa khalidwe lake la thupi. Ngakhale ndi ochepa kukula, amatulutsa mawu owoneka bwino ndi owoneka bwino. Kokoro amathandiza mabungwe ambiri kuphatikizapo Chijeremani, Chijeremani, Chijeremani, ndi Korean ndi mitundu yosiyanasiyana ya mawu owoneka bwino.

Oyenera kwa: TTS yotsika mtengo ndi latency yotsika, machitidwe oyenda

Kuyesa kwaulere

PiperPiper Opanda ndalama

Piper ndi makina otsika mtengo a mawu ochokera ku mawu omwe adapangidwa ndi Rhasspy omwe amagwiritsa ntchito VITS ndi larynx architectures. Imayenda kwathunthu pa CPU, zomwe zimapangitsa kuti ikhale yabwino kwa zida za edge, zowongolera zanyumba, ndi mapulogalamu omwe akufuna TTS osagwirizana. Ndi mawu oposa 100 m'zinenero 30 +, Piper imabweretsa mawu owoneka bwino panthawi ya real-time ngakhale pa Raspberry Pi 4.

Oyenera kwa: Kuwonetsa mofulumira, kupezeka, ndi mapulogalamu ophatikizidwa

Kuyesa kwaulere

VITSVITS Opanda ndalama

VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) ndi njira yofanana yoyambira kumapeto kwa TTS yomwe imapanga mawu owoneka bwino kwambiri kuposa mamodeli anthawi zonse awiri. Imagwiritsa ntchito kutengera kwa maonekedwe osiyanasiyana omwe amawonjezeredwa ndi kuwongolera kwa magazi ndi njira yophunzitsa yotsutsana, yomwe imakwaniritsa kuwonjezeka kwakukulu kwa chilengedwe.

Oyenera kwa: Kusintha mawu kukhala malemba ndi mawu

Kuyesa kwaulere

MeloTTSMeloTTS Opanda ndalama

MeloTTS ndi MyShell.ai ndi TTS library yokhala ndi mabuku ambiri omwe amathandizira Chijeremani (cha America, cha British, cha Indian, cha Australia), Chisipanishi, Chifalansa, Chijeremani, cha Japanese, ndi cha Korean. Ndiyabwino kwambiri, yopanga malemba panthawi yoyenera kwambiri pa CPU yokha. MeloTTS idapangidwa kuti igwiritse ntchito kupanga ndipo imathandizira kuzindikira kwa CPU ndi GPU.

Oyenera kwa: Ntchito zopanga zomwe zimafunikira TTS yofulumira komanso yosiyanasiyana

Kuyesa kwaulere

Kani TTS 2Kani TTS 2 Opanda ndalama

Kani-TTS-2 ndi NineNineSix ndi mtundu wa 400M wa 400M wopangidwa ndi Liquid AI LFM2 backbone ndi NVIDIA NanoCodec. Imagwira ntchito pa 3GB VRAM yokha ndipo imatulutsa ~ 10 masekondi a mawu mu ~ 2 masekondi pa A100 (RTF 0.2). Kutulutsa kwa anthu kwatsopano kumatumiza chitsimikizo cha `kani-tts-2-en` cha Chingerezi kokha ndipo sichikuwonetsa khomo lophatikizira lofunikira kwa kujambula mawu - kugwiritsa ntchito Chatterbox / IndexTTS2 / F5-TTS kwa kujambula, kapena Kokoro / MeloTTS kwa omwe salankhula Chingelezi.

Oyenera kwa: Fast English chiyambi pa low-VRAM zida, mawonedwe ofulumira

Kuyesa kwaulere

OuteTTSOuteTTS Opanda ndalama

OuteTTS imawonjezera mapangidwe amtundu waukulu ndi zofunikira za mawu-ku-mawu poteteza mapangidwe ake oyambirira. Imathandizira ma backends ambiri kuphatikiza llama.cpp (CPU / GPU), Hugging Face Transformers, ExLlamaV2, VLLM, komanso kuzindikira kwa msakatuli pogwiritsa ntchito Transformers.js.

Oyenera kwa: Kukhazikitsa kwa Edge, TTS yochokera pabrowser, malo otsika azinthu

Kuyesa kwaulere

Pocket TTSPocket TTS Opanda ndalama

Pocket TTS ya Kyutai (opanga Moshi) ndi mtundu wa 100M wa maparamita a masamba omwe amagwira bwino ntchito. Imagwira ntchito bwino pa CPU, imathandizira kujambula mawu osatha kuchokera pa sampling ya audio imodzi, ndipo imatulutsa mawu owoneka bwino.

Oyenera kwa: Kugwiritsa ntchito kosavuta, CPU-only environments, kujambula mawu mwachangu

Kuyesa kwaulere

Kitten TTSKitten TTS Opanda ndalama

Kitten TTS ndi KittenML ndi mtundu wa text-to-speech wokhala ndi ma parameters 15M mpaka 80M (25-80 MB pa diski), imapatsa mawu osiyanasiyana amtundu wamphamvu pa CPU popanda kufunikira GPU. Imaphatikizapo mawu 8 ophatikizidwa, kuthamanga kwa mawu osinthika, ndi kuphatikizidwa kwa masamba oyambirira a masamba, ndalama, ndi mayunitsi.

Oyenera kwa: Fast lightweight TTS, kukhazikitsa edge, ntchito zochepa

Kuyesa kwaulere

Ming-Omni TTSMing-Omni TTS Opanda ndalama

Ming-omni-tts-0.5B ndi inclusionAI ndi mtundu wa mawu opanda kanthu omwe amapangidwa pa backbone ya BailingMM yolimba ndi decoder ya audio yogwirizana ndi Patch-by-Patch. Imabweretsa 44.1kHz (kuzungulira CD quality), imathandizira kujambula mawu opanda kanthu kuchokera ku 3 + yachiwiri, ndipo imaphatikizapo kuwongolera kwa emotion / dialect / BGM kudzera mu malangizo a JSON.

Oyenera kwa: High-fidelity bilingual narration, kuyankhulana kwa mawu oyang'anira, zinenero za Chisipanishi

Kuyesa kwaulere

MOSS-TTS NanoMOSS-TTS Nano Opanda ndalama

MOSS-TTS-Nano-100M ndi mtundu wa 100M wa OpenMOSS wa banja la MOSS-TTS, lomwe limagawana chikhalidwe cha delay-transformer. Amagulitsa mtundu wa 8B wa 8B kwa ~ 80x ochepa komanso kutsitsa kwa VRAM, zomwe zimapangitsa kuti zikhale zoyenera kwa opanda chithandizo komanso opanga maphunziro.

Oyenera kwa: TTS yopanda malire, kupanga kwakukulu, kugwiritsa ntchito kolumikizana ndi kuchepa kwa latency

Kuyesa kwaulere

BarkBark Choyambirira

Model ya text-to-audio yokhala ndi transformer yomwe imapanga mawu, nyimbo, ndi zotsatira za mawu zowoneka bwino.

Wopanga: Suno · License: MIT

Yambitsani

Bark SmallBark Small Choyambirira

Lighter mtundu wa Bark ndi mofulumira kumvetsa ndi pansi kugwiritsa ntchito kumbukirani.

Wopanga: Suno · License: MIT

Yambitsani

CosyVoice 2CosyVoice 2 Choyambirira

Alibaba's scalable streaming TTS ndi khalidwe la munthu-parity ndi kupitilira-zero latency.

Wopanga: Alibaba (Tongyi Lab) · License: Apache 2.0

Yambitsani

Dia TTSDia TTS Choyambirira

Multi-wokamba nkhani dialogue chiyambi cha mtundu womwe amaumba zokambirana zachilengedwe pakati pa okamba nkhani.

Wopanga: Nari Labs · License: Apache 2.0

Yambitsani

Parler TTSParler TTS Choyambirira

Kufotokozera mawu mukufuna mu chilankhulo chachilengedwe ndi Parler amapanga mawu ogwirizana.

Wopanga: Hugging Face · License: Apache 2.0

Yambitsani

IndexTTS-2IndexTTS-2 Choyambirira

Zero-shot TTS ndi kuwongolera kwa maganizo olimba komanso kutanthauzira kwakukulu.

Wopanga: Index Team · License: Bilibili Model License

Yambitsani

Spark TTSSpark TTS Choyambirira

Kulankhula kloning TTS ndi controlable chisoni ndi kulankhula mtundu mwa kufunsa.

Wopanga: SparkAudio · License: CC BY-NC-SA 4.0

Yambitsani

GPT-SoVITSGPT-SoVITS Choyambirira

Few-shot mawu kloning TTS kuti amachitanso chilichonse mawu kuchokera 5 masekondi a audio.

Wopanga: RVC-Boss · License: MIT

Yambitsani

OrpheusOrpheus Choyambirira

Model ya TTS yokhudzana ndi munthu yophunzitsa 100K maola a data ya mawu.

Wopanga: Canopy Labs · License: Llama 3.2 Community

Yambitsani

Qwen3 TTSQwen3 TTS Choyambirira

TTS ya Alibaba ndi mawu osankhidwa ndi mawu osankhidwa kuchokera ku malemba.

Wopanga: Alibaba (Qwen) · License: Apache 2.0

Yambitsani

VieNeu-TTS-v2VieNeu-TTS-v2 Choyambirira

Vietnamese + Chijeremani code-switching TTS ndi 7 preset mawu ndi zero-shot mawu kloning. CPU-khama, palibe GPU zofunika.

Wopanga: Phạm Nguyễn Ngọc Bảo · License: Apache 2.0

Yambitsani

Chatterbox TurboChatterbox Turbo Choyambirira

Faster Chatterbox ndi sub-200ms latency ndi paralinguistic tags kwa laughs, kupweteka, ndi zina zambiri.

Wopanga: Resemble AI · License: MIT

Yambitsani

VoxCPMVoxCPM Choyambirira

TTS yopanda Tokenizer yomwe imapanga 44.1kHz audio ndi kugwirizana kwa masamba omvetsetsa.

Wopanga: OpenBMB · License: Apache 2.0

Yambitsani

VibeVoiceVibeVoice Choyambirira

Microsoft model for long-form multi-speaker content monga podcasts ndi audiobooks.

Wopanga: Microsoft · License: MIT

Yambitsani

CosyVoice3CosyVoice3 Choyambirira

TTS yatsopano ya TTS ndi bi-streaming, kuwongolera maganizo, ndi kujambula mawu opanda kanthu.

Wopanga: Alibaba (FunAudioLLM) · License: Apache 2.0

Yambitsani

NAMAA Saudi TTSNAMAA Saudi TTS Choyambirira

Kuyamba kutsegulira Saudi-Arabic TTS. Native Saudi dialect ndi Chatterbox-quality voice cloning.

Wopanga: NAMAA Space · License: MIT

Yambitsani

Darwin TTSDarwin TTS Choyambirira

Qwen3-TTS ndi mtundu wa Qwen3-1.7B, womwe umagwiritsa ntchito ma FFN kuti agwirizane ndi ma TTS ena.

Wopanga: FINAL-Bench · License: Apache 2.0

Yambitsani

MOSS-TTSDMOSS-TTSD Choyambirira

Model yotsatira ya macheza a olankhula ambiri - kuyambitsa macheza a podcast-style ndi mpaka 5 olankhula ndi maola 60 a mawu ogwirizana.

Wopanga: OpenMOSS · License: Apache 2.0

Yambitsani

ChatterboxChatterbox Premium

State-of-the-art zero-shot kujambula mawu ndi kuwongolera maganizo kuchokera ku Resemble AI.

Ubwino:

Yambitsani

Tortoise TTSTortoise TTS Premium

Multi-wolankhula malemba-ku-kulankhula kuganizira za katundu ndi autoregressive ukadaulo.

Ubwino:

Yambitsani

StyleTTS 2StyleTTS 2 Premium

Kusintha kwa mawu kukhala mawu pamalingaliro a munthu pogwiritsa ntchito kufalitsa kwa mtundu ndi kuphunzitsa motsutsana.

Ubwino:

Yambitsani

OpenVoiceOpenVoice Premium

Instant mawu kloning ndi granular kuwongolera pa mtundu, chisoni, ndi accent.

Ubwino:

Yambitsani

Sesame CSMSesame CSM Premium

Kulankhulana kwa mawu kumabweretsa uthenga woyenera ndi nthawi yoyenera ndi maganizo.

Ubwino:

Yambitsani

CosyVoice 2CosyVoice 2

Alibaba's scalable streaming TTS ndi khalidwe la munthu-parity ndi kupitilira-zero latency.

Zilankhulo: en, zh, ja, ko, fr, de, it, es

Kusintha mawu

IndexTTS-2IndexTTS-2

Zero-shot TTS ndi kuwongolera kwa maganizo olimba komanso kutanthauzira kwakukulu.

Zilankhulo: en, zh

Kusintha mawu

Spark TTSSpark TTS

Kulankhula kloning TTS ndi controlable chisoni ndi kulankhula mtundu mwa kufunsa.

Zilankhulo: en, zh

Kusintha mawu

GPT-SoVITSGPT-SoVITS

Few-shot mawu kloning TTS kuti amachitanso chilichonse mawu kuchokera 5 masekondi a audio.

Zilankhulo: en, zh, ja, ko

Kusintha mawu

ChatterboxChatterbox

State-of-the-art zero-shot kujambula mawu ndi kuwongolera maganizo kuchokera ku Resemble AI.

Zilankhulo: en

Kusintha mawu

Tortoise TTSTortoise TTS

Multi-wolankhula malemba-ku-kulankhula kuganizira za katundu ndi autoregressive ukadaulo.

Zilankhulo: en

Kusintha mawu

OpenVoiceOpenVoice

Instant mawu kloning ndi granular kuwongolera pa mtundu, chisoni, ndi accent.

Zilankhulo: en, zh, ja, ko, fr, es

Kusintha mawu

VieNeu-TTS-v2VieNeu-TTS-v2

Vietnamese + Chijeremani code-switching TTS ndi 7 preset mawu ndi zero-shot mawu kloning. CPU-khama, palibe GPU zofunika.

Zilankhulo: vi, en

Kusintha mawu

Chatterbox TurboChatterbox Turbo

Faster Chatterbox ndi sub-200ms latency ndi paralinguistic tags kwa laughs, kupweteka, ndi zina zambiri.

Zilankhulo: en

Kusintha mawu

VoxCPMVoxCPM

TTS yopanda Tokenizer yomwe imapanga 44.1kHz audio ndi kugwirizana kwa masamba omvetsetsa.

Zilankhulo: en, zh

Kusintha mawu

OuteTTSOuteTTS

LLM-ogwirizana TTS kuti amayenda pa CPU, GPU, kapena browser kudzera llama.cpp ndi Transformers.js.

Zilankhulo: en

Kusintha mawu

Pocket TTSPocket TTS

Model ya 100M ya Kyutai ndi kulumikizana kwa mawu kuchokera ku satifiketi imodzi.

Zilankhulo: en, fr

Kusintha mawu

CosyVoice3CosyVoice3

TTS yatsopano ya TTS ndi bi-streaming, kuwongolera maganizo, ndi kujambula mawu opanda kanthu.

Zilankhulo: en, zh, ja, ko, de, es, fr, it, ru

Kusintha mawu

NAMAA Saudi TTSNAMAA Saudi TTS

Kuyamba kutsegulira Saudi-Arabic TTS. Native Saudi dialect ndi Chatterbox-quality voice cloning.

Zilankhulo: ar

Kusintha mawu

Darwin TTSDarwin TTS

Qwen3-TTS ndi mtundu wa Qwen3-1.7B, womwe umagwiritsa ntchito ma FFN kuti agwirizane ndi ma TTS ena.

Zilankhulo: en, ko, ja, zh

Kusintha mawu

MOSS-TTSDMOSS-TTSD

Model yotsatira ya macheza a olankhula ambiri - kuyambitsa macheza a podcast-style ndi mpaka 5 olankhula ndi maola 60 a mawu ogwirizana.

Zilankhulo: en, zh

Kusintha mawu

Ming-Omni TTSMing-Omni TTS

Model ya 0.5B yokhala ndi mawu osiyanasiyana a inclusionAI ndi 44.1kHz yokhala ndi 44.1kHz yokhala ndi 44.1kHz ndi zero-shot voice cloning.

Zilankhulo: en, zh

Kusintha mawu

MOSS-TTS NanoMOSS-TTS Nano

Tiny 100M MOSS-TTS mtundu - chimodzimodzi chikhalidwe, 80x ochepa, free-tier latency.

Zilankhulo: en, zh, de, es, fr, ja, it, ko, ru, ar, pt

Kusintha mawu

Wopanga-Pyamba API

OpenAI-kugwirizana REST API. One endpoint, 22 + mafano. Streaming thandizo kwa real-time mapulogalamu.

  • Format yogwirizana ndi OpenAI
  • Streaming TTS kwa real-time mapulogalamu
  • Batch processing kwa ntchito zazikulu
  • Zidziwitso za Webhook
Kuonera API Docs
pip install ttsai npm install @ttsainpm/ttsai
Python
from tts_ai import TTSClient

client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
    text="Hello from TTS.ai!",
    model="kokoro",
    voice="af_bella",
)
client.save(audio, "output.mp3")

Zosatheka, Zosawoneka bwino

Kuyamba kwaulere. Scale monga mukukula.

Opanda pake

$0

15,000 characters + 5,000/day

  • 7 ufulu mafano kuphatikizapo Kokoro
  • 5,000 chars per generation
  • Kupeza kwa API kuphatikizidwa
Kulembetsa kwaulere

Woyamba

$9/mphindi

500,000 characters/mwezi

  • onse 22+ zojambula
  • 100,000 chars pa chiyambi
  • Chizindikiro cha mawu
Kuyamba
Otchuka kwambiri

Pro

$29/mphindi

2,000,000 characters/mwezi

  • Zonse mu Starter
  • Kugwiritsa ntchito API
  • Priority processing
Kupeza Pro

Zamalonda

$99/mphindi

10,000,000 characters/mwezi

  • Zonse mu Pro
  • Mphamvu ya API
  • Priority queue
Kupeza bizinesi

Onani zonse zowonjezera kuphatikizapo mapaketi azithunzi →

Funso Lofunsidwa Kawirikawiri

TTS.ai ndi njira yokhala ndi mawu okwanira kwambiri a AI, yomwe imapatsa 22 + mapangidwe a mawu, kulumikizana kwa mawu, mawu olemba mawu, ndi zida za audio.

Yes! TTS.ai amapereka ufulu malemba-ku-kulankhula ndi Kokoro, Piper, VITS, ndi MeloTTS mafano. No akaunti zofunika. Sign up kuti mudziwe 15,000 ufulu mafano ndi kulowa onse mafano.

Kuti muchepetse nthawi, yesani Kokoro kapena Piper. Kuti muchepetse mtengo, yesani CosyVoice 2 kapena StyleTTS 2. Kuti muchepetse mawu, yesani Chatterbox kapena GPT-SoVITS. Kuti muchepetse mauthenga, yesani Dia TTS. Yesani mamodeli ambiri pamutu umodzi kuti muwayerekezere.

Yes. OpenAI-kugwirizana REST API kwa TTS, STT, kulankhulana kwa mawu, ndi zida za audio. Kuphatikiza pa pa chilichonse cholinga kuphatikizapo ufulu, ndi kuchepetsa kuchuluka kwa kuchuluka kwa tier (Free: 10 req / min, Lite: 20, Starter: 30, Pro: 60, Business: 300).

Kuwala kwa mawu kumasiyana malinga ndi mtundu wa foni. Mafoni a premium monga CosyVoice 2, StyleTTS 2, ndi Chatterbox amatulutsa mawu ofanana ndi mawu a munthu, ndi mawu owoneka bwino. Mafoni aulere monga Kokoro amapatsa mawu abwino kwambiri pogwiritsa ntchito foni.

TTS.ai amathandiza 30 + mabungwe a zamakono m'mabuku ake. Chingelezi ali ndi chithandizo chabwino kwambiri, koma maphunziro monga CosyVoice 2 amaphatikiza Chisipanishi, Chijapanizi, ndi Chikoreya; GPT-SoVITS amagwira Chisipanishi, Chijapanizi, Chikoreya, ndi Chingelezi; ndi MeloTTS amathandizira Chisipanishi, Chisipanishi, Chijeremani, Chisipanishi, Chijapanizi, ndi Chikoreya.

Ndikofunika. Kugwiritsa ntchito zonse kumachitika pa seva yathu yokhayo ya GPU. Tisasunga mawu anu omwe mwalemba kapena mawu omwe mwapanga pambuyo potumiza. Zolemba za mawu zomwe mwatsitsa kuti muzigwiritsa ntchito zimagwiritsa ntchito nthawi yokhayo yomwe mumagwiritsa ntchito ndipo sizingachitike. Sitingagawana deta yanu ndi anthu ena kapena kuzigwiritsa ntchito kuti tiziphunzitsa mamodeli.

Yai. Zonse zomvetsera zomwe zimapangidwa pa TTS.ai ndi zanu kuti muzigwiritsa ntchito kwachuma, kuphatikizapo mavidiyo a YouTube, podcasts, audiobooks, mapulogalamu, zotsatsa, ndi zinthu. Mamodeli athu ndi otsegulira masamba pansi pa malamulo ovomerezeka (MIT, Apache 2.0).

TTS.ai amapanga audio mu WAV mtundu mwa kuipa kwa khalidwe lalikulu. Mukhoza kusintha kuti MP3, FLAC, OGG, kapena M4A pogwiritsa ntchito wathu ufulu Audio Converter chida.

Lowani chitsanzo chafupi cha mawu (masiku 5 okha) cha mawu omwe mukufuna kukulitsa, kenako pezani mawu kuti mupange mawu m'mawu amenewo. Models monga Chatterbox, GPT-SoVITS, ndi CosyVoice 2 amathandizira kukulitsa mawu.

Mapangidwe aulere (Kokoro, Piper, VITS, MeloTTS) safunikira akaunti ndipo amawononga maonekedwe a zero. Mapangidwe a standard (maonekedwe 2,000 / 1K input) amaphatikizapo Bark, CosyVoice 2, F5-TTS, ndi Dia. Mapangidwe a premium (maonekedwe 4,000 / 1K input) amaphatikizapo OpenVoice, Chatterbox, StyleTTS 2, ndi Tortoise. Mapangidwe olipira nthawi zambiri amapatsa mtundu wabwino kwambiri, mawu ambiri, komanso zinthu zina monga kujambula mawu.

Ndikofunika. API imathandizira kuyankha kwa masamba ambiri a masamba. Kutumiza mafunso ambiri ndi kupeza zotsatira mosagwirizana pogwiritsa ntchito UUID ya ntchito. Phunziro la Business ($ 99 / mo) ndi pamwambapa limaphatikizapo kulumikizana kwa priority queue kuti mupange masamba opitilira muyeso.
4.1/5 (42)

Kodi tingachitire chiyani kuti tisinthe? Maganizo anu amatithandiza kuchotsa mavuto.

Kuyambiranso kugwiritsa ntchito AI Voice lero

Join opanga, opanga, ndi makampani pogwiritsa ntchito TTS.ai