Free AI Tọghata ngwe ka ọsụsọ

31+ Open-source models, 231+ ụda 34+ Achọrọghị akaụntụ.

8K+
Ndị na-eme ya
30K+
Ọdịnaya
31+
Ụdị AI
231+
ụda
0/500 Ụdị · Sign up for 5,000 per generation → Ọfụụ
Ị hụrụ TTS.ai? Kpọtụrụ enyi gị!

Ihe niile ịchọrọ maka ụda AI

30+ tools powered by open-source AI models

31+ Ụdị ụda AI

Nchịkọta zuru ezu nke open-source TTS models na mbido otu

KokoroKokoro Free

Kokoro bụ 82 nde parameters ngwe-na-asị model nke punches mma n'elu ya weightclass. N'agbanyeghị ya obere size, ọ na-emepụta na-asị na-asị na-asị. Kokoro na-akwado asụsụ ndị ọzọ gụnyere English, Japanese, Chinese, na Korean na ụdị okwu ndị ọzọ. Ọ na-arụ ọrụ n'ụzọ dị mfe - na-emepụta ụda dị ka 100x n'ụzọ dị mfe karịa oge n'oge na GPU.

Ọkachasị maka: TTS nke dị elu na-enweghị mmebi, usoroiheomume ntụgharị

Nwalee

PiperPiper Free

Piper bụ engine ngwe-na-asụsụ na-asụgharị nke Rhasspy na-eji VITS na larynx architectures. Ọ na-arụ ọrụ nke ọma na CPU, na-eme ka ọ dị mma maka ngwaọrụ edge, ụlọ ọrụ na-arụ ọrụ, na ngwa ọrụ chọrọ TTS na-enweghị njikọ. Na okwu 100 n'elu asụsụ 30 +, Piper na-enye okwu na-asụgharị na-asụgharị na-asụgharị na-asụgharị na Raspberry Pi 4.

Ọkachasị maka: Nlebiritụanya nkịtị, ikikembanye, nakwa usoroiheomume embedded

Nwalee

VITSVITS Free

VITS (Variational Inference na-amụ ihe na-abịanụ maka ngwụcha-na-abịanụ Text-to-Speech) bụ ụzọ TTS na-abịanụ na-abịanụ nke na-emepụta ụda dị mma karịa ụdị abụọ nke ugbu a. Ọ na-ahọrọ ntụgharị dị iche iche na-agbakwunye na ntụgharị na usoro nkuzi na-abịanụ, na-enwetakwa mmelite dị mkpa na nghọta.

Ọkachasị maka: General-purpose text-to-speech na narịị prosody

Nwalee

MeloTTSMeloTTS Free

MeloTTS site na MyShell.ai bụ TTS multilingual library na-akwado English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, na Korean. Ọ dị ngwa ngwa, na-arụ ọrụ ngwe na-adịgide adịgide na CPU naanị. MeloTTS ejirila maka iji mmepụta na-akwado CPU na GPU inference.

Ọkachasị maka: Usoroiheomume mmepe na-achọ ngwa ngwa, TTS n'asụsụ dị iche iche

Nwalee

OuteTTSOuteTTS Free

OuteTTS na-eweta ụdị asụsụ dị ukwuu na ngwe-na-asụgharị n'oge na-echekwa ọdịnala. Ọ na-akwado ọtụtụ backends gụnyere llama.cpp (CPU / GPU), Hugging Face Transformers, ExLlamaV2, VLLM, na ọbụna ntụgharị ntụgharị na-eji Transformers.js. Ọ na-enyekwa ụda ụda site na profaịlụ ndị na-ekwu okwu echekwara dịka JSON.

Ọkachasị maka: Nhazi n'akụkụ, TTS nke na-adabere na brauịzaị, gburugburu ebe obibi nke na-enweghị uru

Nwalee

Pocket TTSPocket TTS Free

Pocket TTS site na Kyutai (ndị na-emepụta Moshi) bụ ụdị 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke 100M nke

Ọkachasị maka: Nhazi dị n'okpuru, CPU-ọbụla gburugburu, ịkọgharị ụda n'ụzọ ngwa ngwa

Nwalee

Kitten TTSKitten TTS Free

Kitten TTS by KittenML is an ultra-lightweight text-to-speech model built on ONNX. With variants from 15M to 80M parameters (25-80 MB on disk), it delivers high-quality voice synthesis on CPU without requiring a GPU. Features 8 built-in voices, adjustable speech speed, and built-in text preprocessing for numbers, currencies, and units. Ideal for edge deployment and low-latency applications.

Ọkachasị maka: Fast lightweight TTS, edge deployment, low-latency applications

Nwalee

BarkBark Standard

Transform-based text-to-audio model nke na-emepụta okwu, egwu, na mmetụta ụda.

Debanye aha: Suno · Ikikere: MIT

Jiri ya

Bark SmallBark Small Standard

Ụdị dị n'okpuru nke Bark na-eji nghọta dị n'okpuru.

Debanye aha: Suno · Ikikere: MIT

Jiri ya

CosyVoice 2CosyVoice 2 Standard

Alibaba's scalable streaming TTS na human-parity naturalness na nso-zero latency.

Debanye aha: Alibaba (Tongyi Lab) · Ikikere: Apache 2.0

Jiri ya

Dia TTSDia TTS Standard

Multi-speaker dialog generation model nke na-emepụta nchọgharị n'etiti ndị na-ekwu okwu.

Debanye aha: Nari Labs · Ikikere: Apache 2.0

Jiri ya

Parler TTSParler TTS Standard

Depụta ụda ịchọrọ n'asụsụ na-emeghị n'aka na Parler ga-eweta ụda dị n'otu.

Debanye aha: Hugging Face · Ikikere: Apache 2.0

Jiri ya

GLM-TTSGLM-TTS Standard

Na-enwe ụkpụrụ nke ndehie akara ala n'etiti okporo ụzọ TTS model.

Debanye aha: Zhipu AI · Ikikere: GLM-4 License

Jiri ya

IndexTTS-2IndexTTS-2 Standard

Zero-shot TTS na-ejikwa mmetụta uche nke dị mma nakwa n'ụzọ dị elu.

Debanye aha: Index Team · Ikikere: Bilibili Model License

Jiri ya

Spark TTSSpark TTS Standard

Klọnaịsị ụda TTS n'ụdị ụda na-achịkwa ya nakwa n'ụdị okwu site n'ịjụjụ.

Debanye aha: SparkAudio · Ikikere: CC BY-NC-SA 4.0

Jiri ya

GPT-SoVITSGPT-SoVITS Standard

Few-shot ụda na-ebuli TTS nke na-ebuli ụda ọbụla site na sekọnd 5 nke ụda.

Debanye aha: RVC-Boss · Ikikere: MIT

Jiri ya

OrpheusOrpheus Standard

Human-level emotional TTS model trained on 100K hours of speech data.

Debanye aha: Canopy Labs · Ikikere: Llama 3.2 Community

Jiri ya

Qwen3 TTSQwen3 TTS Standard

Alibaba's multilingual TTS na ụda cloning, preset ụda, na ụda nhazi site na ngwe.

Debanye aha: Alibaba (Qwen) · Ikikere: Apache 2.0

Jiri ya

Chatterbox TurboChatterbox Turbo Standard

Chatterbox n'ụzọ nkịtị na sub-200ms latency na paralinguistic tags maka nnụnụ, nkụda mmụọ, na ndị ọzọ.

Debanye aha: Resemble AI · Ikikere: MIT

Jiri ya

Dia 2Dia 2 Standard

Ntụgharị-ọhụrụ TTS na-asụgharị na-asụgharị na-asụgharị na-asụgharị na-asụgharị.

Debanye aha: Nari Labs · Ikikere: Apache 2.0

Jiri ya

VoxCPMVoxCPM Standard

Tokenizer-free TTS na-eweta 44.1kHz ụda na n'ozuzu ya na-aghọta paragraf.

Debanye aha: OpenBMB · Ikikere: Apache 2.0

Jiri ya

TADATADA Standard

Zero-hallucination TTS na ngwe-acoustic dual alignment, 5x ngwa ngwa karịa dị ka LLM TTS.

Debanye aha: Hume AI · Ikikere: MIT

Jiri ya

VibeVoiceVibeVoice Standard

Móòdù Microsoft maka ihenhọrọ ndị na-ekwusa ọtụtụ ihe dị ka podcasts na audiobooks.

Debanye aha: Microsoft · Ikikere: MIT

Jiri ya

CosyVoice3CosyVoice3 Standard

Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.

Debanye aha: Alibaba (FunAudioLLM) · Ikikere: Apache 2.0

Jiri ya

ChatterboxChatterbox Premium

State-of-the-art zero-shot ụda ịkọsa na nchịkwa mmetụta site na Resemble AI.

Nhazi:

Jiri ya

Tortoise TTSTortoise TTS Premium

Multi-voice text-to-speech na-atụle na mma na-eji autoregressive architecture.

Nhazi:

Jiri ya

StyleTTS 2StyleTTS 2 Premium

Nhazi ngwe-ka-asụsụ n'ụdị mmadụ site n'ịgbakọ na ịzụlite.

Nhazi:

Jiri ya

OpenVoiceOpenVoice Premium

Nkwado ụda na-akpaghị aka na nlekọta n'elu ụdị, mmetụta, nakwa ntụgharị.

Nhazi:

Jiri ya

Sesame CSMSesame CSM Premium

N'ihe banyere okwu, ọ bụ ihe na-eme ka okwu na-atọ ụtọ ma na-atọ ụtọ.

Nhazi:

Jiri ya

MOSS-TTSMOSS-TTS Premium

Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.

Nhazi:

Jiri ya

MegaTTS3MegaTTS3 Premium

ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.

Nhazi:

Jiri ya

CosyVoice 2CosyVoice 2

Alibaba's scalable streaming TTS na human-parity naturalness na nso-zero latency.

Asụsụ: en, zh, ja, ko, fr, de, it, es

Kpọnye ụda

GLM-TTSGLM-TTS

Na-enwe ụkpụrụ nke ndehie akara ala n'etiti okporo ụzọ TTS model.

Asụsụ: en, zh

Kpọnye ụda

IndexTTS-2IndexTTS-2

Zero-shot TTS na-ejikwa mmetụta uche nke dị mma nakwa n'ụzọ dị elu.

Asụsụ: en, zh

Kpọnye ụda

Spark TTSSpark TTS

Klọnaịsị ụda TTS n'ụdị ụda na-achịkwa ya nakwa n'ụdị okwu site n'ịjụjụ.

Asụsụ: en, zh

Kpọnye ụda

GPT-SoVITSGPT-SoVITS

Few-shot ụda na-ebuli TTS nke na-ebuli ụda ọbụla site na sekọnd 5 nke ụda.

Asụsụ: en, zh, ja, ko

Kpọnye ụda

ChatterboxChatterbox

State-of-the-art zero-shot ụda ịkọsa na nchịkwa mmetụta site na Resemble AI.

Asụsụ: en

Kpọnye ụda

Tortoise TTSTortoise TTS

Multi-voice text-to-speech na-atụle na mma na-eji autoregressive architecture.

Asụsụ: en

Kpọnye ụda

OpenVoiceOpenVoice

Nkwado ụda na-akpaghị aka na nlekọta n'elu ụdị, mmetụta, nakwa ntụgharị.

Asụsụ: en, zh, ja, ko, fr, de, es, it

Kpọnye ụda

Qwen3 TTSQwen3 TTS

Alibaba's multilingual TTS na ụda cloning, preset ụda, na ụda nhazi site na ngwe.

Asụsụ: en, zh, ja, ko, de, fr, ru, pt, es, it

Kpọnye ụda

Chatterbox TurboChatterbox Turbo

Chatterbox n'ụzọ nkịtị na sub-200ms latency na paralinguistic tags maka nnụnụ, nkụda mmụọ, na ndị ọzọ.

Asụsụ: en

Kpọnye ụda

VoxCPMVoxCPM

Tokenizer-free TTS na-eweta 44.1kHz ụda na n'ozuzu ya na-aghọta paragraf.

Asụsụ: en, zh

Kpọnye ụda

OuteTTSOuteTTS

LLM-n'okpuru TTS na-agbagharị na CPU, GPU, mọọbụ nchọgharị site na llama.cpp na Transformers.js.

Asụsụ: en

Kpọnye ụda

Pocket TTSPocket TTS

Lightweight 100M parameter model site na Kyutai na ụda na-ebuli site na saịmpọn.

Asụsụ: en, fr

Kpọnye ụda

CosyVoice3CosyVoice3

Next-generation multilingual TTS with bi-streaming, emotion control, and zero-shot voice cloning.

Asụsụ: en, zh, ja, ko, de, es, fr, it, ru

Kpọnye ụda

MOSS-TTSMOSS-TTS

Ultra-long 20-language TTS supporting up to 1 hour of continuous generation with phoneme-level control.

Asụsụ: en, zh, de, es, fr, ja, it, hu, ko, ru, fa, ar, pl, pt, cs, da, sv, el, tr

Kpọnye ụda

MegaTTS3MegaTTS3

ByteDance's sparse alignment TTS with adjustable intelligibility vs. speaker similarity.

Asụsụ: en, zh

Kpọnye ụda

Developer-First API

OpenAI-compatible REST API. One endpoint, 22+ models. Streaming support for real-time applications.

  • OpenAI-compatible format
  • TTS na-edebata maka usoroiheomume oge n'eziokwu
  • Nhazi batch maka ọrụ ndị dị ukwuu
  • Ndesịta ozi ndị ahụ
Gosi dọkumenti API
pip install ttsai npm install @ttsainpm/ttsai
Python
from tts_ai import TTSClient

client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
    text="Hello from TTS.ai!",
    model="kokoro",
    voice="af_bella",
)
client.save(audio, "output.mp3")

Nnukwu, n'okporo ụzọ price

Bido n'efu. Nhazi dịka ị na-etolite.

Ọfụụ

$0

15,000 akara

  • Kokoro, Piper, VITS, MeloTTS
  • 500 akara
  • 3 gen/ọnụọgụgụ (enweghị akaụntụ)
Akaụntụ

Nhazi

$9/ọnwa

500,000 characters/month

  • Ụdị 22+ niile
  • 100,000 akara kwa mbido
  • Klọnsị ụda
Bido
Nke kacha amasị

Nhazi

$29/ọnwa

2,000,000 characters/month

  • Ihe nile na mbido
  • Ikikere API
  • Nhazi ihenlereanya
Nweta Pro

Ụlọọrụ

$99/ọnwa

10,000,000 characters/month

  • Ihe niile na Pro
  • Bulk API
  • Òtù n'ihu
Nweta ọrụ

Gosi usoroiheomume niile gụnyere akara pake →

Ajụjụ ndị a na-ajụkarị

TTS.ai bụ ikpo okwu olu AI kachasị zuru oke, na-enye 22 + ntinye-na-asụsụ, okwu cloning, okwu-na-asụsụ, na ngwaọrụ ụda. Models niile bụ isi na-emeghe na enweghị onye na-ere ahịa.

Ee! TTS.ai na-enye ntinye akwụkwọ n'efu na Kokoro, Piper, VITS, na MeloTTS. Ọ dịghị akaụntụ chọrọ. Tinye ka ị nweta akara 15,000 n'efu ma nweta ụdị niile. Nkwekọrịta na-akwụ ụgwọ na $ 9 / ọnwa.

Maka ọsọ, jiri Kokoro mọọbụ Piper. Maka ogo, jiri CosyVoice 2 mọọbụ StyleTTS 2. Maka ịkọsa ụda, jiri Chatterbox mọọbụ GPT-SoVITS. Maka dìilọọgụ, jiri Dia TTS. Jiri móòdù dị iche iche na ngwe ahụ ka ịtụle.

Ee. OpenAI-compatible REST API maka TTS, STT, okwu cloning, na audio tools. Available na Pro ($ 29 / mo) na Enterprise ($ 99 / mo) plans. View documentation at tts.ai/api /.

Nhazi ụda dị iche iche site na móòdù. Premium móòdù dị ka CosyVoice 2, StyleTTS 2, na Chatterbox na-eweta ụda dị ka ụda mmadụ na-egosipụta ụda na mmetụta uche. Free mòdù dị ka Kokoro na-enye ụda dị mma maka ihe ndị a na-ejikarị.

TTS.ai na-akwado asụsụ 30+ n'ime model library ya. English nwere nkwado model nke dị n'ime, mana ụdị dị ka CosyVoice 2 na-ekpuchi Chinese, Japanese, na Korean; GPT-SoVITS na-ejikwa Chinese, Japanese, Korean, na English; na MeloTTS na-akwado English, Spanish, French, Chinese, Japanese, na Korean.

Ee. Nhazi niile na-eme na sava GPU anyị. Anyị anaghị etinye ngwe gị ma ọ bụ ụda ịkewapụtara mgbe a na-eziga ya. A na-eji ụda ndị a na-ebubata maka ịkọnye naanị maka oge mmem ọfụụ ma a na-echekwa ha. Anyị anaghị etinye data gị n'aka ndị ọzọ ma ọ bụ jiri ya rụọ ọrụ maka ịkụzi ụdị.

Ee. Ọdịdị niile e mepụtara na TTS.ai bụ gị iji jiri ya n'ụzọ azụmahịa, gụnyere maka vidiyo YouTube, podcasts, audiobooks, ngwa, mgbasa ozi, na ngwaahịa. Ụdị anyị bụ isi mmalite mepere emepe n'okpuru ikike ikike (MIT, Apache 2.0). Enweghị ikike ma ọ bụ ikike achọrọ.

TTS.ai na-eweta ụda na WAV format site na difọ́ọ̀ltụ̀ maka ogo kacha nta. I nwere ike ịgbanwee ka MP3, FLAC, OGG, mọọbụ M4A site na iji anyị n'efu Audio Converter tool. API na-akwado ịkọwapụta gị n'aka ekpe ọfụụ output format n'ime arịrịọ ahụ.

Bipụta ụda dị mkpirikpi (ihe dị ka sekọnd 5) nke ụda ịchọrọ ịklonye, wee tinye ngwe ọbụla iji mepụta ụda na ụda ahụ. Models dị ka Chatterbox, GPT-SoVITS, na CosyVoice 2 na-akwado ịklonye ụda. Ụda a klonyekwara na-echekwa ụda, ụda, nakwa ụda okwu.

Free models (Kokoro, Piper, VITS, MeloTTS) chọrọ akaụntụ ọ bụla na-akwụ ụgwọ akara sekọnd. Standard models (2,000 characters/1K input) gụnyere Bark, CosyVoice 2, F5-TTS, na Dia. Premium models (4,000 characters/1K input) gụnyere OpenVoice, Chatterbox, StyleTTS 2, na Tortoise. Paid models na-enyekarị mma dị elu, ụda ndị ọzọ, na atụmatụ ndị ọzọ dị ka ịkọ ụda.

Ee. API na-akwado usoroiheomume batch maka ịgbanwe nnukwu ọnụọgụgụ nke ngwe ka ọsụsọ. Tinye ọtụtụ arịrịọ ma nweta nsonaazụ n'ụzọ asynchronous site na iji ọrụ UUIDs. Enterprise plans ($99/mo) na-agụnye nbanye ntọala ntọala maka usoroiheomume batch ngwa ngwa. Ideal for audiobook production, course content, and large-scale voiceover projects.
4.1/5 (21)

Gịnị ka anyị ga-eme ka ọ dịrị mma? Ntụziaka gị na-enyere anyị aka idozi nsogbu.

Bido iji ụda AI taa

Join creators, developers, na ụlọ ọrụ na-eji TTS.ai