Free AI Text to Speech

20+ open-source models, 107+ voices, 32+ languages. No account required.

1K+
Creators
2K+
Content
20+
AI Models
107+
Voices
5,000 characters per generation · 15,000 free characters · No credit card · Commercial use OK

Everything you need for AI voice

30+ tools powered by open-source AI models

20+ AI Voice Models

The most complete collection of open-source TTS models in one place

Kokoro (Free)

Kokoro is an 82-million-parameter text-to-speech model that punches well above its weight class. Despite its small size, it produces natural, high-quality speech. Kokoro supports multiple languages, including English, Japanese, Chinese, and Korean, with a variety of voices. It is fast, generating audio about 100x faster than real time on a GPU.

Best for: High-quality TTS with minimal resources, production applications

Piper (Free)

Piper is a fast, local neural text-to-speech engine from Rhasspy that uses the VITS and Larynx architectures. It runs efficiently on CPU, which makes it a good fit for edge devices, home automation, and applications that need offline TTS. With more than 100 voices across 30+ languages, Piper delivers real-time speech synthesis even on a Raspberry Pi 4.

Best for: Offline use, accessibility, and embedded applications

VITS (Free)

VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end TTS method that produces more natural-sounding audio than current two-stage models. It combines variational inference augmented with normalizing flows and an adversarial training process, which improves the expressive power of the generative model.

Best for: General-purpose text-to-speech with natural prosody

MeloTTS (Free)

MeloTTS from MyShell.ai is a multilingual TTS library that supports English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is fast enough for real-time inference on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.

Best for: Production applications that need speed, multilingual TTS

Bark (Standard)

Transformer-based text-to-audio model that generates speech, music, and sound effects.

Developer: Suno · License: MIT

Bark Small (Standard)

A smaller version of Bark with lighter inference requirements.

Developer: Suno · License: MIT

CosyVoice 2 (Standard)

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Developer: Alibaba (Tongyi Lab) · License: Apache 2.0

Dia TTS (Standard)

Multi-speaker dialogue generation model that produces conversations between speakers.

Developer: Nari Labs · License: Apache 2.0

Parler TTS (Standard)

Describe the voice you want in natural language, and Parler generates matching speech.

Developer: Hugging Face · License: Apache 2.0

GLM-TTS (Standard)

Has the lowest character error rate among open-source TTS models.

Developer: Zhipu AI · License: GLM-4 License

IndexTTS-2 (Standard)

Zero-shot TTS with precise, high-quality emotion control.

Developer: Index Team · License: Bilibili Model License

Spark TTS (Standard)

Voice-cloning TTS with controllable voice and speaking style via prompts.

Developer: SparkAudio · License: CC BY-NC-SA 4.0

GPT-SoVITS (Standard)

Few-shot voice-cloning TTS that can clone any voice from 5 seconds of audio.

Developer: RVC-Boss · License: MIT

Orpheus (Standard)

Human-level emotional TTS model trained on 100K hours of speech data.

Developer: Canopy Labs · License: Llama 3.2 Community

Qwen3 TTS (Standard)

Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.

Developer: Alibaba (Qwen) · License: Apache 2.0

Chatterbox (Premium)

State-of-the-art zero-shot voice cloning with emotion control, from Resemble AI.

Tortoise TTS (Premium)

Multi-voice text-to-speech focused on quality, built on an autoregressive architecture.

StyleTTS 2 (Premium)

Human-level text-to-speech through style diffusion and adversarial training.

OpenVoice (Premium)

Instant voice cloning with fine-grained control over style, emotion, and accent.

Sesame CSM (Premium)

Conversational speech model that produces natural, engaging speech.

CosyVoice 2

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Languages: en, zh, ja, ko, fr, de, it, es

GLM-TTS

Has the lowest character error rate among open-source TTS models.

Languages: en, zh

IndexTTS-2

Zero-shot TTS with precise, high-quality emotion control.

Languages: en, zh

Spark TTS

Voice-cloning TTS with controllable voice and speaking style via prompts.

Languages: en, zh

GPT-SoVITS

Few-shot voice-cloning TTS that can clone any voice from 5 seconds of audio.

Languages: en, zh, ja, ko

Chatterbox

State-of-the-art zero-shot voice cloning with emotion control, from Resemble AI.

Languages: en

Tortoise TTS

Multi-voice text-to-speech focused on quality, built on an autoregressive architecture.

Languages: en

OpenVoice

Instant voice cloning with fine-grained control over style, emotion, and accent.

Languages: en, zh, ja, ko, fr, de, es, it

Qwen3 TTS

Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.

Languages: en, zh, ja, ko, de, fr, ru, pt, es, it
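The per-model language lists above can be turned into a simple lookup. The sketch below is illustrative only: the dictionary copies the language codes shown for each voice-cloning model, and the helper name `models_for_language` is hypothetical, not part of any TTS.ai SDK.

```python
from __future__ import annotations

# Language codes copied from the voice-cloning model list above.
CLONING_MODEL_LANGUAGES = {
    "CosyVoice 2": ["en", "zh", "ja", "ko", "fr", "de", "it", "es"],
    "GLM-TTS": ["en", "zh"],
    "IndexTTS-2": ["en", "zh"],
    "Spark TTS": ["en", "zh"],
    "GPT-SoVITS": ["en", "zh", "ja", "ko"],
    "Chatterbox": ["en"],
    "Tortoise TTS": ["en"],
    "OpenVoice": ["en", "zh", "ja", "ko", "fr", "de", "es", "it"],
    "Qwen3 TTS": ["en", "zh", "ja", "ko", "de", "fr", "ru", "pt", "es", "it"],
}


def models_for_language(code: str) -> list[str]:
    """Return the listed cloning models that support the given language code."""
    wanted = code.strip().lower()
    return [
        name
        for name, languages in CLONING_MODEL_LANGUAGES.items()
        if wanted in languages
    ]


if __name__ == "__main__":
    # Hypothetical helper: list cloning models that cover Japanese.
    print(models_for_language("ja"))
```

Keeping the data in one dictionary means a new model or language only needs a single entry to be updated.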

Developer-First API

OpenAI-compatible REST API. One endpoint, 22+ models. Streaming support for real-time applications.

  • OpenAI-compatible format
  • Streaming TTS for real-time applications
  • Batch processing for large jobs
  • Notification delivery
View API documentation
pip install ttsai
npm install @ttsainpm/ttsai
Python
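from tts_ai import TTSClient

# Authenticate with your API key (placeholder shown).
client = TTSClient(api_key="sk-tts-xxx")

# Generate speech with the Kokoro model and the "af_bella" voice.
audio = client.generate(
    text="Hello from TTS.ai!",
    model="kokoro",
    voice="af_bella",
)

# Write the generated audio to disk.
client.save(audio, "output.mp3")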
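Because the API is described as OpenAI-compatible, a raw HTTP request would presumably resemble the body of OpenAI's `POST /v1/audio/speech` endpoint (`model`, `input`, `voice`, `response_format`). The sketch below only builds such a request with the standard library and does not send it. The base URL is a deliberate placeholder, and the path and field names are assumptions carried over from OpenAI's API, not values confirmed by this page.

```python
import json
import urllib.request

# ASSUMPTION: placeholder host; the real base URL is in the TTS.ai API docs.
ASSUMED_BASE_URL = "https://example.invalid"
# ASSUMPTION: path mirrors OpenAI's speech endpoint.
ASSUMED_SPEECH_PATH = "/v1/audio/speech"


def build_speech_request(api_key, text, model="kokoro", voice="af_bella",
                         response_format="wav"):
    """Build (but do not send) an OpenAI-style text-to-speech request."""
    payload = {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
    }
    return urllib.request.Request(
        url=ASSUMED_BASE_URL + ASSUMED_SPEECH_PATH,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    request = build_speech_request("sk-tts-xxx", "Hello from TTS.ai!")
    print(request.get_method(), request.full_url)
    print(request.data.decode("utf-8"))
```

Once the real endpoint is known, the request object could be passed to `urllib.request.urlopen` and the response bytes written to a file.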

Simple, transparent pricing

Start free. Scale as you grow.

Free

$0

15,000 characters

  • Kokoro, Piper, VITS, MeloTTS
  • 500 characters
  • 3 generations per period (no account)

Starter

$9/month

500,000 characters/month

  • All 22+ models
  • 100,000 characters per generation
  • Voice cloning
Pro (most popular)

$29/month

2,000,000 characters/month

  • Everything in Starter
  • API access
  • Model customization

Enterprise

$99/month

10,000,000 characters/month

  • Everything in Pro
  • Bulk API
  • Priority queue

View all plans, including character packs →
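The tier allowances above lend themselves to a quick check of which plan covers a given volume. This is a minimal sketch under stated assumptions: it labels the four tiers Free, Starter, Pro, and Enterprise, treats the Free tier's 15,000 characters as a monthly allowance (the page does not say), and ignores character packs and any per-model multipliers. The function name `cheapest_plan` is hypothetical.

```python
# (plan name, price in USD per month, character allowance)
# ASSUMPTION: the Free allowance is treated as monthly for comparison.
PLANS = [
    ("Free", 0, 15_000),
    ("Starter", 9, 500_000),
    ("Pro", 29, 2_000_000),
    ("Enterprise", 99, 10_000_000),
]


def cheapest_plan(monthly_characters):
    """Return (name, price) of the cheapest listed plan that covers the volume.

    Returns None when the volume exceeds every listed allowance.
    """
    if monthly_characters < 0:
        raise ValueError("monthly_characters must be non-negative")
    # PLANS is ordered by price, so the first match is the cheapest.
    for name, price, allowance in PLANS:
        if monthly_characters <= allowance:
            return name, price
    return None


if __name__ == "__main__":
    for volume in (10_000, 750_000, 12_000_000):
        print(volume, cheapest_plan(volume))
```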

Frequently Asked Questions

TTS.ai is the most comprehensive AI voice platform, offering 22+ text-to-speech models, voice cloning, speech-to-text, and audio tools. All models are open source, with no vendor lock-in.

Yes! TTS.ai offers free text-to-speech with Kokoro, Piper, VITS, and MeloTTS. No account is required. Sign up to get 15,000 free characters and access to all models. Paid plans start at $9/month.

For speed, use Kokoro or Piper. For quality, use CosyVoice 2 or StyleTTS 2. For voice cloning, use Chatterbox or GPT-SoVITS. For dialogue, use Dia TTS. Try different models on the same text to compare them.

Yes. There is an OpenAI-compatible REST API for TTS, STT, voice cloning, and audio tools. It is available on the Pro ($29/mo) and Enterprise ($99/mo) plans. View the documentation at tts.ai/api/.

Audio quality varies by model. Premium models such as CosyVoice 2, StyleTTS 2, and Chatterbox produce near-human speech with natural intonation and emotion. Free models such as Kokoro deliver good quality for common use cases.

TTS.ai supports 30+ languages across its model library. English has the broadest model support, but models such as CosyVoice 2 cover Chinese, Japanese, and Korean; GPT-SoVITS handles Chinese, Japanese, Korean, and English; and MeloTTS supports English, Spanish, French, Chinese, Japanese, and Korean.

Yes. All processing happens on our GPU servers. We do not store your text or generated audio after it has been delivered. Audio uploaded for cloning is used only for the duration of your session and is stored securely. We do not share your data with third parties or use it to train models.

Yes. All audio generated on TTS.ai is yours to use commercially, including for YouTube videos, podcasts, audiobooks, apps, advertisements, and products. Our models are open source under permissive licenses (MIT, Apache 2.0). No additional license or permission is required.

TTS.ai generates audio in WAV format by default for the highest quality. You can convert it to MP3, FLAC, OGG, or M4A with our free Audio Converter tool. The API supports specifying your output format directly in the request.

Upload a short audio sample (about 5 seconds) of the voice you want to clone, then enter any text to generate speech in that voice. Models such as Chatterbox, GPT-SoVITS, and CosyVoice 2 support voice cloning. The cloned voice preserves the timbre, tone, and speaking style of the original.

Free models (Kokoro, Piper, VITS, MeloTTS) require no account and no paid credits. Standard models (2,000 characters/1K input) include Bark, CosyVoice 2, F5-TTS, and Dia. Premium models (4,000 characters/1K input) include OpenVoice, Chatterbox, StyleTTS 2, and Tortoise. Paid models generally offer higher quality, more voices, and additional features such as voice cloning.
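The per-tier rates quoted above (2,000 characters per 1K input for Standard, 4,000 per 1K input for Premium) amount to 2x and 4x multipliers on the input length. The sketch below estimates how much allowance a piece of text would consume. It assumes usage is pro-rated per character and rounded up, which the page does not state; the rate for Free models is not given clearly, so it is left out. The function name `quota_characters` is hypothetical.

```python
# Allowance characters consumed per 1,000 input characters, as quoted above.
RATE_PER_1K_INPUT = {
    "standard": 2_000,
    "premium": 4_000,
}


def quota_characters(input_chars, tier):
    """Estimate allowance characters consumed by input_chars of text.

    ASSUMPTION: usage is pro-rated per character and rounded up.
    """
    if input_chars < 0:
        raise ValueError("input_chars must be non-negative")
    rate = RATE_PER_1K_INPUT[tier.lower()]
    # Integer ceiling division avoids floating-point rounding surprises.
    return (input_chars * rate + 999) // 1000


if __name__ == "__main__":
    print(quota_characters(1_500, "standard"))
    print(quota_characters(1_500, "premium"))
```

Under these assumptions, a 500,000-character monthly allowance would cover about 250,000 input characters on Standard models or 125,000 on Premium models.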

Yes. The API supports batch processing for converting large volumes of text to speech. Submit multiple requests and retrieve the results asynchronously using job UUIDs. Enterprise plans ($99/mo) include priority queue access for faster batch processing. Ideal for audiobook production, course content, and large-scale voiceover projects.
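For long texts such as audiobooks, a batch workflow usually starts by splitting the text into pieces that each fit within a per-request limit. The sketch below shows one way to do that, preferring sentence boundaries. The default limit of 5,000 characters and the function name `split_for_batch` are assumptions for illustration; the actual batch endpoint and job UUID format are not described on this page, so they are not modeled here.

```python
from __future__ import annotations

import re


def split_for_batch(text: str, max_chars: int = 5000) -> list[str]:
    """Split text into chunks of at most max_chars, preferring sentence ends.

    ASSUMPTION: max_chars stands in for whatever per-request limit applies.
    """
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")
    # Split after sentence-ending punctuation that is followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        # Hard-wrap any single sentence that exceeds the limit on its own.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    for chunk in split_for_batch("One. Two. Three.", max_chars=9):
        print(repr(chunk))
```

Each chunk could then be submitted as its own request, with the resulting audio files concatenated in order.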

Start using AI voice today

Join creators, developers, and businesses using TTS.ai