Free AI Tọghata ngwe ka ọsụsọ
20+ Open-source models, 107+ ụda 32+ Achọrọghị akaụntụ.
Ihe niile ịchọrọ maka ụda AI
30+ tools powered by open-source AI models
20+ Ụdị ụda AI
Nchịkọta zuru ezu nke open-source TTS models na mbido otu
Kokoro Free
Kokoro bụ 82 million parameter text-to-speech model nke na-adọta n'ụzọ dị mma n'elu klas ya. Ọ bụ ezie na ọ dị obere, ọ na-emepụta okwu dị mma ma dị mma. Kokoro na-akwado asụsụ dị iche iche gụnyere English, Japanese, Chinese, na Korean na ọtụtụ ụda dị iche iche. Ọ na-arụ ọrụ ngwa ngwa - na-emepụta ụda dị ka 100x ngwa ngwa karịa oge dị ugbu a na GPU.
Ọkachasị maka: TTS nke dị elu n'ụdị na-abaghị uru, usoroiheomume nbudata
Chọpụta
Piper Free
Piper bụ engine ngwe-na-asụsụ na-asụgharị nke Rhasspy na-eji VITS na larynx architectures. Ọ na-arụ ọrụ nke ọma na CPU, na-eme ka ọ dị mma maka ngwaọrụ edge, ụlọ ọrụ na-arụ ọrụ, na ngwa ọrụ chọrọ TTS na-enweghị njikọ. Na okwu 100 n'elu asụsụ 30 +, Piper na-enye okwu na-asụgharị na-asụgharị na-asụgharị na-asụgharị na Raspberry Pi 4.
Ọkachasị maka: Nlebiritụanya nkịtị, ikikembanye, nakwa usoroiheomume embedded
Chọpụta
VITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) bụ ụzọ TTS na-aga n'ihu na-aga n'ihu nke na-emepụta ụda dị mma karịa ụdị abụọ dị ugbu a. Ọ na-ewere nghọta dị iche iche na-abawanye na ntụgharị na usoro nkuzi na-aga n'ihu, na-eme ka ọ dịkwuo mma n'ụzọ dị mfe.
Ọkachasị maka: General-purpose text-to-speech na natural prosody
Chọpụta
MeloTTS Free
MeloTTS site na MyShell.ai bụ multilingual TTS library na-akwado English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, na Korean. Ọ dị ngwa ngwa, na-arụ ọrụ ngwe na-adịgide adịgide na CPU naanị. MeloTTS e mepụtara maka ọrụ mmepụta na-akwado CPU na GPU inference.
Ọkachasị maka: Usoroiheomume mmepe na-achọ ngwa ngwa, TTS n'asụsụ dị iche iche
Chọpụta
Bark Standard
Transform-based text-to-audio model nke na-emepụta okwu, egwu, na mmetụta ụda.
Debanye aha: Suno · Ikikere: MIT
Jiri ya
Bark Small Standard
Ụdị dị n'okpuru nke Bark na-eji nghọta dị n'okpuru.
Debanye aha: Suno · Ikikere: MIT
Jiri ya
CosyVoice 2 Standard
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Debanye aha: Alibaba (Tongyi Lab) · Ikikere: Apache 2.0
Jiri ya
Dia TTS Standard
Multi-speaker dialog generation model nke na-ebipụta nchọgharị n'etiti ndị na-ekwu okwu.
Debanye aha: Nari Labs · Ikikere: Apache 2.0
Jiri ya
Parler TTS Standard
Depụta ụda ịchọrọ n'asụsụ na-emeghị n'aka na Parler ga-eweta ụda dị n'otu.
Debanye aha: Hugging Face · Ikikere: Apache 2.0
Jiri ya
GLM-TTS Standard
Na-enwe ụkpụrụ nke ndehie akara ala n'etiti okporo ụzọ TTS model.
Debanye aha: Zhipu AI · Ikikere: GLM-4 License
Jiri ya
IndexTTS-2 Standard
Zero-shot TTS na-ejikwa mmetụta uche nke dị mma nakwa n'ụzọ dị elu.
Debanye aha: Index Team · Ikikere: Bilibili Model License
Jiri ya
Spark TTS Standard
Klọnaịsị ụda TTS n'ụdị ụda na-achịkwa ya nakwa n'ụdị okwu site n'ịjụjụ.
Debanye aha: SparkAudio · Ikikere: CC BY-NC-SA 4.0
Jiri ya
GPT-SoVITS Standard
Few-shot ụda na-ebuli TTS nke na-ebuli ụda ọbụla site na sekọnd 5 nke ụda.
Debanye aha: RVC-Boss · Ikikere: MIT
Jiri ya
Orpheus Standard
Human-level emotional TTS model trained on 100K hours of speech data.
Debanye aha: Canopy Labs · Ikikere: Llama 3.2 Community
Jiri ya
Qwen3 TTS Standard
Alibaba's multilingual TTS na ụda cloning, preset ụda, na ụda nhazi site na ngwe.
Debanye aha: Alibaba (Qwen) · Ikikere: Apache 2.0
Jiri ya
CosyVoice 2
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Asụsụ: en, zh, ja, ko, fr, de, it, es
Kpọnye ụda
IndexTTS-2
Zero-shot TTS na-ejikwa mmetụta uche nke dị mma nakwa n'ụzọ dị elu.
Asụsụ: en, zh
Kpọnye ụda
Spark TTS
Klọnaịsị ụda TTS n'ụdị ụda na-achịkwa ya nakwa n'ụdị okwu site n'ịjụjụ.
Asụsụ: en, zh
Kpọnye ụda
GPT-SoVITS
Few-shot ụda na-ebuli TTS nke na-ebuli ụda ọbụla site na sekọnd 5 nke ụda.
Asụsụ: en, zh, ja, ko
Kpọnye ụda
Chatterbox
State-of-the-art zero-shot ụda ịkọsa na nchịkwa mmetụta site na Resemble AI.
Asụsụ: en
Kpọnye ụda
Tortoise TTS
Multi-voice text-to-speech na-atụle na mma na-eji autoregressive architecture.
Asụsụ: en
Kpọnye ụda
OpenVoice
Klọnaịsị ụda n'oge na-adịghị anya na nlekọta nkịtị n'elu ụdị, mmetụta, nakwa ụda.
Asụsụ: en, zh, ja, ko, fr, de, es, it
Kpọnye ụda
Qwen3 TTS
Alibaba's multilingual TTS na ụda cloning, preset ụda, na ụda nhazi site na ngwe.
Asụsụ: en, zh, ja, ko, de, fr, ru, pt, es, it
Kpọnye ụdaDeveloper-First API
OpenAI-compatible REST API. One endpoint, 22+ models. Streaming support for real-time applications.
- OpenAI-compatible format
- TTS na-edebata maka usoroiheomume oge n'eziokwu
- Nhazi batch maka ọrụ ndị dị ukwuu
- Ndesịta ozi ndị ahụ
pip install ttsai
npm install @ttsainpm/ttsai
from tts_ai import TTSClient
client = TTSClient(api_key="sk-tts-xxx")
audio = client.generate(
text="Hello from TTS.ai!",
model="kokoro",
voice="af_bella",
)
client.save(audio, "output.mp3")