Huru AI Text to Speech

22+ Waigaji wa wazi wa misaada, sauti 100+, 32+ Hakuna simulizi lililohitajiwa.

0/500 wahusika Huru
Hakuna kadi ya mkopo Sifa 50 bila malipo 32+ lugha Matumizi ya Biashara Sawa
0:00 / 0:00
Download Audio Kiungo kinakufa mnamo 24
TTS.ai? Waeleze rafiki zako!

22+ Picha za Sauti

Ukurasa wa makini zaidi wa wasifu wa TTS ulio wazi katika jukwaa moja

KokoroKokoro Free

Kokoro is an 82 million parameter text-to-speech model that punches well above its weight class. Despite its tiny size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages including English, Japanese, Chinese, and Korean with a variety of expressive voices. It runs incredibly fast — generating audio nearly 100x faster than real-time on a GPU.

Faida kwa: High-quality TTS with minimal latency, streaming applications

Jaribu Kuwa Huru

PiperPiper Free

Piper is a lightweight text-to-speech engine developed by Rhasspy that uses VITS and larynx architectures. It runs entirely on CPU, making it ideal for edge devices, home automation, and applications requiring offline TTS. With over 100 voices across 30+ languages, Piper delivers natural-sounding speech at real-time speeds even on a Raspberry Pi 4.

Faida kwa: Quick previews, accessibility, and embedded applications

Jaribu Kuwa Huru

VITSVITS Free

VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.

Faida kwa: General-purpose text-to-speech with natural prosody

Jaribu Kuwa Huru

MeloTTSMeloTTS Free

MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.

Faida kwa: Matumizi ya Utayarishaji Wenye Kuhitaji TTS

Jaribu Kuwa Huru

BarkBark Standard

Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.

Develer: Suno · License: MIT

Jaribu kufanya hivyo

Bark SmallBark Small Standard

Lighter version of Bark with faster inference and lower memory usage.

Develer: Suno · License: MIT

Jaribu kufanya hivyo

CosyVoice 2CosyVoice 2 Standard

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Develer: Alibaba (Tongyi Lab) · License: Apache 2.0

Jaribu kufanya hivyo

Dia TTSDia TTS Standard

Muundo wa viyombe vya kinenani unaotokeza mazungumzo ya kiasili kati ya wasemaji.

Develer: Nari Labs · License: Apache 2.0

Jaribu kufanya hivyo

Parler TTSParler TTS Standard

Describe the voice you want in natural language and Parler generates matching speech.

Develer: Hugging Face · License: Apache 2.0

Jaribu kufanya hivyo

IndexTTS-2IndexTTS-2 Standard

Zero-shot TTS with fine-grained emotion control and high expressiveness.

Develer: Index Team · License: Apache 2.0

Jaribu kufanya hivyo

Spark TTSSpark TTS Standard

Voice cloning TTS with controllable emotion and speaking style via prompts.

Develer: SparkAudio · License: Apache 2.0

Jaribu kufanya hivyo

GPT-SoVITSGPT-SoVITS Standard

Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.

Develer: RVC-Boss · License: MIT

Jaribu kufanya hivyo

OrpheusOrpheus Standard

Human-level emotional TTS model trained on 100K hours of speech data.

Develer: Canopy Labs · License: Llama 3.2 Community

Jaribu kufanya hivyo

Qwen3 TTSQwen3 TTS Standard

Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.

Develer: Alibaba (Qwen) · License: Apache 2.0

Jaribu kufanya hivyo

ChatterboxChatterbox Premium

Sauti ya Taifa-of-the-art sufuri - imetokana na udhibiti wa hisia - moyo kutoka Resemble AI.

Ubora:

Jaribu kufanya hivyo

Tortoise TTSTortoise TTS Premium

Maandishi ya kigeni-to-speech yalikazia ubora wa muundo wa mtu binafsi.

Ubora:

Jaribu kufanya hivyo

StyleTTS 2StyleTTS 2 Premium

Human-level text-to-speech through style diffusion and adversarial training.

Ubora:

Jaribu kufanya hivyo

OpenVoiceOpenVoice Premium

Sauti nzito sana huibuka kwa kutumia mawimbi ya sauti juu ya mtindo, hisia, na matamshi.

Ubora:

Jaribu kufanya hivyo

CosyVoice 2CosyVoice 2

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Lugha: en, zh, ja, ko, fr, de, it, es

Sauti ya Clone

IndexTTS-2IndexTTS-2

Zero-shot TTS with fine-grained emotion control and high expressiveness.

Lugha: en, zh

Sauti ya Clone

Spark TTSSpark TTS

Voice cloning TTS with controllable emotion and speaking style via prompts.

Lugha: en, zh

Sauti ya Clone

GPT-SoVITSGPT-SoVITS

Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.

Lugha: en, zh, ja, ko

Sauti ya Clone

ChatterboxChatterbox

Sauti ya Taifa-of-the-art sufuri - imetokana na udhibiti wa hisia - moyo kutoka Resemble AI.

Lugha: en

Sauti ya Clone

Tortoise TTSTortoise TTS

Maandishi ya kigeni-to-speech yalikazia ubora wa muundo wa mtu binafsi.

Lugha: en

Sauti ya Clone

OpenVoiceOpenVoice

Sauti nzito sana huibuka kwa kutumia mawimbi ya sauti juu ya mtindo, hisia, na matamshi.

Lugha: en, zh, ja, ko, fr, de, es, it

Sauti ya Clone

Qwen3 TTSQwen3 TTS

Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.

Lugha: en, zh, ja, ko, de, fr, ru, pt, es, it

Sauti ya Clone

Mzazi wa Kwanza API

Picha ya mwisho, mifano 22+. Inaunga mkono matumizi halisi ya wakati.

  • Muundo wa wazi kabisa
  • Matukio Yanayovutia kwa ajili ya programu za wakati halisi
  • Kutayarisha Back kwa ajili ya kazi kubwa
  • Vituo vya Internet vinavyoonyesha ndoa kati ya ndoa na mtu mwingine
Mwono API Docs
Python
import requests

response = requests.post(
    "https://api.tts.ai/v1/tts/",
    headers={"Authorization": "Bearer sk-tts-xxx"},
    json={
        "model": "kokoro",
        "text": "Hello from TTS.ai!",
        "voice": "af_bella",
    }
)

with open("output.mp3", "wb") as f:
    f.write(response.content)

Njia Rahisi na Inayobadilika

Anzisha mizani unapokua.

Huru

$0

sifa 50

  • Kokoro, Piper, VITS, MeloTTS
  • Mpaka 500 wa herufi
  • 3 gen/hour (hakuna hesabu)
Fanyeni Ishara kwa Hiari

keyboard label

$9/mo

Namba 500 za mikopo/miezi

  • Waigaji wote 22+
  • Mipaka ya tabia 5,000
  • Sauti Yaungana
Anza
Wanapendwa Sana

Project

$29/mo

2,000 Sh. Sh.

  • Kila Kitu Kinaanza
  • Njia ya kuingia
  • Matayarisho ya Kabla ya Ndoa
Fanya Maendeleo

↓ ↓

$99/mo

10,000 sifa/miezi

  • Kila Kitu cha Kutoa
  • Bulk API
  • Sehemu ya mbele ya foleni
Mauzo ya Mawasiliano

View all plans including credit packs →

Maswali Ambayo Watu Huuliza Mara Nyingi

TTS.ai ndio jukwaa la sauti la AI, linalotoa violezo 22-to-speech, uundaji wa sauti, uandishi wa sauti, na vyombo vya sauti.

Ndiyo, TTS.ai inatoa ujumbe huru na Kokoro, Piper, VITS, na MeloTS. Hakuna anayetakiwa.

Kwa mwendo wa kasi, tumia Kokoro au Piper. Kwa ubora, jaribuni CosyVoice 2 au StyTTS 2. Ili ufanyizaji wa sauti, tumia alama Chatterbox au GPT-SHITS.

Ndiyo. Hebu openAI-kisidentity RES API for TTS, STST, uundaji wa sauti, na vyombo vya sauti. Inapatikana kwenye Propo (139/mo) na mipango (dola 99/mo). Tas ploment in tts.ai/api/.

Sifa ya sauti hutofautiana kwa muundo wa kimitindo kama CosyVoice 2, na Chatterboksi hutokeza karibu hotuba ya ubora wa binadamu yenye asili ya taifa na hisia. Maumbo huru kama Kokoro hutoa ubora bora kabisa kwa ajili ya visa vingi.

Jarida la Kiingereza (TTS.ai) linaunga mkono lugha 30+ katika maktaba yake ya mfano.

Sisi hatuweki habari zako kwenye kompyuta baada ya kuzitoa. Tunatumia sauti zilizopakiwa kwa ajili ya kipindi cha sasa na hatujazihifadhi.

Yes. All audio generated on TTS.ai is yours to use commercially, including for YouTube videos, podcasts, audiobooks, apps, advertisements, and products. Our models are open source under permissive licenses (MIT, Apache 2.0). No royalties or attribution required.

TTS.ai inaamsha sauti kwenye tovuti ya WAV kwa kiwango cha juu kabisa. Unaweza kubadilisha kuwa MP3, FAC, OGG, au M4A kwa kutumia chombo chetu cha bure cha Audio Transformer. API inaunga mkono kuonyesha wazi muundo wako unaopendelewa wa kitokezwaji moja kwa moja katika ombi hilo.

Upload a short audio sample (as little as 5 seconds) of the voice you want to clone, then type any text to generate speech in that voice. Models like Chatterbox, GPT-SoVITS, and CosyVoice 2 support voice cloning. The cloned voice captures tone, accent, and speaking style.

Waigaji huru (Kokoro, Piper, VITS, MeloTTS) hawahitaji akaunti na gharama. Mifano ya kawaida (watu wenye sifa/1K) ikiwa ni pamoja na Bark, CosyVoice 2, F5-TTSS, na Dia. Premium violezo (na sifa/1K) ikiwa ni pamoja na kipenVoice, Chatterbox, SCT 2, na Tortoise. Kwa ujumla, wanamitindo bora, sauti zaidi, na sauti zaidi kama sauti.

Ndiyo. API inaunga mkono hatua za kubadili maandishi mengi ya kusema. Ruhusu maombi mengi na kupata matokeo kwa kutumia kazi ya UUIDs. Mipango ya kuingilia (dola 99/mo) inajumuisha nafasi za kwanza za kazi kwa ajili ya utengenezaji wa haraka zaidi. Mafaa kwa ajili ya utokezaji wa vitabu vya sauti, masomo, na miradi mikubwa ya sauti.
5.0/5 (1)

Anza Kutumia Sauti ya Mimi Leo

Jiunge na Wakuzaji, wajenzi, na biashara kwa kutumia TTS.ai