
Real-Time Voice Cloning: Clone Any Voice in Seconds

Clone any voice with just 5 seconds of audio. Choose from 9 open-source voice cloning models, including Chatterbox, CosyVoice 2, GPT-SoVITS, and OpenVoice. Cloning is zero-shot, with no training required: upload a sample and generate speech right away. All models are licensed for commercial use.

Real-Time | 5-Second Samples | 9 Cloning Models | Open Source | 17+ Languages | Emotion Control

Real-Time Voice Cloning Features

Clone voices instantly with state-of-the-art AI: no training, no datasets, no waiting.

Zero-Shot

No training, no fine-tuning, no data collection. Upload 5 seconds of audio and get a cloned voice right away. The AI extracts the speaker's characteristics in real time.

9 Cloning Models

Choose from Chatterbox, CosyVoice 2, GPT-SoVITS, OpenVoice, Spark, IndexTTS-2, GLM-TTS, Qwen3-TTS, and Tortoise. Each model has different strengths in quality, speed, and language support.

Cross-Language Cloning

Clone a voice from English audio and generate speech in Chinese, Japanese, Korean, and more. CosyVoice 2 and Qwen3-TTS preserve voice identity across 17+ languages.

Emotion Control

Chatterbox, OpenVoice, and GLM-TTS support emotion-aware generation. Generate the same text with a range of emotional tones while preserving the cloned voice.

Open Source & Commercial

Every cloning model is open source under the MIT or Apache 2.0 license. Use cloned voices in content, products, and applications without licensing fees.

API Access

REST API for programmatic voice cloning. Upload reference audio, provide text, and receive cloned speech. SDKs for Python and JavaScript. Batch cloning for high-volume workflows.

Voice Cloning Models

9 state-of-the-art models for every voice cloning use case

Chatterbox

Premium

State-of-the-art zero-shot voice cloning with emotion control from Resemble AI.

Medium | 5/5 | Voice cloning

Best for: Highest overall quality, with 5-second samples, emotion control, and an MIT license

Try Chatterbox

CosyVoice 2

Standard

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Medium | 5/5 | Voice cloning

Best for: Multilingual cloning that preserves the voice across Chinese, English, Japanese, and Korean

Try CosyVoice 2

OpenVoice

Premium

Instant voice cloning with granular control over style, emotion, and accent.

Medium | 4/5 | Voice cloning

Best for: Fast tone color conversion and style transfer

Try OpenVoice

Spark TTS

Standard

Voice cloning TTS with controllable emotion and speaking style via prompts.

Medium | 4/5 | Voice cloning

Best for: Fastest cloning model, with results in ~12 seconds

Try Spark TTS

IndexTTS-2

Standard

Zero-shot TTS with fine-grained emotion control and high expressiveness.

Medium | 4/5 | Voice cloning

Best for: Strong Chinese-English cloning with high speaker similarity

Try IndexTTS-2

Tortoise TTS

Premium

Multi-voice text-to-speech focused on quality with autoregressive architecture.

Slow | 5/5 | Voice cloning

Best for: Studio-quality results, best suited to audiobooks and narration.

Try Tortoise TTS

How Real-Time Voice Cloning Works

From a short audio sample to unlimited cloned speech

1. Upload Reference Audio

Upload or record 5-30 seconds of clear speech from the voice you want to clone. Use a WAV or MP3 file, or record directly in the browser.

2. Choose a Cloning Model

Select the model that matches your needs: Chatterbox for quality, Spark for speed, CosyVoice 2 for multilingual output.

3. Enter Your Text

Type or paste the text you want spoken in the cloned voice. Any language the chosen model supports will work.

4. Generate & Download

Click generate and hear your cloned voice in 10-25 seconds. Download the result as WAV or MP3 for use anywhere.

How Zero-Shot Voice Cloning Works

No fine-tuning, no dataset collection: just upload and clone

Speaker Embedding Extraction

The AI analyzes your reference audio to extract a speaker embedding, a compact mathematical representation of the voice.

  • Works with as little as 5 seconds of audio
  • Captures the voice's tonal qualities and speaking style
  • No training or fine-tuning required
  • Audio is not stored permanently

Conditioned Speech Generation

The TTS model generates new speech conditioned on the speaker embedding. The output sounds like the reference speaker saying your text, with natural prosody, accurate pronunciation, and the original voice.

  • Generate unlimited speech from a single sample
  • Cross-language cloning (generate speech in languages not present in the reference)
  • Emotion and style transfer
  • Results in 10-25 seconds
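
To make the idea of a speaker embedding more concrete, here is a minimal Python sketch that compares embeddings with cosine similarity, a common way to measure how close two voices are. The page does not say how TTS.ai's models compare or use embeddings, so the function name and the tiny 4-dimensional vectors are purely illustrative assumptions; real embeddings are much larger.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (1.0 = same direction)."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


# Hypothetical toy embeddings, invented for illustration only.
reference = [0.9, 0.1, 0.3, 0.4]
same_speaker = [0.8, 0.2, 0.3, 0.5]
other_speaker = [-0.2, 0.9, -0.5, 0.1]

# A clip from the same speaker should score higher than one from someone else.
print(cosine_similarity(reference, same_speaker) > cosine_similarity(reference, other_speaker))
```

A score near 1.0 means the two vectors point in nearly the same direction, which is why an embedding can stand in for "who is speaking" independently of what is being said.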

Voice Cloning Model Comparison

Choose the right model for your cloning use case

Model       | Min. sample | Speed | Quality   | Languages       | License
Chatterbox  | 5s          | ~21s  | Excellent | EN              | MIT
CosyVoice 2 | 5s          | ~20s  | Very good | CN, EN, JP, KO+ | Apache 2.0
GPT-SoVITS  | 5s          | ~16s  | Very good | CN, EN, JP, KO  | MIT
OpenVoice   | 5s          | ~15s  | Good      | EN, CN, ES, FR+ | MIT
Spark TTS   | 5s          | ~12s  | Good      | CN, EN          | Apache 2.0
IndexTTS-2  | 5s          | ~18s  | Very good | CN, EN          | Apache 2.0
GLM-TTS     | 5s          | ~25s  | Very good | CN, EN          | Apache 2.0
Qwen3-TTS   | 5s          | ~16s  | Very good | CN, EN, JP, KO+ | Apache 2.0
Tortoise    | 15s         | ~60s  | Studio    | EN              | Apache 2.0
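
The comparison rows above can be encoded as data to shortlist models for a given language and sample length. This is a hedged sketch: the `MODELS` list and the `fastest_models` helper are hypothetical names, not part of any TTS.ai SDK, and the language sets contain only the codes named explicitly in the comparison (the "+" entries are not expanded).

```python
# (name, minimum sample in seconds, approximate generation time in seconds, languages)
MODELS = [
    ("Chatterbox", 5, 21, {"EN"}),
    ("CosyVoice 2", 5, 20, {"CN", "EN", "JP", "KO"}),
    ("GPT-SoVITS", 5, 16, {"CN", "EN", "JP", "KO"}),
    ("OpenVoice", 5, 15, {"EN", "CN", "ES", "FR"}),
    ("Spark TTS", 5, 12, {"CN", "EN"}),
    ("IndexTTS-2", 5, 18, {"CN", "EN"}),
    ("GLM-TTS", 5, 25, {"CN", "EN"}),
    ("Qwen3-TTS", 5, 16, {"CN", "EN", "JP", "KO"}),
    ("Tortoise", 15, 60, {"EN"}),
]


def fastest_models(language, sample_seconds):
    """Names of models that list the language and accept the sample length, fastest first."""
    candidates = [
        m for m in MODELS
        if language in m[3] and sample_seconds >= m[1]
    ]
    # sorted() is stable, so models with equal speed keep their table order.
    return [m[0] for m in sorted(candidates, key=lambda m: m[2])]


print(fastest_models("JP", 5))
```

Sorting by speed alone ignores quality and emotion support, so treat the result as a shortlist to try rather than a recommendation.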

What People Use Real-Time Voice Cloning For

From content creation to accessibility: applications made possible by voice cloning.

Audiobook Narration

Authors clone their own voice and produce full audiobooks without spending hours in a recording studio. Mistakes are fixed by regenerating individual sentences instead of re-recording.

Video Dubbing

Dub videos into other languages while keeping the original speaker's voice.

Content Creation

YouTubers, podcasters, and TikTok creators clone their voice for consistent branding. Regenerate narration for new content without recording, or create other-language versions of existing videos.

Accessibility

People who have lost their voice through illness or surgery can preserve it by cloning old recordings. The cloned voice lets them communicate in their own voice through text-to-speech.

Game Development

Clone voice actors and generate unlimited dialogue variations without extra studio sessions. Well suited to indie games, mods, and prototyping, where re-recording every line is not practical.

IVR & Phone Systems

Clone your company's spokesperson.

TTS.ai vs Other Voice Cloning Solutions

Why 9 models beat a single open-source project

Feature                | TTS.ai           | SV2TTS      | ElevenLabs | Resemble AI
Cloning models         | 9                | 1           | 1          | 1
Min. reference length  | 5 sec            | 5 sec       | 30 sec     | 3 min
Training required      | No               | No          | No         | Yes
Audio quality (2025)   | State-of-the-art | Dated       | Very good  | Very good
Emotion control        |                  |             |            |
Cross-language cloning |                  |             |            |
Open source            |                  |             |            |
GPU required           | Cloud            | Yes         | Cloud      | Cloud
API access             |                  |             |            |
Free tier              | 15 credits       | Self-hosted | Limited    |

Voice Cloning API

Clone voices programmatically with our REST API

Python - Voice cloning REST API
from tts_ai import TTSClient

client = TTSClient(api_key="sk-tts-...")

# Clone a voice from a 5-second sample
result = client.clone_voice(
    name="My Cloned Voice",
    file="reference.wav",       # 5-30 seconds of clear speech
    model="chatterbox",         # or cosyvoice2, openvoice, spark...
    text="Hello! This is my cloned voice speaking new text.",
)

# Download the cloned audio
audio = client.poll_result(result.uuid)
with open("cloned_output.wav", "wb") as f:
    f.write(audio)

cURL - Voice cloning REST API
curl -X POST https://api.tts.ai/v1/voice-clone \
  -H "Authorization: Bearer sk-tts-YOUR_KEY" \
  -F "reference=@voice_sample.wav" \
  -F "text=This is my cloned voice." \
  -F "model=chatterbox"
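
For several texts in the same voice, one option is a simple client-side loop over the two SDK calls shown in the Python example above. This is only a sketch under assumptions: it presumes `clone_voice` and `poll_result` behave exactly as in that example, `clone_batch` is a hypothetical helper name, and it is not the service's own batch cloning feature, whose interface is not shown on this page.

```python
def clone_batch(client, reference_path, texts, model="chatterbox", name="Batch Voice"):
    """Generate one audio clip per text by calling the SDK once per item.

    `client` is assumed to expose clone_voice(...) and poll_result(uuid)
    as in the Python example above; nothing else about the SDK is assumed.
    """
    clips = []
    for text in texts:
        result = client.clone_voice(
            name=name,
            file=reference_path,  # same reference clip for every text
            model=model,
            text=text,
        )
        # poll_result is assumed to return the finished audio as bytes.
        clips.append(client.poll_result(result.uuid))
    return clips
```

Usage would look like `clips = clone_batch(client, "reference.wav", ["First line.", "Second line."])`, after which each item can be written to its own file. The loop runs sequentially, so total time grows with the number of texts.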

Tips for the Best Voice Cloning Results

Get the most accurate voice clone with these recording tips

Quiet Environment

Record in a quiet room with as little background noise as possible. The AI extracts voice characteristics more accurately from clean audio.

10-30 Seconds

Although 5 seconds works, 10-30 seconds gives better results. The more natural speech the AI hears, the more accurate the clone.

Natural Speech

Speak naturally rather than reciting. Include varied intonation and pacing. The AI captures your natural speaking style, including pauses and emphasis.

Single Speaker

Use a sample with only one person speaking. Multiple voices confuse the speaker embedding extraction and produce blended results.
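
Some of these tips can be checked before uploading. The sketch below uses Python's standard `wave` module to inspect a WAV reference clip against the figures given on this page (5 seconds minimum, 10-30 seconds recommended, 16kHz or higher). The `check_reference` name and the warning wording are hypothetical, and the check cannot detect background noise or multiple speakers.

```python
import wave


def check_reference(path, min_seconds=5.0, recommended=(10.0, 30.0), min_rate=16000):
    """Return a list of warnings for a WAV reference clip (an empty list means no issues found)."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        duration = wav.getnframes() / rate

    warnings = []
    if duration < min_seconds:
        warnings.append(f"too short: {duration:.1f}s, need at least {min_seconds:.0f}s")
    elif not recommended[0] <= duration <= recommended[1]:
        warnings.append(
            f"outside recommended range: {duration:.1f}s, "
            f"aim for {recommended[0]:.0f}-{recommended[1]:.0f}s"
        )
    if rate < min_rate:
        warnings.append(f"low sample rate: {rate} Hz, use {min_rate} Hz or higher")
    return warnings
```

Calling `check_reference("reference.wav")` on a 12-second, 16kHz recording returns an empty list, while a short or low-rate clip returns one message per problem.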

Start Cloning Voices Today

Upload 5 seconds of audio and hear your cloned voice in under 30 seconds. Free to try.


Frequently Asked Questions

Common questions about real-time voice cloning

Real-time voice cloning is an AI technology that can clone a person's voice from a short audio sample, as little as 5 seconds, with no training or fine-tuning. You upload a sample, and the AI generates new speech that sounds like that person. TTS.ai offers 9 different voice cloning models, each with different strengths in quality, speed, and language support.

As little as 5 seconds works with most models (Chatterbox, CosyVoice 2, Spark, GPT-SoVITS, OpenVoice). Tortoise needs 15+ seconds for best results. For the best quality across all models, 10-30 seconds of clear, single-speaker audio is recommended. The audio should be free of background noise and music.

Voice cloning technology is legal in itself. However, you should only clone voices you have permission to use: your own voice, voices you have consent for, or voices in the public domain. Using voice cloning to impersonate someone without consent, to commit fraud, or to create deceptive content is illegal in most jurisdictions. TTS.ai's terms require that you hold the rights to any voice you clone.

It depends on your use case. Chatterbox produces the highest-quality English clones, with emotion control. CosyVoice 2 is best for multilingual cloning (Chinese, English, Japanese, Korean). Spark is the fastest at ~12 seconds. Tortoise produces studio-quality results but is slower. GPT-SoVITS excels at Chinese voice cloning. Try several models to find the best fit for your voice.

Yes, this is called cross-language voice cloning. CosyVoice 2, Qwen3-TTS, and OpenVoice support it. For example, you can upload an English voice sample and generate speech in Chinese, Japanese, or Korean while preserving the speaker's vocal characteristics. Quality varies by model and language pair.

The CorentinJ/Real-Time-Voice-Cloning GitHub project (60K+ stars) uses SV2TTS, a 2019 architecture. Although it was notable at the time, newer models such as Chatterbox, CosyVoice 2, and GPT-SoVITS produce better audio quality and better speaker similarity. TTS.ai runs 9 state-of-the-art models (vs. SV2TTS alone) and requires no GPU setup: just upload and clone.

Yes. TTS.ai provides a REST API for voice cloning. Upload reference audio and text, choose a model, and receive cloned speech. It is available through the Python SDK (`pip install ttsai`), the JavaScript SDK (`npm install @ttsainpm/ttsai`), or direct HTTP requests. Batch cloning is supported for processing many texts with the same voice.

After cloning, the voice is saved to your account and can be reused for unlimited generations without re-uploading the reference audio. Saved voices appear in your voice library on the voice cloning page and are also accessible through the API.

WAV, MP3, OGG, FLAC, and WebM are all supported. You can also record directly in your browser using the built-in microphone recorder. For best results, use lossless WAV at 16kHz or higher. The AI preprocesses the audio automatically (resampling, noise filtering) regardless of the input format.

Generation time varies by model: Spark is fastest at ~12 seconds, followed by OpenVoice at ~15 seconds, GPT-SoVITS at ~16 seconds, CosyVoice 2 at ~20 seconds, Chatterbox at ~21 seconds, and Tortoise at ~60 seconds. These times are for typical short text. Longer texts take longer.

Yes. All 9 cloning models on TTS.ai use open-source licenses (MIT or Apache 2.0) that permit commercial use. You can use cloned audio in YouTube videos, podcasts, audiobooks, apps, games, phone systems, and other commercial applications, provided you hold the rights to the source voice.

Yes. Every model we run is open source and available on GitHub/HuggingFace. You can self-host Chatterbox, CosyVoice 2, GPT-SoVITS, OpenVoice, Spark, IndexTTS-2, GLM-TTS, Qwen3-TTS, or Tortoise on your own GPU hardware. Most models require an NVIDIA GPU with 4-24GB of VRAM, depending on the model. TTS.ai manages all the infrastructure so you don't have to.

Clone a Voice in Seconds

9 open-source voice cloning models. 5-second samples. No training required. Try it free: upload your audio and hear the clone right away.