Ukuthetha-thethana

Tshintsha isandi esithethayo — tshintsha ilizwi, umnqweno, ulwimi, nesitayile ngelixa ugcina okuqulethwe kuqala.

Asikho nasiphi na isandi se-TTS kwisiNgesi sakho. Nceda uncedo lwethu ukongeza isandi sakho! Intengiso yelizwi lakho

Imvelaphi yesandi

Rhweba ngaphandle amanqaku encwadi ye Mozilla Khangela

Upload your speech recording. MP3, WAV, FLAC, OGG. Max 50MB.

file.mp3

0 MB
— okanye urekhode ilizwi lakho —
00:00

Izicwangciso Zoguqulelo

Rhweba ngaphandle amanqaku encwadi ye Mozilla Khangela

Upload a reference of the target voice. 10-30 sec recommended.

file.mp3

0 MB

I-Result

Layisha phezulu umyalezo wesandi, khetha uguqulelo lwakho, kwaye unqakraze uguqulelo ukuqalisa

Uguqulelo lokuthetha... Oku kungathatha ixesha.

& Ifayile elandelayo

Iguqulwe

Indlela esebenza ngayo

1. Layisha phezulu Ukuthetha

Rekoda okanye ulayishe isandi ofuna ukusiguqula

2. Khetha Uguqulelo

Khetha utshintsho lwesandi, utshintshiselwano lohlobo, okanye uguqulelo lwelwimi

3. Uguqulelo lwe-AI

I-AI iqhubekekisa umsindo ukusuka ekupheleni ukuya ekupheleni igcina imixholo yokuthetha

Layishela phantsi egronjiweyo

Lindela isiphumo uze ulayishe ezantsi isandi sakho esiguqulweyo

Iimeko Zokusetyenziswa

Ukuthetha-ukuthetha kwizinto eziquletheyo, ukufikelela, kunye neeprojekthi ezinobuchule

Ukuphinda uphinde uphinde

I-Dub ividiyo kwezinye iilwimi ngelixa igcina iimpawu zokuqala zesandi somthumeli.

Ulungelelaniso lweempawu

Tshintsha into ebonisa ububele bemiboniso — yenza ukuba ukuthetha okuzolileyo kukhuthaze, okanye ukuthetha okungenakukhathazeka kube mhle nothandekayo.

Ukwenziwa kwesandi

Guqula ushicilelo lwesandi esingwevu sibe ziingoma ezigqityiweyo ezinesandi kunye nesitayile esihlukileyo.

I-Voice Anonymization

Ukwenza ukuba umthunywa akwazi ukuchazwa ngelixa egcina igama ngalinye, ukukhusela ukungena kweengxelo okanye ukhuseleko lwemfihlo.

Iimodeli zeSpeech to Speech

OpenVoice

Uguqulelo lwesandi olukhawulezayo nolawulo lwesitayile esithe nkqo. Tshintsha uphawu lwesandi, isantya, kunye neemotions kwimizuzwana.

  • Inkqubo ekhawulezayo
  • Utshintshiselwano Lohlobo
  • Cross-language

Chatterbox

Uklonelo lwesandi se-zero-shot ngolawulo lweemvakalelo ze-fine-grained ukusuka kwi-Resemble AI.

  • Ulawulo lweemvakalelo
  • I-Zero-shot cloning
  • Ubuxoki obuphezulu

CosyVoice 2

I-cross-language voice cloning kwiilwimi ezisibhozo nge-prosody eqhelekileyo ne-streaming support.

  • 8 Iilwimi
  • Uklonelo lwesandi
  • Unikezelo lwesandi

Imibuzo ebuzwa rhoqo

Ukuthetha-ukuthetha (STS) AI iguqula ushicilelo lwesandi oluthethayo lube yimveliso yokuthetha eyahlukileyo - itshintsha ulizwi, uhlobo, umnqweno, okanye ulwimi ngelixa igcina amagama aphambili kunye nexesha. Idibanisa ukuqonda kokuthetha, uqhubekeko, kunye nokwenziwa kwezinto eziphilayo kwindlela enye yokuhambisa.

Umbhalo uguqula umbhalo obhaliweyo ube yisandi. Ukuthetha- thethana luthatha isandi esisisiseko njengengeniso kwaye luguqula ngokuthe ngqo ibe yisandi entsha - ugcina umculo oqhelekileyo, izithuba, uxinzelelo, kunye nengqondo yoshicilelo olungagqibekanga kunokuba uvelise ulwimi ukusuka kumbhalo ocacileyo.

Iinkqubo eziqhelekileyo ziquka ukudubula iividiyo kwezinye iilwimi, ukuguqula umsindo womntu othethayo kwirekhodi, ukuhlela umnqweno okanye umsindo wesandi esisekhoyo, ukwenza iingoma eziphezulu zesandi kwirekhodi elingenanto, nokwenza ukuba irekhodi lesandi lingaziwayo ngelixa ligcina imixholo.

Voice conversion models like OpenVoice and RVC handle voice-to-voice transformation. For cross-lingual speech to speech, CosyVoice 2 and GPT-SoVITS can clone and re-synthesize in a different language. Chatterbox also supports reference-audio-based synthesis.

Ewe. Usebenzisa iimodyuli zokudubula ilizwi, ungaguqula ukuthetha kwakho kwilwimi elahlukileyo ngelixa ugcina iimpawu zakho zelizwi. I-AI ikhupha uphawu lwelizwi lakho kwaye iphinde iphinde iguqule isandi kwilwimi elibekiweyo okanye kwindlela.

Inkqubo yendlela yokuhambisa iqala ngokushicilela ulwimi lwakho, iguqulela umbhalo kwilwimi elibekiweyo, emva koko isebenzisa ukuclonelwa kwelizwi ukuze liguqulwe kwilizwi lakho eliqhelekileyo. Iimodeli ezifana ne CosyVoice 2 zixhasa ulwimi olusibhozo lokuguqulwa kwelizwi.

Ukufumana iziphumo ezilungileyo, ulayishe umculo ococekileyo onesandi esingenanto. I-WAV okanye i-FLAC kwi-16kHz okanye ngaphezulu isebenza kakuhle. I-MP3, i-OGG, i-M4A, kunye ne-WEBM zivunyelwe. Ukuthetha okucacileyo kuvelisa utshintsho oluchanekileyo kakhulu.

Uqhubekeko lwexesha elifutshane lifumaneka nge-API yethu usebenzisa iimodeli ezikhawulezayo ezinjengeKokoro yokwenziwa kunye neFaster Whisper yokwahlula. Ukuphuma kwexesha kuxhomekeke kwimodeli kunye nobude besandi, kodwa i-sub-3-second turnarounds ifumanekayo kwiintlanganiso ezifutshane.

Ewe. Iimodeli ezinjenge Chatterbox, Spark TTS, ne IndexTTS- 2 zixhasa uvakalelo kunye nolawulo lwesitayile. Ungaguqula ukuthetha okuphumlayo kube luthando, kube lubuhlungu, okanye kube lumnandi ngelixa ugcina amagama afanayo nobuqu bomntu othethayo.

Ukuthetha-ukuthetha kudibanisa ukuqonda kunye nokwenza uphawu. Uguqulelo oluqhelekileyo lwemizuzu emi-1 lusebenzisa uphawu olu-3,000-8,000 luxhomekeke kwimodeli ekhethiweyo. Imodeli ye-free-tier njenge Kokoro ingasetyenziswa kwinyathelo lokwenza uphawu ngexabiso elipheleleyo.

Abasebenzisi abakhululekileyo bangaqhubekekisa isandi ukuya kuthi ga kwimizuzu emi-1. Iinkqubo ezihlawulwayo zixhasa iifayile ukuya kuthi ga kwimizuzu emi-10. Ukukhupha okude, hlula isandi zibe ziinxalenye okanye sebenzisa i-API yethu yoqhubekeko lweqela ngaphandle komda wobude.

Ewe, zonke iiseva ze-GPU ezikhuselekileyo zisebenzisa iiseva zethu ze-GPU ezikhuselekileyo kwaye zicinywa ngokuzenzekelayo kwiintsuku ezi-24. Asiyi kusebenzisa iiseva zakho zokuqeqesha iimodyuli. Zonke izithuthi zisebenzisa uxhulumaniso olufihliweyo kwaye unxibelelwano lweseva-kwiseva luqinisekisiwe.
5.0/5 (1)

Yintoni esinokuyilungisa? Ulwazi lwakho olufunyenweyo lunceda silungise iingxaki.

Tshintsha nasiphi na isivakalisi nge-AI

Tshintsha ilizwi, iimvakalelo, ulwimi, kunye nesitayile. Bhalisa simahla kwaye ufumane amakhadi angama-50 ukuqalisa.