Dia TTS

Speaker 2

Emiselweyo IsiNgesi Neutral Dia TTS

{igama} yi neutral AI yelizwi elinamandla eDia TTS umbhalo-kwi-speech model. Eli standard-level ilizwi lithetha IsiNgesi kwaye linika studio-quality speech synthesis. Nge phezulu unikezelo lwesantya kunye nomgangatho womgangatho we 5/5, Speaker 2 ulungele podcasts, audiobook dialogues, conversational content. I-Dia TTS injini iphuhliswe ngu Nari Labs under the Apache 2.0 license, iyenza ikhuseleke kwimisebenzi yentengiso. Iinkqubo eziphambili ziquka: {iimpawu}.

Akukho manqaku

Dia TTSUlwazi lwemodeli

Imodeli Dia TTS
Umbhekisi phambili Nari Labs
Umgangatho
Isantya I-Media
Ilayisensi Apache 2.0
Ukuklona Ayifumaneki
I-Tier Eqhelekileyo (2x uphawu)
Iiparamitha 1.6B
Uyilo lwezindlu Transformer Autoregressive + DAC
Iminyaka 2024

Iinkqubo ezilungileyo zokusetyenziswa Speaker 2

Iinkqubo ezicetyiswayo ezisekelwe kwiimpawu zalo msindo

Iincwadi ezinesandi & Uxwebhu

Sebenzisa i {igama} ukuchaza imixholo yefom ende nge-prosody eqhelekileyo ne-expression.

Ividiyo

Yongeza ukuthetha okuzimeleyo kwiividiyo zeYouTube, iintengiso, kunye nemixholo yemidiya yoluntu.

Ipodcasts & Ukusasazwa

Imveliso elungileyo yestudio elungele iipodcasts, umculo, kunye nokusasazwa okuzimeleyo.

Ukufunda nge-e-mail & Uqeqesho

Yenza izinto zokuqeqesha ezibandakanyayo, izifundo, kunye nezinto eziqulethe ulwazi ngemiboniso ecacileyo ye-AI.

I-More Dia TTS IiNkokheli

Ezinye iingoma zemodeli efanayo ye-TTS

Speaker 1

IsiNgesi Neutral

Imibuzo ebuzwa rhoqo

I-Dia yi-Nari Labs yi 1. 6B parameter yombhalo- ukuya- ku- ulwimi lwemodeli eyenziwe ngokukodwa ukudala unxibelelwano lomthumeli- omkhulu. Iyakwazi ukudala unxibelelwano olunombala phakathi kwamathumeli amabini ngokuthatha umjikelo ofanelekileyo, i-prosody, kunye nokubonisa iimvakalelo. I-Dia igqibelele ukudala imixholo yohlobo lwepodcast, unxibelelwano lweencwadi zesandi, kunye ne-AI ethetha- thetha.

I-Dia TTS yaphuhliswa yi-Nari Labs kwaye ikhutshwe phantsi kwelayisensi ye-Apache 2. 0, evumela ukusetyenziswa kwentengiso kwesandi esiveliswe.

I-Dia TTS ixhasa ulwimi 1: isiNgesi.

I-Dia TTS ikwinqanaba eliqhelekileyo — ii-credits ezi-2 ngabasebenzi aba-1,000. Ungajonga ngaphambili nayiphi na i-Dia TTS yelizwi ngaphandle kokuvelisa isandi esipheleleyo.

I-Dia TTS inesantya sokwakha esiphakathi. Ukwakha kuthatha imizuzwana emininzi ngokuxhomekeke kubude bombhalo.

I-Dia TTS ifunyenwe 5/5 kwi-audio quality kwi-TTS.ai. Inikezela nge-studio-grade, ukuthetha okunjalo nomuntu.

Hayi, iDia TTS isebenzisa iseti emiselweyo yelizwi elifakwe ngaphakathi. Ukwenza ilizwi lifana, zama iimodeli ezifana neCosyVoice 2, GPT-SoVITS, okanye i-Chatterbox.

Ewe, i Dia TTS icetyiswa ngokukodwa kwipodcasts, iincoko zencwadi yesandi, imixholo yencoko. Umthumeli wayo oninzi, ukwenziwa kwencoko, ukuthatha ukhetho oluqhelekileyo olugqibeleleyo kule meko yokusetyenziswa.

Ewe, iDia TTS ilayisensiwe phantsi kwe Apache 2. 0, evumela ukusetyenziswa korhwebo. Isandi esiveliswe ngee-Dia TTS zesandi zingasetyenziswa kwiividiyo, iipodcasts, iinkqubo, imidlalo, nakweyiphi na enye iprojekthi yorhwebo.

Ewe, zonke iingoma kwi-TTS.ai zisebenzisa iimodyuli ze-open-source ezilayisensiweyo ngentengiso (MIT, Apache 2.0). Isandi esiveliswe yiyo yakho ukuyisebenzisa kwividiyo, iipodcasts, iiapps, imidlalo, nakweyiphi na enye inkqubo yentengiso.

Thumela isicelo se POST ku /api/v1/tts/ ngegama lemodeli ne ID yesandi. Bona iphepha lethu le-API Documentation ngemizekelo yekhowudi kwi-Python, JavaScript, Go, kunye ne-cURL.

Ewe, nqakraza iqhosha lokudlala kweli phepha ukuva isampuli. Ungabhala umbhalo oqhelekileyo kwiphepha lombhalo ukuya kukuthetha kwaye wenze ukujonga kuqala simahla ngelizwi elithile.

Zama Speaker 2 Ngoku

Bhala nawuphi na umbhalo uze uyiva ithetha ngu Speaker 2. Ifumaneka simahla.