Umenzi Wevidiyo We-AI Lip Sync
Layisha phezulu umfanekiso we-face kunye ne-audio clip - fumana ividiyo ye-voice-head ene-realistic lip sync, i-head pose, kunye ne-blinks. Isebenza nge-SadTalker (MIT). Ukusetyenziswa kwentengiso OK.
Layisha phezulu i Face + Audio
1, 000 iimpawu ngomzuzwanaRhweba ngaphandle amanqaku encwadi ye Mozilla Khangela
JPG, PNG, or short MP4/WebM. Max 10MB. One clear, well-lit face works best.ifayili.mp3
0 MBRhweba ngaphandle amanqaku encwadi ye Mozilla Khangela
MP3, WAV, M4A, or FLAC. Max 10MB. Free: up to 30 sec. Pro: up to 5 min.ifayili.mp3
0 MBIbonisa ividiyo yakho. Oku kuthathelwa ingqalelo ukuba kuthatha imizuzwana engama-30 ukuya kwemizuzwana emibini.
Ividiyo yakho ethetha-i-ngqwalasela
Malunga ne SadTalker
I-SadTalker (CVPR 2023, Tencent ARC) yimodeli ye-open-source ye-talk-head eyenza umfanekiso we-face ofanayo usebenze ukuthetha nayiphi na i-audio. Ngokungafaniyo ne-Wav2Lip, i-SadTalker iyenza i-head pose, i-blinks, kunye ne-expression isebenze ukubonelela nge-outcome eninzi ebonakalayo.
Ikhowudi kunye nesisindo zisemthethweni kwi MIT ukusuka ekuqaleni ukuya kukuphela — akukho Llama, Gemma, okanye i-non-commercial backbone — ngoko iividiyo ozivelisayo zikhuselekile kwi-commercial use.
Iingcebiso zeziphumo ezilungileyo
- Sebenzisa umfanekiso ophezulu womgangatho, okhanyayo — amehlo abonakala, umlomo uvale
- Ubuso obuphakathi, isikwere okanye 4:5 uthelekiso lwe-aspect lusebenza kakuhle kakhulu
- Ukuthetha okucocekileyo (akukho mculo) kunika ukulungelelaniswa kweliphu okuqinileyo
- Yenza i-GFPGAN isebenze kwi-hero shots - iphindaphindwe ixesha lokuveza kodwa ikhawuleza inkcukacha
- Sebenzisa i-Still preset xa ufuna umfanekiso okhawulezayo okhawulezayo
Iinkqubo zevidiyo ze-Lip Sync
Qala ngokukhululekileyo, uphucule xa ufuna okuninzi
- Umda wesandi wemizuzu engama-30
- 256 px imveliso
- "Isilele" kuphela
- Akukho mfanekiso okhawulezayo
- Umda wesandi wemizuzu engama-30
- Zonke ii-"full" kunye ne-"still" ezimiselweyo
- 256 / 512 px imveliso
- GFPGAN ukuphucula i-face
- Umda wesandi wemizuzu emi-5
- Ufolo lwe-GPU oluphambili
- Ufikelelo lwe-API (ukukhuphela iinxalenye ezininzi)
- I-Webhook igqiba ukubiza kwakhona
- Ukusetyenziswa kwentengiso (ilayisensi yeMIT)
Imibuzo ebuzwa rhoqo
Yintoni esinokuyilungisa? Ulwazi lwakho olufunyenweyo lunceda silungise iingxaki.
Ilungile ukuqalisa?
Ubhaliso simahla kwaye ufumane 15,000 iimpawu. Akukho khadi letyala lifunekayo.