Report Bug / Feature Request

Ukukhuluma kuMbhalo

Bhala umsindo kanye nevidiyo ibe ngumbhalo nge-AI. Ixhasa izilimi ezingu-99, ama-timestamps, kanye nokuthola umsindo.

Asikho isikhulumi se-TTS ezweni lakho. Sicela usize ukungeza isandla sakho! Uhlu lwamagama

Layisha umsindo noma ividiyo

Thwebula bese ushiya ihele lakho lapha, noma bheka

Isekela i-MP3, WAV, FLAC, OGG, M4A, MP4, WebM. Max 100MB.

file.mp3

0 MB
— noma urekhode kusuka ku-microphone yakho —
00:00

Izilungiselelo

1,000/min amaphawu Bhala to track usage

Ukudluliswa

Layisha phezulu ifayela lomsindo bese uchofoza u-Transcribe ukuqala

Ukudlulisa umsindo... lokhu kungathatha isikhathi.

Kutholakala:

Indlela esebenza ngayo

Layisha umsindo

Layisha phezulu ifayela lakho lomsindo noma levidiyo. Sixhasa amafomethi we-MP3, WAV, FLAC, OGG, M4A, MP4, ne-WebM kuya ku-100MB.

2. Ama-AI Transcribes

Imodeli yethu ye-AI ihlela umsindo wakho, ithola ulwimi, ikhomba abakhulumayo, futhi ikhiqize umbhalo ofanele nesikhathi.

3. Thola umbhalo wakho

Kopela ukuguqulelwa kwakho noma ulayishe njenge-TXT noma i-SRT subtitle format. Hlela futhi uthuthukise njengoba kudingeka.

Sebenzisa izimo

Ukukhuluma umbhalo kunoma iyiphi imboni nokuhamba komsebenzi

Izingqungquthela

Ukuguqulela ngokuzenzakalela i-Zoom, amaqembu, kanye ne-Google Meet recordings. Ungase ushiye ingxenye yemisebenzi. Rhweba ngaphandle njengeziphawuli zengqungquthela noma izihloko.

Ukuxhumana nomphakathi

Ukubhala izinhlanganiso zezindaba, izincwadi zocwaningo, namadokhumende. Ukubhala izinhlanganiso zomsindo ukhomba ukuthi ngubani okhuluma yini ukuze kube lula ukuphawula.

Amapodcast nama-media

Dala izixhumanisi futhi uveze amabhukwana epodcasts. Dala amafayela atholakali wezinto zakho zomsindo. Engeza izihloko zevidiyo kupodcasts.

Ukufundisa nokufundiswa

Guqula izifundo ezirekhodiwe zibe izingcaphuno zokufunda. Yenza okuqukethwe kwemfundo kufinyeleleke ngesihloko esifanele. Sixhase abafundi abanezinkinga zokulalela.

Ukubhala ngemithi

Bhala kabusha iziguli-udokotela, amabhukwana eklinikhi, kanye nokucwaswa kwemithi. Gcina amahora encwadini yesandla ngempumelelo enamandla we-AI.

Izinqumo zomthetho

Bhala iziphakamiso, izingqungquthela, kanye nezingqungquthela zekhasimende. Isikhathi esifanele sokwethula. Rhweba ngaphandle ngefomethi efanelekayo yedokhumende lenkantolo.

Ukuqhathaniswa kwemodeli ye-STT

Whisper

Imodeli yokwazisa ulwimi oluqinile lwe-OpenAI oluxhasa ulwimi olungu-99.

  • Izilimi
  • Ukuhumusha
  • Ama-timestamps
  • Ukumelana nezingcingo
OpenAI

Faster Whisper

4x ngokushesha kune Whisper nge CTranslate2 optimization, ngokunemba okufanayo.

  • 4x ngokushesha
  • Inkumbulo ephansi
  • Zonke izilinganiso zemodeli
  • Uhlelo lwe-batch
  • Isihlungi se-VAD
SYSTRAN

SenseVoice

Imodeli yokuqonda umlayezo ngokuthola imizwa, izilimi ezingaphezu kuka-50.

  • Izilimi ezingaphezu kuka-50
  • Ukukhomba imizwa
  • Izinhlamvu zomsindo
  • Ucwaningo lomsindo
  • I-metadata eminingi
Alibaba (FunAudioLLM)

Ukukhuluma-noMbhalo

Qala ngokukhululekile, uthuthukise uma ufuna okuningi

Ikhululekile
  • Iminithi 1 umkhawulo womsindo
  • Imodeli ye-Faster Whisper
  • Ukuhumusha okujwayelekile
  • Izilimi ezingaphezu kuka-100
Okuthandwa kakhulu
I-akhawunti Ekhululekile
  • 30-minute audio + 15,000 characters
  • Zonke imodeli ye-STT
  • Igama-level timestamps
  • I-SRT & VTT subtitle export
  • Isikhulumi
Bhala
I-Pro
  • Amafayela omsindo wehora le-2
  • Ukuhunyushwa kweqembu
  • Ukuphathwa kwesinqumo
  • Ukufinyelela kwe-API
  • Igama lokuchaza amagama elijwayelekile
Ukulungiswa

Imibuzo ebuzwa kaningi

Ukukhuluma ku mbhalo (STT), futhi kubizwa ngokuthi ukuphawula ngokuzenzakalela kwezwi (ASR), kuguqula ulwimi olukhulumayo lube ngumbhalo obhalwe. Amamodeli ethu asebenzisa i-AI ukuhlela ngokucophelela umsindo kusuka emicimbini, emibukiso, emicimbini, emibukiso, nezinye izinto.

I-Faster Whisper ikhuthazwa kulezi zinkinga eziningi — ihamba ngokushesha kune-Whisper yakudala ngamaphesenti angama-4 ngenkathi igcina ukuthembeka okufanayo. Sebenzisa i-SenseVoice uma ufuna ukukhomba imizwa noma ukukhomba inkinga yomsindo kanye nokudluliswa.

Sixhasa amafomethi we-MP3, WAV, M4A, OGG, FLAC, WEBM, kanye namafomethi omsindo/wevidiyo ajwayelekile. Ubukhulu obuphezulu befayela yi-50MB. Uma uhlela amafayela amakhulu, cabanga ngokuhlukanisa umsindo kuqala.

Abasebenzisi abamahhala bangashicilela imizuzwana engu-5 yomsindo. Ama-plans akhokhelwayo axhasa amafayela omsindo angafinyelela emahorani angama-2. Ukufaka okude, sebenzisa i-API yethu ngokusebenza kwe-batch.

Imodeli yethu ifinyelela ku-95% + ukunemba kokukhuluma isiNgisi esicacile. Ukunemba kuhluka ngokwesilimi, umgangatho wesandi, kanye ne-background noise. Faster Whisper ne Whisper zixhasa izilimi ezingu-99 ngezinga lokunemba elihlukile.

Yebo, izindlela zethu ezithuthukisiwe zokudlulisa zikwazi ukukhomba futhi zibeke isihloko abakhulumayo abahlukene emisindo. Ukudlulisa abakhulumayo kubaluleke kakhulu ekubhaleni izingqungquthela, izingqungquthela, kanye ne-podcasts yabantu abaningi lapho ufuna ukwazi ukuthi ngubani okhulumayo.

Isikhathi sangempela sokudlulisa ukudluliswa kutholakala ngokusebenzisa i-API yethu usebenzisa i-Faster Whisper. Umsindo uphathwa ngama-chunks njengoba ufika, unikeza ukudluliswa kwengxenye nge-latency ephansi. Lokhu kufanelekile ukudluliswa kwesihloko esiphilayo kanye nokuthatha izinhlamvu zesikhathi sangempela.

Yebo, i-transliteration output yethu ifaka phakathi ama-timestamps egama-level angakhishwa njenge-SRT, VTT, noma amafayela we-ASS subtitle. Le yindlela engcono kakhulu yokufaka izihloko ku-YouTube videos, izifundo ze-online, kanye ne-social media content.

Yebo, zonke iziphumo zokuguqulela zifaka phakathi ama-timestamps ezinga lengxenye ngokuzenzakalela. Ama-timestamps ezinga legama akhona futhi, abonisa isikhathi sokuqala nokuphela kwegama ngalinye kumsindo.

I-Faster Whisper iqeqeshwe emisindo eminingi futhi iphatha izingcingo zesizinda eziphakathi kahle. Ukufaka iziqophi ezingenalutho, sicebisa ukuthi usebenzise umsindo nge-Audio Enhancer yethu kuqala ukuze uthuthukise ucacile ngaphambi kokufaka.

Yebo, amafayela omsindo alayishwe phezulu agcinwa kumaseva ethu aphephile we-GPU futhi asuswa ngokuzenzakalela ngemuva kokuba ukudluliswa kuqediwe. Asigcinanga, asihlukanisi, noma sisebenzisa umsindo wakho ngezinhloso zokuqeqesha. Zonke izidluliselo zibhalwe ngokufihliwe.

Abasebenzisi abamahhala bangashicilela imizuzwana engu-5 yomsindo ngaphandle kwezindleko. Ama-plans akhokhelwayo asebenzisa amaphawu asekelwe kusikhathi sokusebenza komsindo: cishe amaphawu angama-1,000 ngomzuzu womsindo. Khangela ikhasi lethu lokukhokha ukuze uthole imininingwane eminingi ye-plan kanye nama-packs wezimpawu.
5.0/5 (1)

Yini esingayithuthukisa? Umbono wakho usiza ukuxazulula izinkinga.

Bhala umsindo nge-AI

Thola ukudluliswa okulungile kwezilimi ezingu-99. Bhala ngokumahhala futhi uthole izibonakaliso ezingu-15,000 zokuqalisa.