VibeVoice

Speaker 4

@ action @ item Spelling dictionary Neutral VibeVoice

{nama} shine wani sauti na neutral AI wanda aka sarrafa shi da siffar rubutu zuwa magana {mai siffar}. Wannan sauti {mai siffar} yana magana da {harshe} kuma yana samar da ƙimar {ƙimar} na ƙimar ƙirar magana. Da sauri mai samar da QDialogButtonBox da kuma darajar ingancin 5/5, Speaker 4 yana da kyau ga podcasts, dialogues, long-form narration, multi-speaker content. An yi amfani da injin VibeVoice ta hanyar Microsoft under the MIT license, wanda ya sa shi amintacce ga amfanin kasuwanci. Ma'aunin da ake bukata sun hada da: multi-speaker, long-form (90 min), podcast generation, dialogue, low latency.

QSql

VibeVoiceQPrintPreviewDialog

@ action VibeVoice
Mawallafi Microsoft
QPrintPreviewDialog
QSoftKeyManager QPrintPreviewDialog
QFileDialog MIT
@ action QDialogButtonBox
DakataEthiopian month 11 - LongNamePossessive @ item font
Parameters 1.5B
KCharselect unicode block name LLM + DAC
QPrintPreviewDialog 100000 hours
@ option next month 2025

Mafi kyawun amfani da lokuta don Speaker 4

Shiryoyin Ayuka da aka shawarta bisa ga halayen wannan sauti

QShortcut

Yi amfani da {nama} don ka faɗi abun cikin da ya yi tsawo da prosody da maganar da ke da asali.

KCharselect unicode block name

Ƙara bayani mai sana'a ga bidiyo na YouTube, tallace-tallace, da kuma abun ciki na kafofin watsa labaru na zamantakewa.

Shiryoyin Ayuka & Masu Sauki

Yiwa wannan sauti kwafi mai sauri yana sa shi ya zama mafi kyau ga shiryoyin ayuka na lokaci-da-lokaci, masu karatun fuskar kwamfyuta, da kayan aikin canzawa.

Podcasts & Broadcasts

Studio-quality output suitable for podcasts, radio, and professional broadcasting.

QPrintPreviewDialog VibeVoice QShortcut

KCharselect unicode block name

Speaker 1

@ item Spelling dictionary Neutral

Speaker 1 (Chinese)

@ item Spelling dictionary Neutral

Speaker 2

@ item Spelling dictionary Neutral

Speaker 2 (Chinese)

@ item Spelling dictionary Neutral

Speaker 3

@ item Spelling dictionary Neutral

Tambayar da ake yi da yawa

VibeVoice by Microsoft comes in two variants: a 1.5B model for long-form content (up to 90 minutes, 4 speakers) and a Realtime 0.5B model for streaming with ~200ms first audio latency. The 1.5B variant excels at podcasts and audiobooks with speaker consistency over long passages. Note: Microsoft removed TTS code from the repository and generated audio includes audible AI disclaimers.

VibeVoice an gina shi ne da Microsoft kuma an saki shi a karkashin lasisi na MIT (inshorar bincike kawai), wanda ke ba da damar amfani da kasuwanci na sauti da aka samar.

VibeVoice yana goyon bayan harshe 1: Ingilishi.

VibeVoice yana cikin mataki na Premium — 4 credits a kowace 1,000 characters. Za ka iya gani na gaba na kowanne sauti na VibeVoice kyauta kafin ka samar da sauti mai cike.

VibeVoice yana da sauri mai tsawo wajen samarwa. Shiryawar tana ɗaukar sakan kaɗan dangane da tsawon rubutu.

VibeVoice an ba shi darajar 5/5 don ingancin sauti a kan TTS.ai. Yana bayar da magana mai ingancin studio, kamar na mutum.

Ba haka ba, VibeVoice na amfani da ƙungiya mai daidaituwa na sauti masu ƙunshe. Don ƙirƙirar sauti, ka yi kokarin nau'ikan kamar CosyVoice 2, GPT-SoVITS, ko Chatterbox.

Ya, VibeVoice an shawarce shi musamman don podcasts, littattafai masu sauti, da kuma abun ciki mai yawa na masu magana da yawa. Yana da masu magana da yawa, har zuwa minti 90, da kuma iyakar samar da podcasts, suna sa shi zama zaɓi mai kyau ga wannan amfani da yanayin.

Na'am, VibeVoice an ba da lasisi a karkashin MIT (mai nufin bincike kawai), wanda ke ba da damar amfanin kasuwanci. Za'a iya amfani da sauti da aka samar da sauti na VibeVoice a cikin bidiyo, podcasts, aikace-aikace, wasanni, da kuma duk wani shirin kasuwanci.

Na'am, duk sauti a kan TTS.ai suna amfani da nau'ikan ma'ana-farare-da-lasisi-na-kasuwar (MIT, Apache 2.0). Sauti da aka samar ita ce ta ka don amfani da ita a cikin bidiyo, podcasts, aikace-aikace, wasanni, da duk wani shirin ayuka na kasuwanci.

Aika da umarnin POST zuwa /api/v1/tts/ tare da sunan sigar da kuma shaidar magana. Ka duba shafinmu na takardun shaidar API don misalin alamun shafi a cikin Python, JavaScript, Go, da kuma cURL.

Na'am, danna maɓallin wasa a wannan shafi don jin misali. Za ka iya kuma rubuta rubutun ɗabi'a a cikin shafi na rubutu zuwa magana kuma ka samar da gani na gaba da kowacce magana.

@ action Speaker 4 @ action

Taɓa duk wani rubutu kuma ka ji shi an faɗa da Speaker 4. Free to use.