ڦيٿي / خاصيت جي درخواست رپورٽ ڪريو

CosyVoice 2 TTS

Alibaba Tongyi Lab's streaming TTS reaching human-parity naturalness with near-zero latency and zero-shot cloning.

0/500 نشان · 5000 جي هر نسل لاء رجسٽر →

رجسٽر ٿيو 5000 ڪارڪنن جي حد

SSML ريت (سيٽنگ ڪنٽرول لاءِ ڳالهائڻ جي سنٿسيٽ مارڪ اپ ٻوليName)

صحيح ڪنٽرول لاءِ پنھنجو متن SSML ٽيگ ۾ ويڙھيو:

<speak><prosody rate="slow">Slow speech</prosody></speak>

احساس / انداز ٽيگ

ٽيگ جيڪي چونڊيل ماڊل سمجھي ٿو - هڪ کي پنھنجي متن ۾ جتي ٿئي ٿو ڦيريڻ لاءِ ڪلڪ ڪريو:

پڙھڻ جي لغت

پنھنجو آواز بيان ڪريو (شيء = آواز):

پيچ 0

-12 +12

AI ماڊل

آواز

ٻولي

اخراج جي شڪل

رفتار 1.0x

0.5x 2.0x

پيپر، VITS، MeloTTS سان مفت

پنھنجو ٺاھيل آڊيو اتي نظر ايندو. ھڪ ماڊل چونڊيو، متن داخل ڪريو ۽ ٺاھڻ دٻايو.

بابت CosyVoice 2

CosyVoice 2, from Alibaba's Tongyi Lab, was designed to make high-quality speech viable in real time. It uses a finite scalar quantization approach combined with flow matching to support streaming synthesis at extremely low latency, while reaching human-comparable naturalness that outperforms many commercial systems in subjective tests. Beyond quality, it offers zero-shot voice cloning from about 3 seconds of audio, cross-lingual synthesis, and fine-grained emotion control. Covering 8 languages with a 1,000-character cap, it's a strong fit for voice assistants, streaming TTS, and other real-time applications.

بهترين: Real-time applications, streaming TTS, voice assistants

سڀ لکو CosyVoice 2 آواز

هڪ نظر ۾

ڊيولپر: Alibaba (Tongyi Lab)
لائسنس: Apache 2.0
جانور: standard
رفتار: medium
آواز جو کلون: ھائو
ٻوليون: English, Chinese, Japanese, Korean, French, German, Italian, Spanish
وڌيڪ نشان: 1000

CosyVoice 2 آواز

Chinese Female

Chinese

معياري Female

Chinese Male

Chinese

معياري Male

English Female

English

معياري Female

English Male

English

معياري Male

French Female

French

معياري Female

German Female

German

معياري Female

Italian Female

Italian

معياري Female

Japanese Female

Japanese

معياري Female

Korean Female

Korean

معياري Female

Spanish Female

Spanish

معياري Female

CosyVoice 2 TTS - پڇا ڳاڇا

Yes. CosyVoice 2 uses finite scalar quantization for streaming synthesis at very low latency, which is what makes it suitable for voice assistants and real-time applications.

Yes. It offers zero-shot voice cloning from roughly 3 seconds of reference audio, plus cross-lingual synthesis and emotion control.

Yes. CosyVoice 2 is Apache 2.0 licensed. It supports 8 languages: English, Chinese, Japanese, Korean, French, German, Italian, and Spanish.

← سڀ آواز

CosyVoice 2 TTS

TTS.ai کي پيارو آهي؟ پنھنجن دوستن کي چئو!

بابت CosyVoice 2

هڪ نظر ۾

CosyVoice 2 آواز

Chinese Female

Chinese Male

English Female

English Male

French Female

German Female

Italian Female

Japanese Female

Korean Female

Spanish Female

CosyVoice 2 TTS - پڇا ڳاڇا

Can CosyVoice 2 stream audio in real time?

Does CosyVoice 2 support voice cloning?

Is CosyVoice 2 free for commercial use?