ક્ષતિનો અહેવાલ આપો / લક્ષણ વિનંતી

CosyVoice 2 TTS

Alibaba Tongyi Lab's streaming TTS reaching human-parity naturalness with near-zero latency and zero-shot cloning.

0/500 અક્ષરો · 5,000 પ્રતિ પેઢી માટે નોંધણી કરો →

નોંધણી કરો ૫,૦૦૦ અક્ષરોની મર્યાદા માટે

SSML સ્થિતિ (સારા નિયંત્રણ માટે દ્રશ્ય સંયોજન માર્કઅપ ભાષાName)

ચોક્કસ નિયંત્રણ માટે SSML ટેગોમાં તમારું લખાણ લપેટો:

<speak><prosody rate="slow">Slow speech</prosody></speak>

લાગણી / શૈલી ટેગો

પસંદ કરેલ મોડેલ સમજે છે તે ટેગ્સ - તમારા લખાણમાં એકને મૂકવા માટે ક્લિક કરો જ્યાં તે થાય છે:

ઉચ્ચારણ શબ્દકોશ

વૈવિધ્યપૂર્ણ ઉચ્ચારણો વ્યાખ્યાયિત કરો (શબ્દ = ઉચ્ચારણ):

પીચ 0

-12 +12

AI મોડેલ

અવાજ

ભાષા

આઉટપુટ બંધારણ

ઝડપ 1.0x

0.5x 2.0x

Piper, VITS, MeloTTS સાથે મુક્ત

તમારું ઉત્પન્ન થયેલ ઓડિયો અહીં દેખાશે. મોડેલ પસંદ કરો, લખાણ દાખલ કરો, અને ઉત્પન્ન કરો પર ક્લિક કરો.

વિશે CosyVoice 2

CosyVoice 2, from Alibaba's Tongyi Lab, was designed to make high-quality speech viable in real time. It uses a finite scalar quantization approach combined with flow matching to support streaming synthesis at extremely low latency, while reaching human-comparable naturalness that outperforms many commercial systems in subjective tests. Beyond quality, it offers zero-shot voice cloning from about 3 seconds of audio, cross-lingual synthesis, and fine-grained emotion control. Covering 8 languages with a 1,000-character cap, it's a strong fit for voice assistants, streaming TTS, and other real-time applications.

માટે શ્રેષ્ઠ: Real-time applications, streaming TTS, voice assistants

બધું બ્રાઉઝ કરો CosyVoice 2 અવાજો

એક નજરમાં

ડેવલોપર: Alibaba (Tongyi Lab)
લાઇસન્સ: Apache 2.0
તીર: standard
ઝડપ: medium
અવાજ ક્લોનિંગ: હા
ભાષાઓ: English, Chinese, Japanese, Korean, French, German, Italian, Spanish
મહત્તમ અક્ષરો: 1000

CosyVoice 2 અવાજો

Chinese Female

Chinese

મૂળભૂત Female

Chinese Male

Chinese

મૂળભૂત Male

English Female

English

મૂળભૂત Female

English Male

English

મૂળભૂત Male

French Female

French

મૂળભૂત Female

German Female

German

મૂળભૂત Female

Italian Female

Italian

મૂળભૂત Female

Japanese Female

Japanese

મૂળભૂત Female

Korean Female

Korean

મૂળભૂત Female

Spanish Female

Spanish

મૂળભૂત Female

CosyVoice 2 TTS - વારંવાર પૂછાતા પ્રશ્નો

Yes. CosyVoice 2 uses finite scalar quantization for streaming synthesis at very low latency, which is what makes it suitable for voice assistants and real-time applications.

Yes. It offers zero-shot voice cloning from roughly 3 seconds of reference audio, plus cross-lingual synthesis and emotion control.

Yes. CosyVoice 2 is Apache 2.0 licensed. It supports 8 languages: English, Chinese, Japanese, Korean, French, German, Italian, and Spanish.

← બધા અવાજો

CosyVoice 2 TTS

TTS.ai ને પ્રેમ કરો છો? તમારા મિત્રોને કહી દો!

વિશે CosyVoice 2

એક નજરમાં

CosyVoice 2 અવાજો

Chinese Female

Chinese Male

English Female

English Male

French Female

German Female

Italian Female

Japanese Female

Korean Female

Spanish Female

CosyVoice 2 TTS - વારંવાર પૂછાતા પ્રશ્નો

Can CosyVoice 2 stream audio in real time?

Does CosyVoice 2 support voice cloning?

Is CosyVoice 2 free for commercial use?