
CosyVoice 2 Text-to-Speech

Alibaba Tongyi Lab's streaming TTS reaching human-parity naturalness with near-zero latency and zero-shot cloning.


SSML mode (Speech Synthesis Markup Language)

Wrap your text in SSML tags for fine-grained control:

<speak><prosody rate="slow">Slow speech</prosody></speak>
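When the text comes from user input, building the SSML string by hand risks producing invalid XML if the text contains characters such as `&` or `<`. The following is a minimal sketch of a helper that produces the same markup as the example above while escaping the text; the function name `wrap_ssml` is hypothetical and not part of any CosyVoice or TTS.ai API.

```python
from xml.sax.saxutils import escape, quoteattr


def wrap_ssml(text: str, rate: str = "slow") -> str:
    """Wrap plain text in <speak><prosody rate=...> tags.

    XML special characters in the text (&, <, >) are escaped so the
    result stays well-formed. `wrap_ssml` is an illustrative helper only.
    """
    return f"<speak><prosody rate={quoteattr(rate)}>{escape(text)}</prosody></speak>"


print(wrap_ssml("Slow speech"))
print(wrap_ssml("Tom & Jerry"))
```

The resulting string can be pasted into the text field in SSML mode. Which `rate` values a given model honors is not stated here, so only `slow` is taken from the page itself.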


Pronunciation dictionary

Define custom pronunciations (word = pronunciation):
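The page only gives the entry format, `word = pronunciation`. As a rough illustration of what such a dictionary does, the sketch below parses entries in that format and substitutes whole words in the input text before synthesis. How the service actually applies the dictionary is not documented here; the function names `parse_lexicon` and `apply_lexicon`, the case-insensitive matching, and the sample entries are all assumptions.

```python
import re


def parse_lexicon(spec: str) -> dict[str, str]:
    """Parse lines of the form 'word = pronunciation' into a dict."""
    entries = {}
    for line in spec.splitlines():
        if "=" not in line:
            continue  # skip blank or malformed lines
        word, pronunciation = line.split("=", 1)
        if word.strip():
            entries[word.strip()] = pronunciation.strip()
    return entries


def apply_lexicon(text: str, entries: dict[str, str]) -> str:
    """Replace whole-word matches (case-insensitive) with their pronunciation."""
    if not entries:
        return text
    # Longest words first so longer entries win over their prefixes.
    words = sorted(entries, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(w) for w in words) + r")\b", re.IGNORECASE)
    lowered = {w.lower(): p for w, p in entries.items()}
    return pattern.sub(lambda m: lowered[m.group(0).lower()], text)


lexicon = parse_lexicon("SQL = sequel\nnginx = engine x")
print(apply_lexicon("Run SQL on nginx.", lexicon))
```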


About CosyVoice 2

CosyVoice 2, from Alibaba's Tongyi Lab, was designed to make high-quality speech viable in real time. It uses a finite scalar quantization approach combined with flow matching to support streaming synthesis at extremely low latency, while reaching human-comparable naturalness that outperforms many commercial systems in subjective tests. Beyond quality, it offers zero-shot voice cloning from about 3 seconds of audio, cross-lingual synthesis, and fine-grained emotion control. Covering 8 languages with a 1,000-character cap, it's a strong fit for voice assistants, streaming TTS, and other real-time applications.
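To make the term concrete, here is a minimal sketch of the general finite scalar quantization idea: each dimension of a latent vector is squashed into a bounded range and rounded to one of a small number of integer levels, and the per-dimension codes are combined into a single token index. This is a generic, inference-only illustration and not CosyVoice 2's actual implementation; the number of dimensions and levels below are illustrative assumptions, and the function names are hypothetical.

```python
import math


def fsq_quantize(z: list[float], half_levels: list[int]) -> list[int]:
    """Quantize each dimension of z to an integer in [-K, K] (2K + 1 levels).

    tanh bounds each value to (-1, 1); scaling by K and rounding snaps it
    to the nearest level. (Training would also need a straight-through
    gradient estimator, which is omitted in this sketch.)
    """
    return [round(math.tanh(v) * k) for v, k in zip(z, half_levels)]


def fsq_index(codes: list[int], half_levels: list[int]) -> int:
    """Combine per-dimension codes into one token index (mixed-radix encoding)."""
    index, base = 0, 1
    for c, k in zip(codes, half_levels):
        index += (c + k) * base  # shift the code from [-K, K] to [0, 2K]
        base *= 2 * k + 1
    return index


half_levels = [1, 1, 1]          # illustrative: 3 dimensions, 3 levels each
codes = fsq_quantize([0.0, 5.0, -5.0], half_levels)
print(codes, fsq_index(codes, half_levels))
```

With three levels in each of three dimensions, the implicit codebook has 3 × 3 × 3 = 27 entries, and no learned codebook lookup is needed.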

Best for: Real-time applications, streaming TTS, voice assistants


At a glance

Developer: Alibaba (Tongyi Lab)
License: Apache 2.0
Tier: standard
Speed: medium
Voice cloning: Yes
Languages: English, Chinese, Japanese, Korean, French, German, Italian, Spanish
Character limit: 1000
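Text longer than the character limit has to be split before it is submitted. The sketch below shows one possible approach, assuming the limit applies per request: pack whole sentences greedily into chunks of at most 1,000 characters, and hard-split any single sentence that is longer than the limit. The function name `split_for_tts` and the sentence-boundary rule (a `.`, `!`, or `?` followed by whitespace) are assumptions for illustration.

```python
import re


def split_for_tts(text: str, limit: int = 1000) -> list[str]:
    """Greedily pack whole sentences into chunks of at most `limit` characters."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        # A single sentence longer than the limit is cut at the limit.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


for chunk in split_for_tts("First sentence. Second sentence! Third one?", limit=20):
    print(len(chunk), chunk)
```

Splitting on sentence boundaries keeps each chunk's prosody self-contained, which tends to matter more for speech than for plain text processing.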

CosyVoice 2 Voices

Chinese Female: Chinese, Standard, Female
Chinese Male: Chinese, Standard, Male
English Female: English, Standard, Female
English Male: English, Standard, Male
French Female: French, Standard, Female
German Female: German, Standard, Female
Italian Female: Italian, Standard, Female
Japanese Female: Japanese, Standard, Female
Korean Female: Korean, Standard, Female
Spanish Female: Spanish, Standard, Female

CosyVoice 2 FAQ

Can CosyVoice 2 stream audio in real time?

Yes. CosyVoice 2 uses finite scalar quantization for streaming synthesis at very low latency, which is what makes it suitable for voice assistants and real-time applications.

Does CosyVoice 2 support voice cloning?

Yes. It offers zero-shot voice cloning from roughly 3 seconds of reference audio, plus cross-lingual synthesis and emotion control.

Is CosyVoice 2 free for commercial use?

Yes. CosyVoice 2 is released under the Apache 2.0 license, which permits commercial use. It supports 8 languages: English, Chinese, Japanese, Korean, French, German, Italian, and Spanish.