
CosyVoice 2 TTS

Alibaba Tongyi Lab's streaming TTS model, offering human-comparable naturalness, very low latency, and zero-shot voice cloning.


SSML mode (Speech Synthesis Markup Language, for precise control)

Wrap text in SSML tags for precise control:

<speak><prosody rate="slow">Slow speech</prosody></speak>
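When SSML like this is generated programmatically, the text should be XML-escaped before it is wrapped. The following is a minimal sketch using only Python's standard library. The helper name `wrap_ssml` is hypothetical and is not part of any TTS.ai or CosyVoice API, and which `rate` values a given model actually honours is not stated on this page.

```python
import xml.etree.ElementTree as ET


def wrap_ssml(text: str, rate: str = "slow") -> str:
    """Wrap plain text in <speak><prosody rate="...">, escaping XML characters.

    Hypothetical helper for illustration only.
    """
    speak = ET.Element("speak")
    prosody = ET.SubElement(speak, "prosody", {"rate": rate})
    # ElementTree escapes &, < and > in element text when serializing.
    prosody.text = text
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(wrap_ssml("Slow speech"))
    print(wrap_ssml("R&D results"))
```

Building the markup through an XML library rather than string concatenation keeps the output well-formed even when the input contains characters such as `&` or `<`.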

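Emotion/style tags

Tags are understood by select models. Click a tag to insert it into the text at the cursor.

Pronunciation dictionary

Define custom pronunciations (word = pronunciation).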
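Entries of the form word = pronunciation could be parsed and applied as sketched below. The one-entry-per-line layout, the case-sensitive whole-word matching, and both function names are assumptions made for illustration. The page does not document the exact format or how the service applies it.

```python
import re


def parse_pronunciations(raw: str) -> dict[str, str]:
    """Parse 'word = pronunciation' lines into a dict (assumed format)."""
    entries = {}
    for line in raw.splitlines():
        if "=" not in line:
            continue  # skip blank or malformed lines
        word, _, spoken = line.partition("=")
        word, spoken = word.strip(), spoken.strip()
        if word and spoken:
            entries[word] = spoken
    return entries


def apply_pronunciations(text: str, entries: dict[str, str]) -> str:
    """Replace whole-word occurrences of each entry with its pronunciation."""
    if not entries:
        return text
    # Try longer words first so they win over shorter overlapping ones.
    words = sorted(entries, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(w) for w in words) + r")\b")
    return pattern.sub(lambda m: entries[m.group(1)], text)


if __name__ == "__main__":
    rules = parse_pronunciations("SQL = sequel\nnginx = engine x\n")
    print(apply_pronunciations("SQL on nginx", rules))
```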

Pitch: 0 (range -12 to +12)

Speed: 1.0x (range 0.5x to 2.0x)

About CosyVoice 2

CosyVoice 2, from Alibaba's Tongyi Lab, was designed to make high-quality speech viable in real time. It uses a finite scalar quantization approach combined with flow matching to support streaming synthesis at extremely low latency, while reaching human-comparable naturalness that outperforms many commercial systems in subjective tests. Beyond quality, it offers zero-shot voice cloning from about 3 seconds of audio, cross-lingual synthesis, and fine-grained emotion control. Covering 8 languages with a 1,000-character cap, it's a strong fit for voice assistants, streaming TTS, and other real-time applications.
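To make "finite scalar quantization" concrete, here is a generic sketch of the technique in plain Python. Each latent dimension is squashed into a bounded range and rounded to one of a few integer levels, and the per-dimension codes are then combined into a single token index. The level counts and function names are illustrative assumptions and are not taken from the CosyVoice 2 implementation.

```python
import math


def fsq_quantize(z, levels):
    """Quantize each latent dimension to an integer code in [0, levels[i] - 1]."""
    codes = []
    for value, n_levels in zip(z, levels):
        half = (n_levels - 1) / 2
        bounded = math.tanh(value) * half   # bounded to (-half, half)
        codes.append(round(bounded + half)) # shifted to 0 .. n_levels - 1
    return codes


def fsq_index(codes, levels):
    """Combine per-dimension codes into one token index (mixed radix)."""
    index = 0
    for code, n_levels in zip(codes, levels):
        index = index * n_levels + code
    return index


if __name__ == "__main__":
    levels = [3, 3, 3]            # implicit codebook of 3 * 3 * 3 = 27 tokens
    codes = fsq_quantize([-2.0, 0.0, 2.0], levels)
    print(codes, fsq_index(codes, levels))
```

Because the codebook is implied by the rounding grid rather than learned as a lookup table, every index in the range is reachable by construction.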

Best for: Real-time applications, streaming TTS, voice assistants


At a glance

Developer: Alibaba (Tongyi Lab)
License: Apache 2.0
Tier: standard
Speed: medium
Voice cloning: Yes
Languages: English, Chinese, Japanese, Korean, French, German, Italian, Spanish
Max characters: 1000
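Given the 1,000-character maximum listed here, longer scripts have to be split before synthesis. The sketch below packs whole sentences into chunks that stay under the cap. The function name and the sentence-splitting regex are assumptions, and the page does not say exactly how the service counts characters.

```python
import re

MAX_CHARS = 1000  # per-request cap listed for CosyVoice 2 on this page


def split_for_tts(text: str, max_chars: int = MAX_CHARS) -> list[str]:
    """Greedily pack sentences into chunks of at most max_chars characters."""
    # Split after Latin punctuation followed by whitespace, or after CJK
    # sentence-ending punctuation (which is usually not followed by a space).
    sentences = re.split(r"(?<=[.!?])\s+|(?<=[。！？])", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        # Hard-split any single sentence that exceeds the cap on its own.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        # Sentences are rejoined with a single space (a simplification).
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    script = "A" * 600 + ". " + "B" * 600 + "."
    print([len(chunk) for chunk in split_for_tts(script)])
```

Each returned chunk can then be submitted as a separate request and the resulting audio concatenated in order.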

CosyVoice 2 voices

Chinese Female (Chinese, standard, female)
Chinese Male (Chinese, standard, male)
English Female (English, standard, female)
English Male (English, standard, male)
French Female (French, standard, female)
German Female (German, standard, female)
Italian Female (Italian, standard, female)
Japanese Female (Japanese, standard, female)
Korean Female (Korean, standard, female)
Spanish Female (Spanish, standard, female)

CosyVoice 2 TTS - FAQ

Can CosyVoice 2 stream audio in real time?

Yes. CosyVoice 2 combines finite scalar quantization with flow matching to support streaming synthesis at very low latency, which makes it suitable for voice assistants and real-time applications.

Does CosyVoice 2 support voice cloning?

Yes. It offers zero-shot voice cloning from roughly 3 seconds of reference audio, plus cross-lingual synthesis and emotion control.

Is CosyVoice 2 free for commercial use?

Yes. CosyVoice 2 is Apache 2.0 licensed. It supports 8 languages: English, Chinese, Japanese, Korean, French, German, Italian, and Spanish.