报告错误/功能要求

CosyVoice3 TT TT TTT TTT T TT TT T T TTT TTT TTT

Alibaba FunAudioLLM's latest multilingual model with ~150ms bi-streaming, instruction control, and zero-shot cloning.

0/500 字符 · 每一代5,000人签名 →

签名对 5,000 字符限制的 5 000 个字符

SSML 模式 (用于精密控制的语音合成标记语言)

在 SSML 标记中折行文本以精确控制 :

<speak><prosody rate="slow">Slow speech</prosody></speak>

情感/样式标记

标记选中模式的理解度 - 单击将一个输入到文本中, 发生时 :

发音字典

定义自定义发音( Word = 发音) :

切进 0

-12 +12

AIT 型 AI 型

语音声音

语言

输出格式

速度 1.0x

0.5x 2.0x

免费的管道、VITS、MelotTS

您生成的音频将在此显示。选择一个模型, 输入文本, 并单击生成。

关于 CosyVoice3

CosyVoice3 is the newest generation from Alibaba's FunAudioLLM team and a clear step up from CosyVoice 2. It introduces bi-streaming inference with roughly 150ms latency and instruction-based control, letting you steer emotion, speed, and volume through prompts. Speaker similarity for zero-shot voice cloning is improved, and coverage spans 9 languages plus 18 Chinese dialects. An RL-tuned variant pushes prosody to a state-of-the-art level. With a 5,000-character ceiling, fast generation, and strong cloning, it's geared toward multilingual production TTS and real-time applications.

最佳: Multilingual production TTS, real-time applications, voice cloning

全部浏览 CosyVoice3 声音

一眼看一眼,

开发者: Alibaba (FunAudioLLM)
许可证: Apache 2.0
级别: standard
速度: fast
语音克隆: 是
语言: English, Chinese, Japanese, Korean, German, Spanish, French, Italian, Russian
最大字符: 5000

CosyVoice3 声音

Chinese Female

Chinese

标准 Female

Chinese Male

Chinese

标准 Male

English Female

English

标准 Female

English Male

English

标准 Male

French Female

French

标准 Female

German Female

German

标准 Female

Italian Female

Italian

标准 Female

Japanese Female

Japanese

标准 Female

Korean Female

Korean

标准 Female

Russian Female

Russian

标准 Female

Spanish Female

Spanish

标准 Female

CosyVoice3 TTS - 常见问题

CosyVoice3 adds bi-streaming inference at around 150ms latency, instruction-based control over emotion/speed/volume, improved speaker similarity for cloning, and coverage of 9 languages plus 18 Chinese dialects, with an RL-tuned variant for state-of-the-art prosody.

Yes. It supports zero-shot voice cloning from a reference clip (around 3 seconds minimum) with improved speaker similarity over the previous generation.

Yes. CosyVoice3 is licensed under Apache 2.0, permitting commercial use.

← 所有声音

CosyVoice3 TT TT TTT TTT T TT TT T T TTT TTT TTT

喜欢TTS.ai吗？告诉你的朋友吧！

关于 CosyVoice3

一眼看一眼,

CosyVoice3 声音

Chinese Female

Chinese Male

English Female

English Male

French Female

German Female

Italian Female

Japanese Female

Korean Female

Russian Female

Spanish Female

CosyVoice3 TTS - 常见问题

What makes CosyVoice3 different from CosyVoice 2?

Does CosyVoice3 support voice cloning?

Is CosyVoice3 free for commercial use?