
Dia TTS

A 1.6B-parameter model purpose-built for generating natural multi-speaker dialogue, not just single-voice narration.


SSML mode (Speech Synthesis Markup Language, for fine-grained control)

For fine-grained control, wrap your text in SSML tags:

<speak><prosody rate="slow">Slow speech</prosody></speak>
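The wrapping shown above can also be done programmatically. The following is a minimal sketch, not part of the TTS.ai tooling: `wrap_ssml` is a hypothetical helper name, and the only SSML elements it uses are the `speak` and `prosody` tags from the example. It escapes XML special characters so that text such as "Q&A" does not break the markup.

```python
from xml.sax.saxutils import escape, quoteattr


def wrap_ssml(text: str, rate: str = "slow") -> str:
    """Wrap plain text in <speak><prosody rate=...> tags.

    Hypothetical helper for illustration; XML special characters in
    `text` are escaped so the result stays well-formed.
    """
    return f"<speak><prosody rate={quoteattr(rate)}>{escape(text)}</prosody></speak>"


if __name__ == "__main__":
    print(wrap_ssml("Slow speech"))
```

Which `rate` values a given model honors is not stated on this page, so treat any value other than `slow` as untested.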


Pronunciation dictionary

Define custom pronunciations (word = pronunciation):
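To illustrate the "word = pronunciation" format, here is a minimal sketch of how such entries could be parsed and applied as plain text substitution before synthesis. This is an assumption about how a pronunciation dictionary might work, not a description of what TTS.ai does internally: the one-entry-per-line layout, the case-sensitive whole-word matching, and the function names `parse_lexicon` and `apply_lexicon` are all hypothetical.

```python
import re


def parse_lexicon(spec: str) -> dict[str, str]:
    """Parse lines of the form 'word = pronunciation' into a dict.

    Assumes one entry per line; lines without '=' or with an empty
    side are skipped.
    """
    entries: dict[str, str] = {}
    for line in spec.splitlines():
        word, sep, pron = line.partition("=")
        word, pron = word.strip(), pron.strip()
        if sep and word and pron:
            entries[word] = pron
    return entries


def apply_lexicon(text: str, lexicon: dict[str, str]) -> str:
    """Replace whole-word, case-sensitive matches with their pronunciation."""
    if not lexicon:
        return text
    # Longest words first so that longer entries win over their prefixes.
    words = sorted(lexicon, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(w) for w in words) + r")\b")
    return pattern.sub(lambda m: lexicon[m.group(1)], text)


if __name__ == "__main__":
    lexicon = parse_lexicon("Nari = NAH-ree\nDAC = dack")
    print(apply_lexicon("Nari Labs uses DAC.", lexicon))
```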

Pitch: 0 (range: -12 to +12)


Speed: 1.0x (range: 0.5x to 2.0x)

Free with Piper, VITS, and MeloTTS


About Dia TTS

Dia by Nari Labs is a 1.6-billion-parameter text-to-speech model designed from the ground up for dialogue rather than monologue. It generates conversations between two speakers with realistic turn-taking, prosody, and emotional expression, producing audio that sounds like a real exchange instead of two voices read separately. Architecturally it pairs an autoregressive transformer with the Descript Audio Codec (DAC) for waveform generation. Dia is a strong fit for podcast-style content, scripted audiobook dialogue, and conversational scenes, and is released under Apache 2.0. Generations are heavier than single-voice models, so it favors quality over raw speed.

Best for: Podcasts, audiobook dialogue, conversational content


Developer: Nari Labs
License: Apache 2.0
Tier: standard
Speed: medium
Voice cloning: No
Languages: English
Max characters: 800
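Because Dia generates two-speaker dialogue and the listing above gives a maximum of 800 characters, it can help to assemble and length-check a script before submitting it. The sketch below is illustrative only: the `[S1]` / `[S2]` speaker tags follow the convention used in Nari Labs' upstream Dia examples, and whether TTS.ai expects the same tags, or counts them toward the limit, is an assumption. `build_script` and `MAX_CHARS` are hypothetical names.

```python
MAX_CHARS = 800  # maximum characters listed for Dia TTS on this page


def build_script(turns: list[tuple[int, str]], limit: int = MAX_CHARS) -> str:
    """Join (speaker, text) turns into one tagged dialogue script.

    Assumes speakers are numbered 1 and 2 and are marked with [S1] / [S2]
    tags. Raises ValueError for an unknown speaker or an over-length script.
    """
    parts = []
    for speaker, text in turns:
        if speaker not in (1, 2):
            raise ValueError(f"unknown speaker: {speaker}")
        parts.append(f"[S{speaker}] {text.strip()}")
    script = " ".join(parts)
    if len(script) > limit:
        raise ValueError(f"script is {len(script)} characters; limit is {limit}")
    return script


if __name__ == "__main__":
    print(build_script([(1, "Did you hear the new episode?"), (2, "Not yet.")]))
```

Raising an error instead of silently truncating keeps a dialogue from being cut off mid-turn; a longer conversation would need to be split into several generations.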

Dia TTS Voices

Speaker 1

English

Default, Neutral

Speaker 2

English

Default, Neutral

Dia TTS FAQ

What is Dia TTS designed for?

Multi-speaker dialogue. Unlike most TTS models, which read one voice at a time, Dia generates a two-speaker conversation with natural turn-taking, prosody, and emotion in a single pass, which makes it well suited to podcasts and scripted scenes.

How big is the Dia model?

It is a 1.6-billion-parameter model from Nari Labs, built on an autoregressive transformer with the Descript Audio Codec for audio generation.

Does Dia support languages other than English?

On TTS.ai, Dia is configured for English. Its strength is dialogue generation rather than broad multilingual coverage.