Dia TTS

A 1.6B-parameter model purpose-built for generating natural multi-speaker dialogue, not just single-voice narration.

SSML mode (Speech Synthesis Markup Language, for fine-grained control)

Wrap your text in SSML tags for precise control:

<speak><prosody rate="slow">Slow speech</prosody></speak>
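A minimal sketch of producing the same wrapper programmatically, using only Python's standard library. The helper name `wrap_prosody` is hypothetical and is not part of any TTS.ai API; escaping the text first keeps characters such as `&` from breaking the markup.

```python
from xml.sax.saxutils import escape, quoteattr


def wrap_prosody(text: str, rate: str = "slow") -> str:
    """Wrap plain text in <speak><prosody rate=...>, escaping XML special characters."""
    # quoteattr() adds the surrounding quotes and escapes the attribute value.
    return f"<speak><prosody rate={quoteattr(rate)}>{escape(text)}</prosody></speak>"


if __name__ == "__main__":
    print(wrap_prosody("Slow speech"))
```

Which `rate` values a given model honors is not stated on this page, so treat anything other than the `slow` value shown above as untested.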

Pronunciation dictionary

Define custom pronunciations (word = pronunciation):
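The page only gives the `word = pronunciation` shape, so the following is a sketch under assumptions: one entry per line, whitespace around `=` ignored, and whole-word, case-sensitive substitution before the text is sent for synthesis. Both function names are hypothetical, and the example entries are made up for illustration.

```python
import re


def parse_pronunciations(raw: str) -> dict[str, str]:
    """Parse 'word = pronunciation' lines into a dict, skipping malformed lines."""
    entries: dict[str, str] = {}
    for line in raw.splitlines():
        word, sep, spoken = line.partition("=")
        word, spoken = word.strip(), spoken.strip()
        if sep and word and spoken:
            entries[word] = spoken
    return entries


def apply_pronunciations(text: str, entries: dict[str, str]) -> str:
    """Replace each whole-word match with its custom pronunciation."""
    for word, spoken in entries.items():
        # A callable replacement avoids backslash interpretation in `spoken`.
        text = re.sub(rf"\b{re.escape(word)}\b", lambda _m, s=spoken: s, text)
    return text


if __name__ == "__main__":
    rules = parse_pronunciations("Nari = NAH-ree\nDAC = dack")
    print(apply_pronunciations("Nari Labs uses DAC.", rules))
```

Entries are applied in order, so a replacement that itself contains another dictionary word would be rewritten again by a later entry.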

Pitch: 0 (range -12 to +12)

Speed: 1.0x (range 0.5x to 2.0x)

Free with Piper, VITS, MeloTTS

About Dia TTS

Dia by Nari Labs is a 1.6-billion-parameter text-to-speech model designed from the ground up for dialogue rather than monologue. It generates conversations between two speakers with realistic turn-taking, prosody, and emotional expression, producing audio that sounds like a real exchange instead of two voices read separately. Architecturally it pairs an autoregressive transformer with the Descript Audio Codec (DAC) for waveform generation. Dia is a strong fit for podcast-style content, scripted audiobook dialogue, and conversational scenes, and is released under Apache 2.0. Generations are heavier than single-voice models, so it favors quality over raw speed.

Best for: podcasts, audiobook dialogue, conversational content

At a glance

Developer: Nari Labs
License: Apache 2.0
Tier: standard
Speed: medium
Voice cloning: No
Languages: English
Max characters: 800
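
Given the 800-character maximum listed above, a longer script would need to be split before submission. The sketch below assumes that splitting at sentence boundaries is acceptable and that the limit counts plain Python characters; how TTS.ai actually counts characters is not stated, and `chunk_text` is a hypothetical helper.

```python
import re

MAX_CHARS = 800  # the 800-character maximum listed above


def chunk_text(text: str, limit: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most `limit` characters, preferring sentence breaks."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # Hard-split any single sentence that is longer than the limit by itself.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    script = "This is one line of dialogue. " * 60
    print([len(chunk) for chunk in chunk_text(script)])
```

For dialogue, splitting between speaker turns rather than between arbitrary sentences would likely sound better, but that depends on how the script marks turns, which this page does not describe.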

Dia TTS voices

Speaker 1

English

Standard Neutral

Speaker 2

English

Standard Neutral

Dia TTS FAQ

What is Dia TTS designed for?

Multi-speaker dialogue. Unlike most TTS models that read one voice at a time, Dia generates a two-speaker conversation with natural turn-taking, prosody, and emotion in a single pass, which makes it well suited to podcasts and scripted scenes.

How big is the Dia model?

It is a 1.6-billion-parameter model from Nari Labs, built on an autoregressive transformer with the Descript Audio Codec for audio generation.

Does Dia support languages other than English?

On TTS.ai, Dia is configured for English. Its strength is dialogue generation rather than broad multilingual coverage.