Segnala bug / richiesta di funzionalità

GPT-SoVITS TTS

A few-shot voice cloning model that replicates a voice — and can even sing — from as little as five seconds of audio.

Testo
File

0/500 caratteri · Iscriviti per 5.000 per generazione →

Iscriviti per un limite di 5.000 caratteri

Modalità SSML (Linguaggio di marcatura sintesi vocale per un controllo fine)

Avvolgi il tuo testo nei tag SSML per un controllo preciso:

<speak><prosody rate="slow">Slow speech</prosody></speak>

Emozione / Tag stile

Tags il modello selezionato comprende clic su

Dizionario della pronuncia

Definire le pronunciazioni personalizzate (parola = pronuncia):

Piazzola 0

-12 +12

Modello AI

Voce

Lingua

Formato di output

Velocità 1.0x

0.5x 2.0x

Gratis con Piper, VITS, MeloTTS

L'audio generato apparirà qui. Scegli un modello, inserisci testo e fai clic su Genera.

Informazioni GPT-SoVITS

GPT-SoVITS, created by the developer known as RVC-Boss, combines GPT-style language modeling with SoVITS (Singing Voice Conversion / synthesis) to deliver some of the most accessible voice cloning in open source. With as little as five seconds of reference audio it captures a speaker's timbre and style, and it stands out from most TTS models in handling singing as well as speech. It works across English, Chinese, Japanese, and Korean and supports cross-lingual generation, so a cloned voice can speak a language the reference clip never used. It is widely used by content creators for voice replication, dubbing, and song covers, and reaches high fidelity for a model of its size.

Meglio per: Voice cloning, singing synthesis, content creator voice replication

Sfoglia tutti GPT-SoVITS voci

A colpo d'occhio

Sviluppatore: RVC-Boss
Licenza: MIT
Livello: standard
Velocità: slow
Clonazione vocale: Sì
Lingue: English, Chinese, Japanese, Korean
Caratteri massimi: 500

GPT-SoVITS voci

Default

Chinese

Standard Neutral

English Default

English

Standard Neutral

Japanese Default

Japanese

Standard Neutral

Korean Default

Korean

Standard Neutral

GPT-SoVITS FAQ del TTS

As little as five seconds. It uses few-shot learning, so a short reference clip is enough to capture a speaker, though a cleaner and slightly longer sample improves similarity.

Yes. Its SoVITS lineage comes from singing voice synthesis, so unlike most TTS models it can generate singing as well as spoken voice, which is why it is popular for song covers.

English, Chinese, Japanese, and Korean, with cross-lingual synthesis — a voice cloned from one language can be made to speak the others.

← Tutte le voci

GPT-SoVITS TTS

Ti piace TTS.ai? Dillo ai tuoi amici!

Informazioni GPT-SoVITS

A colpo d'occhio

GPT-SoVITS voci

Default

English Default

Japanese Default

Korean Default

GPT-SoVITS FAQ del TTS

How much audio does GPT-SoVITS need to clone a voice?

Can GPT-SoVITS sing?

Which languages does GPT-SoVITS support?