GPT-SoVITS TTS
A few-shot voice cloning model that replicates a voice — and can even sing — from as little as five seconds of audio.
GPT-SoVITS, created by the developer known as RVC-Boss, combines GPT-style language modeling with SoVITS (Singing Voice Conversion / synthesis) to deliver some of the most accessible voice cloning in open source. With as little as five seconds of reference audio it captures a speaker's timbre and style, and it stands out from most TTS models in handling singing as well as speech. It works across English, Chinese, Japanese, and Korean and supports cross-lingual generation, so a cloned voice can speak a language the reference clip never used. It is widely used by content creators for voice replication, dubbing, and song covers, and reaches high fidelity for a model of its size.
A colpo d'occhio
- Sviluppatore
- RVC-Boss
- Licenza
- MIT
- Livello
- standard
- Velocità
- slow
- Clonazione vocale
- Sì
- Lingue
- English, Chinese, Japanese, Korean
- Caratteri massimi
- 500
GPT-SoVITS voci
Meglio per
Voice cloning, singing synthesis, content creator voice replication