Open Source Text to Speech Models

Mtundu uliwonse wa TTS papulatifomu yathu ndi waulere ndi zilolezo zogwirizana ndi malonda. MIT, Apache 2.0 - palibe zoletsa zovomerezeka, palibe zoletsa zogwiritsira ntchito, palibe ndalama zosayembekezereka zovomerezeka.

Zolemba Zotsegulidwa MIT License Apache 2.0 Kukhala ndi Mlengi GitHub

Yambitsani Tsopano

Free ndi Kokoro, Piper, VITS, MeloTTS
Zina zanu zopangidwa ndi mawu zidzawonekera pano
Zopangidwa
Kutsitsa
Kukonda TTS.ai? udzauza anzanu!

Open Source TTS Zothandiza

Nchifukwa chiyani mapangidwe a otsegulira amakhudza mabizinesi anu

Zonse Open-Source Licensed

Mtundu uliwonse wa TTS.ai umagwiritsa ntchito chilolezo chotsegulira chilolezo. Palibe mabatani oyera ovomerezeka, palibe kulephera kwa wogulitsa, palibe ndalama zosayembekezereka.

MIT / Apache 2.0

Models ali ndi chilolezo pansi MIT kapena Apache 2.0, osati kwambiri ovomerezeka open-source zilolezo.

Kukhala ndi Mlengi

Mukhoza kutsitsa chilichonse cha mtundu uliwonse ndikuchigwiritsa ntchito pa zida zanu. Mukhoza kuwongolera kwathunthu pa data yanu, kulephera, komanso chitetezo.

GPU yosinthidwa

Mapangidwe amasinthidwa kuti azigwirizana ndi NVIDIA GPUs ndi CUDA. Piper imagwira ntchito pa CPU yokha. Mapangidwe ambiri amafunikira 2-8GB VRAM kuti azitha kuzindikira bwino.

Kugwirizana

Mabungwe otsegulira otsegulira amasamalira ndi kukulitsa machitidwewa. Kuphatikiza kwaulere - kutumiza ma bugs, kukulitsa, ndi mawu atsopano pa GitHub.

Kugwiritsa ntchito kwamalonda OK

Mukhozanso kupanga zinthu, kugulitsa ntchito, ndi kupanga zinthu zamalonda popanda ndalama za royalties kapena ndalama zogwiritsira ntchito.

Timaperekanso Open Source Model Catalog

Gawo lililonse la mtundu, lisensi yake, ndi zomwe zimachita bwino

KokoroKokoro

Free

Lightweight 82M parameter model delivering studio-quality speech with blazing-fast inference.

Fast 5/5

Oyenera kwa: Apache 2.0 - yabwino kwambiri yaulere yaulere, 82M params, yosavuta kuyendera

_Phunzirani Kokoro

PiperPiper

Free

A fast, local neural text to speech system optimized for Raspberry Pi and embedded devices.

Fast 3/5

Oyenera kwa: MIT - CPU yokha, yabwino kwa zida za edge ndi embedded self-hosting

_Phunzirani Piper

VITSVITS

Free

Conditional variational autoencoder with adversarial learning for end-to-end text-to-speech.

Fast 3/5

Oyenera kwa: MIT - Foundational architecture amagwiritsa ntchito ambiri downstream mafano

_Phunzirani VITS

BarkBark

Standard

Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.

Slow 4/5

Oyenera kwa: MIT - osiyana audio chitukuko zipangizo m'mphepete mwa standard TTS

_Phunzirani Bark

Tortoise TTSTortoise TTS

Premium

Multi-voice text-to-speech focused on quality with autoregressive architecture.

Slow 5/5 Chizindikiro cha mawu

Oyenera kwa: Apache 2.0 - kukula kwabwino kwambiri, kuphunzira kochuluka pamalamulo

_Phunzirani Tortoise TTS

OpenVoiceOpenVoice

Premium

Instant voice cloning with granular control over style, emotion, and accent.

Medium 4/5 Chizindikiro cha mawu

Oyenera kwa: MIT - kutulutsa mawu otsegulira ndi kuwongolera kwamtundu wa granular

_Phunzirani OpenVoice

Momwe Mungagwiritsire Ntchito Open Source TTS

Mutha kugwiritsa ntchito ma API athu kapena kuyendetsa mapangidwe anu okha

1

Kafukufuku wa Open Source Models

Pezani mndandanda wathu wa 20+ open-source TTS models.Kawirikawiri, tsamba la model limasonyeza chilolezo, chilankhulo, zofunikira, ndi zofunikira za self-hosting.

2

Pezani mu msakatuli wanu

Test any model directly on TTS.ai without installing anything. Our GPU servers handle processing so you can evaluate quality before committing to self-hosting.

3

Self-Host kapena kugwiritsa ntchito API yathu

Clone model repos kuchokera ku GitHub ndikugwira ntchito pamalo, kapena kugwiritsa ntchito API yathu yoyang'anira kupanga.Self-hosting imapatsa kuwongolera kwathunthu; API yathu imapatsa njira yoyang'anira.

4

Kukhazikitsa wanu Application

Kuphatikiza TTS muzinthu zanu pogwiritsa ntchito mapangidwe oyang'anira okha kapena REST API yathu.Zosefa zonse ndizogwiritsa ntchito malonda popanda ndalama za lisensi kapena royalties.

Kuyerekezera kwa License

Zomwe zili pa TTS.ai zimagwiritsa ntchito malayisensi otsegulira otsegulira

Model License Kugwiritsa ntchito kwamalonda Kusintha Woyang'anira yekha Kugwirizana
Kokoro Apache 2.0 Zofunika
Piper MIT Chosafunikira
VITS MIT Chosafunikira
MeloTTS MIT Chosafunikira
Chatterbox MIT Chosafunikira
Tortoise TTS Apache 2.0 Zofunika
StyleTTS 2 MIT Chosafunikira
OpenVoice MIT Chosafunikira
Sesame CSM Apache 2.0 Zofunika
Orpheus Llama 3.2 "Built with Llama"

Kukhala ndi Hosting vs Hosted API

Kuyendetsa mapangidwe anu nokha kapena tikuthandizeni kusamalira chitetezo

Self-Host pa Hardware yanu

Mtundu uliwonse wa TTS.ai ulipo ngati projekiti yotsegulidwa pa GitHub kapena Hugging Face. Mutha kutsitsa kunenepa, kukhazikitsa zotengera, ndi kuyendetsa chidziwitso pa GPU yanu.Muli ndi kuwongolera kwathunthu pa latency, privacy, ndi scaling.

  • Full deta privacy - audio sadzasiya seva yanu
  • Palibe ndalama za per-zofuna pambuyo pa setup yoyamba
  • Custom fine-tuning pa wanu okha deta
  • Amafunikira zida za GPU (NVIDIA imalimbikitsa)
  • Mungathe kuyendetsa zosintha, scaling, ndi kutengerapo

Kugwiritsa ntchito TTS.ai Hosted API

Tikuwongolera kukhazikitsa kwa GPU, kusinthidwa kwa mapangidwe, kuwongolera kwa queue, ndi kukulitsa. Chimodzi mwazinthu za API zimakupatsani mwayi wopeza mapangidwe onse - palibe chifukwa chofunikira kuyendetsa kukhazikitsidwa kosiyana.

  • Palibe zida za GPU zofunikira
  • Zomwe 20 + zojambula ndi imodzi API
  • Kusintha kwa machitidwe ndi kuwonjezeka kwa machitidwe
  • 99.9% uptime ndi chitetezo chokwanira
  • Pezani ndalama zokha za zomwe mumagwiritsa ntchito

Kuyambira kwaulere: API kapena Self-Host

Mutha kugwiritsa ntchito API yathu yokhala ndi masamba, kapena kukhazikitsa Kokoro m'malo mwake m'maola angapo

Njira 1: TTS.ai Hosted API Yabwino kwambiri
import requests

response = requests.post("https://api.tts.ai/v1/tts", json={
    "text": "Open source TTS with a simple API.",
    "model": "kokoro",
    "voice": "af_heart",
    "format": "wav"
}, headers={"Authorization": "Bearer YOUR_API_KEY"})

with open("output.wav", "wb") as f:
    f.write(response.content)
Chosankha 2: Self-Host ndi pip Kuwongolera kwathunthu
# Install Kokoro locally
pip install kokoro

# Generate speech on your own GPU
import kokoro

pipeline = kokoro.KPipeline(lang_code="a")
generator = pipeline("Hello from your own server!", voice="af_heart")
for i, (gs, ps, audio) in enumerate(generator):
    kokoro.save(audio, f"output_{i}.wav")

Otsegulidwa Source, Zotsika mtengo

Timapereka API yathu yokhala ndi TTS yotsegulira popanda kuyendetsa GPUs.

Free Tier

$0

15,000 characters pa signup

  • 4 open-source zojambula zaulere
  • Simungalembetse pogwiritsa ntchito mfundo
  • Kugwiritsa ntchito kwamalonda kumaloledwa

Woyamba

$9

500,000 characters/mwezi

  • Zonse 20+ open-source mapangidwe
  • Kusintha kwa mawu
  • Kupeza kwa API

Pro

$29

2,000,000 characters/mwezi

  • Kugwiritsa ntchito GPU
  • Zomwe zimadziwika bwino
  • Kuthandizira kwa Enterprise
Pangani Full Pricing

Funso Lofunsidwa Kawirikawiri

Mafunso ofala kwambiri pa nkhani ya kusinthitsa malemba kukhala mawu

Yai. Kawirikawiri, ma TTS.ai amagwiritsa ntchito chilolezo chotsegulira chilolezo - MIT kapena Apache 2.0. Tikuletsa mosasamala ma TTS.ai ndi zilolezo zoletsa (monga CPML ya Coqui kapena CC-BY-NC yosagulitsa).

Apache 2.0 imawonjezera zopereka za patenti zosadziwika bwino ndipo imafunikira kufotokoza zosintha ngati musintha kodi. MIT ndi yosavuta ndi zofunikira zochepa. Ambiri ndi othandiza pabizinesi.

Yai. Kawirikawiri, maphunzirowa amapezeka pa GitHub, koma titha kukhazikitsanso maphunzirowa pa GitHub. Ndikofunika kuti mupange maphunziro anu pa GitHub, kukhazikitsa zofunikira, kutsitsa maphunziro, ndi kuyendetsa chidziwitso. Timapereka zidziwitso za zofunikira za maphunziro onse, kuphatikizapo GPU, RAM, ndi mtundu wa Python.

Mafunso amasiyana malinga ndi mtundu. Piper safunikira GPU (CPU yokha). Kokoro ndi MeloTTS amafunikira 1-2GB VRAM. Mapangidwe ambiri amafunikira 4GB VRAM. Tortoise ndi Sesame CSM amafunikira 8GB. A NVIDIA RTX 3060 (12GB) angagwiritsidwe ntchito bwino kwambiri.

Yes. Open-source licenses allow modification including fine-tuning. Models like GPT-SoVITS and Bark provide fine-tuning scripts. You can train models on your own voice data to create custom voices or improve performance for specific languages.

Top open-source mafano (Kokoro, StyleTTS 2, Chatterbox) tsopano kugwirizana kapena kupitilira malonda ntchito monga ElevenLabs ndi Google TTS mu quality benchmarks.The lalikulu vantaggio di servizi commerciali è gestita infrastruttura e supporto, non qualità audio.

Tidachotsapo XTTS/XTTS-v2 (Coqui's CPML — non-commercial), F5-TTS (CC-BY-NC — non-commercial), ndi Higgs-v2 (Boson License — restrictive) zonsezi. Kawirikawiri, TTS.ai imatsimikiziridwa kuti ndi yotetezeka kugwiritsa ntchito kwamalonda.

Ndikofunika. Mamodeli ambiri amavomereza maphunziro a anthu kudzera pa GitHub. Mukhoza kutumiza zidziwitso za zifukwa, zolemba za mawu za maphunziro atsopano, kuwonjezera kofunikira, ndi zolemba. Sankhani GitHub repository ya model iliyonse kuti mudziwe malangizo a maphunziro ndi mavuto ogwira ntchito.

GPU yathu ikugwira ntchito ndi 20 + mapangidwe pa 4x Tesla P40 (96GB VRAM yonse) pogwiritsa ntchito kutsitsa kwa dynamic. Pogwiritsa ntchito 24GB GPU, mutha kuyendetsa ma 3-5 mapangidwe nthawi imodzi.

Mapangidwe ambiri amapatsa zithunzi za Docker kapena Dockerfiles. Kuti mugwiritse ntchito mapangidwe ambiri, mutha kupanga setup ya Docker yosinthidwa ndi NVIDIA Container Toolkit kuti mupite ku GPU.

Mapangidwe ambiri amafuna Python 3.10-3.12. Coqui TTS (VITS) amafunikira Python 3.11. Tikulimbikitsa Python 3.12 kwa mapangidwe ambiri. Onani requirements.txt ya mapangidwe onse kuti mudziwe mtundu wogwirizana.

Ndikofunika. MIT ndi Apache 2.0 zilolezo mosasamala amalola kugwiritsa ntchito malonda. Mukhoza kulenga SaaS zinthu, mafoni mapulogalamu, masewera, ndi ntchito pogwiritsa ntchito izi mafano popanda ndalama licensing, royalties, kapena zofunikira kutchulidwa (kapena ngakhale kutchulidwa ndi kuthokoza).
5.0/5 (1)

Kodi tingachitire chiyani kuti tisinthe? Maganizo anu amatithandiza kuchotsa mavuto.

Pezani Open Source TTS Tsopano

20 + otsegulira mapangidwe, onse ndi mabizinesi ovomerezeka. Mutha kugwiritsa ntchito API yathu kapena kupangira nokha - kusankha ndi kwanga.