Ka tūwhera te pūtake o te kupu ki ngā tauira kōrero

He pūtake tūwhera ia tauira TTS i runga i a tātau pūnaha me ngā whakaaetanga whai tikanga. MIT, Apache 2.0 — kāore he kati ā-whāinga, kāore he whakawhāititanga whakamahinga, kāore he utu whakaaetanga tūkino. Ka whakamahia rātau mā tātau API hosta, ka noho rātau rānei i runga i tātau ake hanganga me te mana katoa.

Māmā te pūtake Whakawhiwhinga MIT Apache 2.0 Kātahi anō ka whakahohe GitHub

Ka whakamātautia ināianei

0/500
Waihoki me Kokoro, Piper, VITS, MeloTTS
Your generated audio will appear here
I hangaia
0:00 0:00
Waihoki
Pērā ki a TTS.ai? E kōrero ana ki ōna hoa!

Ko nga painga o te pūtake tūwhera TTS

He aha te tikanga o ngā tauira pūtake tūwhera mō ōna kaupapa?

Ko nga mea katoa kua whakaaetia e te pūtake tūwhera

Ko ia tauira i runga i te TTS.ai e whakamahi ana i tētahi whakaaetanga pūtake tūwhera. Kāore he kāwai mārō whai mana, kāore he kati tāngata, kāore he utu whakaaetanga kāore i te tūmanako.

MIT / Apache

Kua whakaaetia ngā tauira i raro i te MIT, i te Apache 2.0 rānei, ko ngā whakaaetanga pūtake tūwhera tino whakaaetia. Ka whakamahia ki te hokohoko, te whakarerekētanga, te tuaritanga anō — kāore he rāhuitanga.

Kātahi anō ka whakahohe

Ka whakataki i tētahi tauira me te whakahaere i ia i runga i ōna ake pūrere. Ka tino whakahaeretia ōna raraunga, te ātetetanga, me te hanganga. Kāore e hiahiatia te whakawhirinaki ki te mātao.

Kua pai ake te GPU

Kua whakaritea ngā tauira mō ngā GPU NVIDIA me te tautoko CUDA. Ka haere a Piper i runga i te CPU anake. E hiahiatia ana e te nuinga o ngā tauira he 2–8GB VRAM mō te whakawāteatanga mātauranga.

Ko te iwi i pupuritia

Ko ngā rōpū pūtake tūwhera hohe e pupuri ana, e whakarei ake ana hoki i ēnei tauira. E whakaaetia ana ngā koha — e tono ana i ngā hapa, ngā whakapainga, me ngā reo hōu i runga i te GitHub.

Ka tika te whakamahinga hokohoko

Ka whakaaetia e ngā tauira katoa te whakamahinga hokohoko i raro i to rātau whakaaetanga, te hanga hua, te hoko ratonga, me te waihanga ihirangi hokohoko kaore he utu utu, he utu whakamahi rānei.

Ko tātau pukapuka tauira pūtake tūwhera

He tauira ia tauira, tōna whakaaetanga, me te mea pai rawa e mahi ai.

KokoroKokoro

Free

Lightweight 82M parameter model delivering studio-quality speech with blazing-fast inference.

Fast 5/5

Ko te tino pai mo: Apache 2.0 — te tauira wātea pai rawa, 82M ngā tohu, he ngāwari ki te whakanoho i a ia anō

Whakamātautau Kokoro

PiperPiper

Free

A fast, local neural text to speech system optimized for Raspberry Pi and embedded devices.

Fast 3/5

Ko te tino pai mo: MIT — CPU- anake, tino pai mo ngā pūrere perehitini me te whakahoahoa ā-hinengaro

Whakamātautau Piper

VITSVITS

Free

Conditional variational autoencoder with adversarial learning for end-to-end text-to-speech.

Fast 3/5

Ko te tino pai mo: MIT — te hanganga taketake e whakamahia ana e ngā tauira maha o raro.

Whakamātautau VITS

BarkBark

Standard

Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.

Slow 4/5

Ko te tino pai mo: MIT — ngā āheinga whakanao oro motuhake i tua atu i te TTS paerewa

Whakamātautau Bark

Tortoise TTSTortoise TTS

Premium

Multi-voice text-to-speech focused on quality with autoregressive architecture.

Slow 5/5 Ko te tāruatanga reo

Ko te tino pai mo: Apache 2.0 — te pai rawa, te whakatinanatanga tohutoro whānui i akona

Whakamātautau Tortoise TTS

OpenVoiceOpenVoice

Premium

Instant voice cloning with granular control over style, emotion, and accent.

Medium 4/5 Ko te tāruatanga reo

Ko te tino pai mo: MIT — te tārua reo pūtake tūwhera me te whakahaere kāhua matatini

Whakamātautau OpenVoice

He pēhea te whakamahi i te pūtake tūwhera TTS

Ka whakamahia tātau te API kaihautū, ka whakahaere rānei i ngā tauira ki a koe

1

E torotoro ana i ngā tauira pūtake tūwhera

Ka tirohia tātau pukapuka o ngā tauira TTS pūtake tūwhera 20+. Ka whakaaturia e ia pātū tauira te whakaaetanga, te hanganga, ngā āheinga, me ngā whakaritenga whakahoahoa.

2

Ka whakamātau i roto i tō tou kaitirotiro

Whakamātautau i tētahi tauira hāngai i runga i te TTS.ai me te kore whakatū i tētahi mea. Ka whakahaeretia e a tātau pūnaha GPU te tukanga kia taea ai e koe te arotake i te kounga i mua i te whakawhiwhinga ki te whakawhiwhinga.

3

Māna-māna, Ka whakamahia rānei tātau API

Ko te tārua o ngā tāurunga tauira mai i te GitHub me te whakahaere ā-rohe, te whakamahi rānei i tātau API kaihautū mō te whakanaotanga. Ka hoatu e te kaihautū anō te mana katoa; Mā tātau API e whakarato ana i te hanganga whakahaere.

4

Hanganga i ōna taupānga

Ko te whakaurutanga o TTS ki roto i tōna hua mā te whakamahi i ngā tauira whaimana, mā tātau rānei REST API. Ka taea te whakamahi i ngā tauira katoa me te kore utu whakaaetanga, royalties rānei.

Tērā te whakataurite waehere

Ko ngā tauira katoa i runga i te TTS.ai e whakamahi ana i ngā whakaaetanga pūtake tūwhera ā-pūnaha

Kāhua Whakawhiwhinga Ka whakamahia ki te hokohoko Ka whakakētia Māmā te kaihautū Whakawhiwhinga
Kokoro Apache 2.0 E hiahiatia ana
Piper MIT Ka taea
VITS MIT Ka taea
MeloTTS MIT Ka taea
Chatterbox MIT Ka taea
Tortoise TTS Apache 2.0 E hiahiatia ana
StyleTTS 2 MIT Ka taea
OpenVoice MIT Ka taea
Sesame CSM Apache 2.0 E hiahiatia ana
Orpheus Llama 3.2 "Built with Llama"

Hopu vs Hosted API

Ka whakahaere i ngā tauira koe, ka whakaae rānei kia whakahaeretia e tātau te hanganga

Māna anō te kaihautū i runga i ōna pūrere

Kei te wātea ia tauira i runga i te TTS.ai hei kaupapa pūtake tūwhera i runga i te GitHub, i te Hugging Face rānei. Whakataki i ngā taumahatanga, te whakatū i ngā whakawhirinaki, me te whakahaere i te whakawāteatanga i runga i ōna GPU ake. He mana katoa tātou i runga i te ātetetanga, te ātetetanga, me te tauinetanga.

  • Ko te ātetetanga raraunga katoa — kāore te oro i tae atu ki tōna pūnaha
  • Kāore he utu-i-te tono i muri i te whakaritenga tuatahi
  • He tino tika te whakarerekētanga i runga i ōna raraunga
  • E hiahiatia ana ngā pūrere GPU (e whakaritea ana e NVIDIA)
  • Ka whakahaeretia e koe ngā whakahōutanga, te tauine, me ngā whakawhirinaki

Ka whakamahia te TTS.ai Hosted API

Ka whiwhi āheitanga tere ki ngā tauira 24+ katoa mā tētahi API REST kotahi. Ka whakahaeretia e tātau te GPU, ngā whakahōutanga tauira, te whakahaere kōwae, me te tauinetanga. Ko te kī API kotahi e whakarato ana i te āheitanga ki ia tauira — kāore e hiahiatia kia whakahaeretia ngā whakawhānuitanga motuhake.

  • Kāore he rauemi GPU e hiahiatia ana
  • Ko ngā tauira katoa 24+ mā te API kotahi
  • Whakahōutanga tauira me ngā whakapainga
  • 99.9% uptime me te hanganga nui rawa
  • Ko te utu anake mo te mea e whakamahia ana e koe

Ka tīmata tere: API, Māna rānei te kaihautū

Ka whakamahia tātau te API kaihautū, ka whakatū ā-rohe rānei a Kokoro i roto i ngā minu

Waihoki 1: TTS.ai Hosted API Māmā rawa
import requests

response = requests.post("https://api.tts.ai/v1/tts", json={
    "text": "Open source TTS with a simple API.",
    "model": "kokoro",
    "voice": "af_heart",
    "format": "wav"
}, headers={"Authorization": "Bearer YOUR_API_KEY"})

with open("output.wav", "wb") as f:
    f.write(response.content)
Waihoki 2: Tākaro-whakahaere me te pip Mana katoa
# Install Kokoro locally
pip install kokoro

# Generate speech on your own GPU
import kokoro

pipeline = kokoro.KPipeline(lang_code="a")
generator = pipeline("Hello from your own server!", voice="af_heart")
for i, (gs, ps, audio) in enumerate(generator):
    kokoro.save(audio, f"output_{i}.wav")

Māmā te pūtake, te utu ā-ringa

Ka taea e tātau te hopu i te TTS pūtake tūwhera me te kore whakahaere i ngā GPUs.

Waihoki

$0

50 ngā pūtea i te whakaingoatanga

  • 4 ngā tauira pūtake tūwhera
  • Kāore he whakaingoatanga mō te whakamahinga taketake
  • Ka whakaaetia te whakamahinga hokohoko

Ka tīmata

$9

500 ngā pūtea/whā

  • Ko ngā tauira pūtake tūwhera katoa 24+
  • Ko te tārua reo
  • Ka uru ki te API

Pro

$29

2000 ngā pūtea/whā

  • Whakahaeretanga GPU arotahi
  • Ko nga tauira katoa
  • Mā te tautoko pāpori
Tirohia te utu katoa

E pā ana ngā pātai

Ko ngā pātai noa iho mo te kupu pūtake tūwhera ki te kōrero

Ināianei. Ka whakamahia e ia tauira i runga i te TTS.ai tētahi whakaaetanga pūtake tūwhera - MIT rānei, Apache 2.0 rānei. E whakawātea ana mātou i ngā tauira me ngā whakaaetanga whakawhāiti (pēnei i te CPML a Coqui, i te CC-BY-NC kāore i te hokohoko rānei). Ka taea e koe te whakamātau i te whakaaetanga o ia tauira i runga i tōna puna GitHub.

He whakaaetanga pūtake tūwhera e whakaae ana ki te whakamahi hokohoko, te whakarerekētanga, me te tuaritanga anō. Ko te Apache 2.0 e tāpiri ana i ngā whakawhiwhinga whakawhiwhinga, ā, e hiahiatia ana kia whakapuakina ngā huringa mēnā ka whakarerekētia e koe te waehere. He ngāwari ake te MIT me ngā whakaritenga iti iho. He ngāwari te hokohoko.

He. Ka taea e ia tauira te whakanoho i a ia anō. Ko te tārua o te puna tauira mai i te GitHub, te whakatū i ngā whakawhirinaki, te whakataki i ngā taumahatanga tauira, me te whakahaere i te whakawāteatanga. Ka homai e tātau ngā tuhinga mō ngā whakaritenga whakanoho anō o ia tauira tae atu ki te GPU, RAM, me te putanga Python.

He rerekē ngā whakaritenga e ai ki te tauira. Kāore e hiahiatia ana e Piper he GPU (CPU anake). E hiahiatia ana e Kokoro me MeloTTS he 1-2GB VRAM. E hiahiatia ana e te nuinga o ngā tauira paerewa he 4GB VRAM. E hiahiatia ana e Tortoise me Sesame CSM he 8GB. Ka taea e NVIDIA RTX 3060 (12GB) te whakahaere i te nuinga o ngā tauira.

He. Ka whakaaetia e ngā raina pūtake tūwhera te whakarerekētanga tae atu ki te whakarerekētanga. Ko ngā tauira pēnei i te GPT-SoVITS me te Bark e whakarato ana i ngā tuhipānui whakarerekētanga. Ka taea e koe te whakaako i ngā tauira i runga i o koe ake ngā raraunga reo hei waihanga i ngā reo motuhake, hei whakapai ake rānei i te mahi mō ngā reo tauwhāiti.

Ko ngā tauira pūtake tūwhera o runga (Kokoro, StyleTTS 2, Chatterbox) ināianei e ōrite ana, e āhei ana rānei ki ngā ratonga hokohoko pēnei i te ElevenLabs me te Google TTS i roto i ngā tohu tika.

Kua tangohia kētia e tātau a rātau. XTTS/XTTS-v2 (Coqui's CPML — kāore i te hokohoko), F5-TTS (CC-BY-NC — kāore i te hokohoko), me Higgs-v2 (License Boson — whakawhāiti). Ka whakamātauria ia tauira i runga i te TTS.ai kia haumaru te whakamahinga hokohoko.

He. Ko te nuinga o ngā tauira e whakaae ana ki ngā koha hapori mā te GitHub. Ka taea e koe te tukuna o ngā pūrongo hapa, ngā pūkete reo mō ngā reo hou, ngā whakapainga waehere, me ngā tuhinga. Tirohia te puna GitHub o ia tauira mō ngā tohutohu koha me ngā take hohe.

Ka kawea ngā tauira i runga i te tono me te whakawātea i te wā e whakawāteatia ai te pūmahara GPU. Ka haere a tātau pūnaha GPU ki ngā tauira 20+ i runga i te 4x Tesla P40 (96GB VRAM katoa) mā te whakamahi i te whakawāteatanga hihiri. Mō te whakangahau, ka taea e te GPU 24GB kotahi te whakawhiwhi i ngā tauira 3-5 i te wā kotahi.

He maha ngā tauira e whakarato ana i ngā whakaahua Docker ā-kāwanatanga, i ngā Dockerfiles rānei. Mō te whakahaere i ngā tauira maha, ka taea e koe te hanga i tētahi whakaritenga Docker ā-kaupapa me te NVIDIA Container Toolkit mō te uru ki te GPU. Ka taea e tātau hanganga pāpāho API te mahi hei taupānga tohutoro.

Ko te nuinga o ngā tauira e hiahiatia ana a Python 3.10-3.12. Ko te Coqui TTS (VITS) e hiahiatia ana i te Python 3.11. E whakaaro ana tātau i te Python 3.12 mō te nuinga o ngā tauira. Tirohia ngā whakaritenga.txt o ia tauira mō te ōritetanga o te putanga tika.

Ināianei, ka whakaaetia e ngā whakaaetanga a MIT me Apache 2.0 te whakamahinga hokohoko. Ka taea e koe te hanga i ngā hua SaaS, ngā taupānga pāpāho, ngā kēmu, me ngā ratonga e whakamahi ana i ēnei tauira me te kore utu whakawhiwhinga, ngā utu whakawhiwhinga, ngā whakaritenga whakawhiwhinga rānei (ahakoa ko te whakawhiwhinga e manakohia ana).
5.0/5 (1)

Whakamātau i te pūtake tūwhera TTS i tēnei rā

24+ ngā tauira pūtake tūwhera, e whakaaetia ana katoa. Ka whakamahia e tātau te API, te kaihautū anō rānei — ko te kōwhiringa ko tātau.