Umbhalo kuya kuSpeech API abathuthukisi

Ukwakha izicelo ezikwazile ukukhuluma nge-REST API yethu. Engeza umbhalo ojwayelekile wokukhuluma, ukuklonya ukukhuluma, ukukhuluma nokubhala, nokucubungula umsindo kuma-apps akho, ama-chatbots, amalungu okukhuluma, nama-SaaS products. OpenAI-compatible format, 20+ models, simple integration.

I-REST API I-Chatbots Izisebenziso zomsindo Imikhiqizo ye SaaS Ukuzenzakalela

Zama manje

Imahhala neKokoro, Piper, VITS, MeloTTS
Umsindo wakho okhiqizwe uzovela lapha
Ikhiqizwe
Uthanda i-TTS.ai? Ncoma abangane bakho!

Izici ze-API ezisetshenziswa abathuthukisi

Konke okudingayo ukuthuthukisa izisebenziso ezikwaziyo ukulalela

I-REST API elula

Isicelo esisodwa se-POST sokwenza ulwimi. Isicelo se-JSON, impendulo yomsindo. Isebenza nganoma iyiphi ulwimi lokudweba oluxhasa i-HTTP.

OpenAI-compatible

I-drop-in replacement for OpenAI TTS API. Switch your base_url and API key — existing code works immediately.

24+ Amamodeli Atholakalayo

Ngena kunoma iyiphi imodeli nge-API eyodwa. Shicilela imodeli ngokuguqula ipharamitha eyodwa. Qaphela ukhwalithi, isivinini, nezindleko.

Isikhathi sokuzimela esingaphansi kwesithathu

I-Kokoro ikhiqiza umsindo ngaphansi kwesekondi eyodwa. Ilungile kuma-chatbots wesikhathi sangempela, abasiza bokukhuluma, namathuluzi axhumanayo.

Uhlu lwezwi

Uhlu lwezinhlamvu ezikhona ezisuka kusampula yomsindo omncane nge-API. Sebenzisa izinhlamvu ezikhona ezikhona kuzo zonke izizukulwane ezizayo.

Ifomati eminingi

I-Output njenge-WAV, MP3, OGG, noma i-FLAC. Khetha isilinganiso sesampula kanye ne-bit depth. Ukusakazwa kwesandi sosizo lwezinhlelo zokusebenza zesikhathi sangempela.

Imodeli engcono kakhulu yokuxhuma umthuthukisi

Khetha imodeli efanele yesicelo sakho sejubane, umgangatho, kanye nezidingo zemali

KokoroKokoro

Free

Lightweight 82M parameter model delivering studio-quality speech with blazing-fast inference.

Fast 5/5

Okungcono kakhulu: Imodeli ehamba ngokushesha kakhulu — isizini esingaphansi, elungele ama-apps ne-chatbots

Zama Kokoro

CosyVoice 2CosyVoice 2

Standard

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Medium 5/5 Ukulungiswa kwezwi

Okungcono kakhulu: Ukusakazwa kwe-TTS ngezwi lokuklonya izisebenziso zomsiza wokukhuluma

Zama CosyVoice 2

Sesame CSMSesame CSM

Premium

Conversational speech model generating natural dialogue with appropriate timing and emotion.

Slow 5/5

Okungcono kakhulu: I-AI yokuxoxa ngesikhathi esijwayelekile se-chatbot nezwi lomsiza

Zama Sesame CSM

PiperPiper

Free

A fast, local neural text to speech system optimized for Raspberry Pi and embedded devices.

Fast 3/5

Okungcono kakhulu: Imodeli emahhala, ye-CPU kuphela yezinhlelo zokusebenza ezinomsindo ophezulu ngezindleko ezingenalutho

Zama Piper

BarkBark

Standard

Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.

Slow 4/5

Okungcono kakhulu: Ukukhishwa komsindo ngemiphumela yomsindo yezinhlelo zokusebenza ezithuthukisayo nezijabulisayo

Zama Bark

Indlela yokuxhuma i-TTS API

Ukusuka ekubhaliseni kuya ku-API yokuqala ukubiza ngezansi kwemizuzu emi-5

1

Thola isithonjana sakho se-API

Ubhalise mahhala futhi ukhiqize isithonjana se-API kusuka ku-akhawunti yakho ye-dashboard. Amaphawu angama-15,000 afakwe.

2

Yenza umlayezo wakho wokuqala

POST ku /v1/tts ngetekisi, imodeli, nezwi. Thola amabhayithi omsindo emuva. Ngemigqa emihlanu yekhodi.

3

Khetha imodeli yakho

Ukuhlolwa kwezinhlobo ezahlukene zesimo sakho sokusetshenziswa. Qaphela ijubane, ukhwalithi, kanye nezindleko ngesigaba ngasinye.

4

I-Ship to Production

Isilinganiso nge-pay-as-you-go characters. Akukho mazinga okungenani emikhankasweni ekhokhelwayo. Bona kusetshenziswa kwi-dashboard yakho.

Izinhlamvu zesiqalo esikhawulelwe

Ihlanganisa TTS.ai nganoma iyiphi ulwimi nge REST API yethu

Python Okuthandwayo
import requests

response = requests.post(
    "https://api.tts.ai/v1/tts",
    json={
        "text": "Hello from my app!",
        "model": "kokoro",
        "voice": "af_heart",
        "format": "mp3"
    },
    headers={
        "Authorization": "Bearer sk-tts-xxx"
    }
)

with open("output.mp3", "wb") as f:
    f.write(response.content)
JavaScript (Node.js) Node.js
const response = await fetch(
    "https://api.tts.ai/v1/tts",
    {
        method: "POST",
        headers: {
            "Content-Type": "application/json",
            "Authorization": "Bearer sk-tts-xxx"
        },
        body: JSON.stringify({
            text: "Hello from my app!",
            model: "kokoro",
            voice: "af_heart",
            format: "mp3"
        })
    }
);

const audio = await response.blob();
cURL I-Universal
curl -X POST https://api.tts.ai/v1/tts \
  -H "Authorization: Bearer sk-tts-xxx" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Hello from my app!",
    "model": "kokoro",
    "voice": "af_heart",
    "format": "mp3"
  }' \
  --output output.mp3
Ifomethi ehambisanayo ne-OpenAI Ukusuka
# Works with OpenAI client library
from openai import OpenAI

client = OpenAI(
    api_key="sk-tts-xxx",
    base_url="https://api.tts.ai/v1"
)

response = client.audio.speech.create(
    model="kokoro",
    voice="af_heart",
    input="Hello from my app!"
)

response.stream_to_file("output.mp3")

Okwenziwa ngabathuthukisi nge-TTS.ai

Izinhlelo zokusebenza ezijwayelekile zokuxhuma

I-AI Chatbots & Assistants

Engeza umsindo ophumayo ku-chatbot yakho noma ku-AI assistant. I-pipe LLM iphendula nge-TTS ye-voice-enabled interfaces. I-Kokoro inikeza i-sub-second latency yezingxoxo zesikhathi sangempela. I-Sesame CSM ikhiqiza umsindo wezingxoxo ngesikhathi sangempela.

  • Uphendulo lwe-LLM ku-speech pipeline
  • Isikhathi sokugcina esingaphansi kwesekondi nge-Kokoro
  • Ukukhuluma uma ukhuluma ngeSesame CSM
  • Ukukhishwa komsindo osakazwayo

Izinhlelo zokusebenza zeselula nezokukhuluma

Ukwakha izicelo zeselula ezikwaziyo ukulalela, amathuluzi okufinyeleleka, izicelo zokufundwa, kanye nezinhlelo zokufunda ulwimi. I-REST API yethu isebenza nanoma iyiphi iphrojekthi yeselula. Layisha phezulu amafayela omsindo noma udlulise ngqo kukhasimende.

  • React Native, Flutter, Swift, Kotlin
  • Ufinyelela nokufundwa kwezinhlelo zokusebenza
  • Izinkundla zokufundela ulwimi
  • Ukukhishwa kwesihloko somsindo

Imikhiqizo ye SaaS

Ikhono lokushicilela umsindo omhlophe-i-label kumkhiqizo wakho we-SaaS. Engeza i-TTS, i-STT, ukuklonya umsindo, nokucubungula umsindo njengezici ze-platform yakho. Sebenzisa i-API yethu njenge-backend yomsindo wakho ngaphandle kokuphatha i-GPU infrastructure.

  • Izici zomsindo we-white-label
  • Akukho sakhiwo se-GPU esidingekayo
  • Ukukhokha ngenqubo yokusetshenziswa
  • 20+ amamodeli ukunikela abasebenzisi bakho

Uhlelo lokuzenzakalela

Ihlanganisa ukukhishwa kwezwi ku-CI/CD pipelines, ukuphathwa kwezinto eziqukethwe, kanye nokuphathwa kwemisebenzi yokuphatha. Yenza amawaka efayela omsindo kusuka kudatha yespreadsheet, ukulawula ukukhishwa kwepodcast, noma ukwakha ukuphathwa kwezinto eziqukethwe.

  • Ukuphathwa kweqembu nge-API
  • Ingxenye yendawo yokuphatha ipayipi
  • Ukuhlanganisa kwe-CI/CD
  • Ispreadsheet yokusebenza ngokuzenzakalela kwesandi

Izinkomba ze-API

Ifakwe izicelo zokukhishwa

20+

Amamodeli we-TTS

100+

Izizwi

30+

Izilimi

<1s

Ukwehla (Kokoro)

Imibuzo ebuzwa kaningi

Imibuzo ejwayelekile mayelana ne-TTS.ai developer API

Yebo. I-API yethu ilandela i-OpenAI audio speech format. Uma usebenzisa i-OpenAI Python noma iJavaScript client library, ungashintsha u-TTS.ai ngokuguqula i-base_url ne-api_key parameters. Ikhodi yakho esekhona isebenza ngaphandle kokuguqulwa.

I-Kokoro ikhiqiza umsindo ngaphansi kwesekondi eyodwa yezilimi ezijwayelekile. I-CosyVoice 2 isekela ukusakazwa kwe-output ukuze kube khona ukubekezelelana okuphansi. I-chatbots ne-voice assistants, isikhathi sokuhamba-ngokugcwele sivame ukuba yisekondi ezingu-1-3 ngokuya ngesikhathi sokubhala kanye nemodeli yokukhethwa.

Amamodeli amahhala (iKokoro, iPiper, iVITS, iMeloTTS) amahhala ngokuphelele. Amamodeli ajwayelekile asebenzisa ama-2x ama-characters nge-1K yombhalo. Amamodeli e-Premium asebenzisa ama-4x ama-characters nge-1K yombhalo. Bhala ngokumahhala nge-15,000 ama-characters. Ama-plans aqala ku- $ 9 / ngenyanga nge-500,000 ama-characters.

Yebo. Layisha phezulu isampula yomsindo (amasekondi angama-5-30) kwindawo yokuqeda ukuklonywa komsindo, bese usebenzisa i-ID yomsindo eklonyelweyo kumacela we-TTS alandelayo. Amamodeli axhasa ukuklonywa kufaka phakathi i-CosyVoice 2, i-Chatterbox, i-Fish Speech, ne-GPT-SoVITS.

Izinga elimahhala linesilinganiso esiyinhloko sokunciphisa (izicelo ezi-3 ngehora ngaphandle kwe-akhawunti). Ama-plans akhokhelwayo anesilinganiso esincane esifanele izicelo zokukhishwa. Xhumana nathi ngezidingo ze-enterprise-level throughput.

WAV (akukho kucindezelwa, izinga eliphakeme), MP3 (kucindezelwe, amafayela amancane), OGG (fomethi evulekile), ne FLAC (ukucindezelwa okungalahleki). Cacisa ifomethi kusicelo sakho. Okuzenzakalelayo yi-WAV kusilinganiso sesampula semodeli.

Yebo. Yenza i-TTS API yethu ibe nemodeli yokukhuluma-nokubhala kanye ne-LLM ukwakha ipayipi eliphelele le-voice assistant. I-Kokoro inikeza i-sub-second latency efanelekayo yokuxoxa ngesikhathi sangempela. I-CosyVoice 2 isekela ukukhishwa kwe-streaming ukuze kube nesikhathi esiphansi sokuphendula.

I-CosyVoice 2 ne-Kokoro zixhasa ukusakazwa kwe-audio lapho ama-chunks we-audio ethunyelwe khona njengoba zikhiqizwa. Lokhu kunciphisa isikhathi-sokuqala-se-bytes zezinhlelo zesikhathi sangempela ezifana nabasebenzi bezwi kanye nezingxoxo.

I-API ibuyisela amakhodi wesimo se-HTTP esijwayelekile. Sebenzisa ukubuyela emuva okuqhubekayo kwephutha le-5xx kanye nemiphumela yokungenamkhawulo. Ukusebenzisa izicelo ezibalulekile, ngeza ifolokhwe ngokuphindaphinda. I-API yethu inesikhathi esiphezulu sokusebenza kodwa ukuphatha iphutha eliqinile kuvame ukukhuthazwa.

Yebo. I /v1/izwi kanye ne /v1/imodeli isigaba sokuphela sibuyisela uhlu lwe-JSON lwazo zonke izizwi ezikhona kanye nemodeli nge-metadata yabo (usizo lwesilimi, izibalo zekhwalithi, izibalo zejubane, kanye nezinga lokukhokha). Sebenzisa lezi ukuze udale izikhethi zemodeli ezinamandla kwisicelo sakho.

Amamodeli amahhala (iKokoro, iPiper, iVITS, iMeloTTS) asebenza njengebhokisi le-sandbox elisebenzayo njengoba limahhala ngokuphelele. Ukuhlola ukuxhumeka kwakho namamodeli amahhala, bese ushintsha kumamodeli aphezulu ekukhiqizeni ngokuguqula ipharamitha yemodeli. Akukho simo sokuhlolwa okuhlukile esidingekayo.

Ezinye zezimodeli zethu zivulekile futhi zingaba ne-hosting. Kodwa-ke, ukuhoxiswa kwe-self-hosting kudinga ama-GPU abalulekile (sisebenzisa i-4x NVIDIA Tesla P40 ne-96GB VRAM ephelele). I-API inikeza indlela engcono kakhulu ngaphandle kokuphathwa kwesakhiwo.
5.0/5 (1)

Yini esingayithuthukisa? Umbono wakho usiza ukuxazulula izinkinga.

Ukulungele ukuthuthukisa ngezwi AI?

Thola isithonjana sakho se-API esimahhala bese uqala ukwakha. 15,000 izibonakaliso zokubhalisa, amamodeli amahhala atholakalayo, incwadi ebanzi.