Text to Speech API for Developers

Build voice-enabled applications with our REST API. Add text-to-speech, voice cloning, speech-to-text, and audio processing to your apps, chatbots, voice assistants, and SaaS products. OpenAI-compatible format, 20+ models, easy integration.

REST API, Chatbots, Voice Apps, SaaS Products, Automation

Try it now

Free with Kokoro, Piper, VITS, MeloTTS

API Features for Developers

Everything you need to build voice-enabled applications

Simple REST API

One POST request to generate speech. JSON request, audio response. Works with any programming language that supports HTTP.

OpenAI-Compatible

Drop-in replacement for the OpenAI TTS API. Switch your base_url and API key, and your existing code works right away.

24+ Models Available

Access any model through one API. Switch models by changing a single parameter. Compare quality, speed, and cost.

Sub-second Latency

Kokoro generates audio in under 1 second.

Voice Cloning API

Clone any voice from a short audio sample through the API. Reuse the cloned voice for all subsequent generations.

Multiple Formats

Output as WAV, MP3, OGG, or FLAC. Choose sample rate and bit depth. Streaming audio support for real-time apps.

Best Models for Developer Integration

Choose the right model for your application's speed, quality, and cost requirements

Kokoro

Free

Lightweight 82M parameter model delivering studio-quality speech with blazing-fast inference.

Fast 5/5

Best for: Fastest model, with sub-second latency, ideal for real-time apps and chatbots

Try Kokoro

CosyVoice 2

Standard

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Medium 5/5 Voice Cloning

Best for: Streaming TTS with voice cloning for voice assistant applications

Try CosyVoice 2

Sesame CSM

Premium

Conversational speech model generating natural dialogue with appropriate timing and emotion.

Slow 5/5

Best for: Conversational AI with natural timing for chatbot and assistant voices

Try Sesame CSM

Piper

Free

A fast, local neural text to speech system optimized for Raspberry Pi and embedded devices.

Fast 3/5

Best for: Free, CPU-only model for high-volume applications with zero cost

Try Piper

Bark

Standard

Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.

Slow 4/5

Best for: Audio generation beyond speech for creative and gaming applications

Try Bark

How to Integrate the TTS API

From your first API call to production in four steps

1

Get Your API Key

Sign up for free and generate an API key from your account dashboard. 15,000 characters included.

2

Make Your First Request

POST to /v1/tts with your text, model, and voice. Get audio bytes back. Under 5 lines of code.

3

Choose Your Model

Test different models for your use case and compare latency, quality, and cost.

4

Ship to Production

Scale with pay-as-you-go pricing. No restrictions on paid plans. Monitor usage in your dashboard.

Quick Start Code Examples

Integrate TTS.ai in any language with our REST API

Python (Popular)
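import requests

# Request body: the text to speak plus model, voice, and output format.
response = requests.post(
    "https://api.tts.ai/v1/tts",
    json={
        "text": "Hello from my app!",
        "model": "kokoro",
        "voice": "af_heart",
        "format": "mp3"
    },
    headers={
        "Authorization": "Bearer sk-tts-xxx"  # replace with your API key
    }
)
# Fail on 4xx/5xx instead of writing an error body to the audio file.
response.raise_for_status()

# The response body is the raw audio.
with open("output.mp3", "wb") as f:
    f.write(response.content)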
JavaScript (Node.js)
const response = await fetch(
    "https://api.tts.ai/v1/tts",
    {
        method: "POST",
        headers: {
            "Content-Type": "application/json",
            "Authorization": "Bearer sk-tts-xxx"
        },
        body: JSON.stringify({
            text: "Hello from my app!",
            model: "kokoro",
            voice: "af_heart",
            format: "mp3"
        })
    }
);

const audio = await response.blob();
cURL (Universal)
curl -X POST https://api.tts.ai/v1/tts \
  -H "Authorization: Bearer sk-tts-xxx" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Hello from my app!",
    "model": "kokoro",
    "voice": "af_heart",
    "format": "mp3"
  }' \
  --output output.mp3
OpenAI-Compatible Format (Drop-in)
# Works with OpenAI client library
from openai import OpenAI

client = OpenAI(
    api_key="sk-tts-xxx",
    base_url="https://api.tts.ai/v1"
)

response = client.audio.speech.create(
    model="kokoro",
    voice="af_heart",
    input="Hello from my app!"
)

response.stream_to_file("output.mp3")

What Developers Build with TTS.ai

Common integration patterns and applications

AI Chatbots & Assistants

Kokoro delivers sub-second latency for real-time interaction. Sesame CSM generates conversational speech with natural timing.

  • Speech output for LLM responses
  • Sub-second latency with Kokoro
  • Conversational speech with Sesame CSM
  • Streaming audio output
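
One common way to keep a voice chatbot responsive is to split the LLM reply into sentences and synthesize them one at a time, so playback can begin before the whole reply has been rendered. The sketch below covers only the splitting step. `split_into_sentences` is a hypothetical helper, and the regex is a deliberate simplification that does not handle abbreviations such as "e.g.".

```python
import re


def split_into_sentences(text):
    """Split text after '.', '!' or '?' followed by whitespace (naive heuristic)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [part for part in parts if part]


reply = "Sure, I can help. What city are you in? I'll check the weather!"
for sentence in split_into_sentences(reply):
    # Each sentence would become the "text" field of one /v1/tts request.
    print(sentence)
```

Shorter requests tend to return sooner, at the cost of more round trips; whether that trade is worthwhile depends on the model and the network.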

Mobile & Voice Apps

Build mobile apps with voice interaction, accessibility tools, reading apps, and language learning platforms. Our REST API works with any mobile framework.

  • React Native, Flutter, Swift, Kotlin
  • Accessibility apps for visually impaired users
  • Language learning platforms
  • Audio content creation

SaaS Products

White-label voice features in your SaaS product. Add TTS, STT, voice cloning, and audio processing as features in your platform. Use our API as the voice backend without managing GPU infrastructure.

  • White-label voice features
  • No GPU infrastructure required
  • Pay-per-use pricing
  • 20+ models to offer your users

Automation Pipelines

Integrate voice generation into CI/CD pipelines, content automation, and batch processing workflows. Generate audio files in bulk from spreadsheet data, automate podcast production, or build content localization pipelines.

  • Batch processing through the API
  • Content localization pipelines
  • CI/CD integration
  • Spreadsheet-to-audio automation
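
A spreadsheet-to-audio job can be sketched as reading rows from a CSV export and turning each row into one request body. The column names (`filename`, `text`) and the `rows_to_jobs` helper are assumptions made for illustration; the payload fields mirror the Quick Start examples.

```python
import csv
import io


def rows_to_jobs(csv_text, model="kokoro", voice="af_heart", audio_format="mp3"):
    """Yield (output_filename, payload) pairs, skipping rows with empty text."""
    for row in csv.DictReader(io.StringIO(csv_text)):
        text = (row.get("text") or "").strip()
        if not text:
            continue
        payload = {"text": text, "model": model, "voice": voice, "format": audio_format}
        yield f"{row['filename']}.{audio_format}", payload


sample = "filename,text\nintro,Welcome to the show.\nblank,\noutro,Thanks for listening.\n"
jobs = list(rows_to_jobs(sample))
for name, payload in jobs:
    # Each payload would be sent as the JSON body of one POST to /v1/tts.
    print(name, "->", payload["text"])
```

Keeping payload construction separate from sending makes it easy to dry-run a batch before spending any characters.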

API Specifications

Built for production applications

20+

TTS Models

100+

Voices

30+

Languages

<1s

Latency (Kokoro)

Frequently Asked Questions

Common questions about the TTS.ai developer API

Yes. Our API follows the OpenAI audio speech format. If you use the OpenAI Python or JavaScript client library, you can switch to TTS.ai by changing the base_url and api_key parameters. Your existing code works without further changes.

Kokoro generates speech in under 1 second for typical sentences. CosyVoice 2 supports streaming output for lower perceived latency. For chatbots and voice assistants, total round-trip time is typically 1-3 seconds depending on text length and model choice.

Free models (Kokoro, Piper, VITS, MeloTTS) are completely free. Standard models use 2x characters per 1K of text. Premium models use 4x characters per 1K of text. Sign up for free with 15,000 characters. Plans start at $9/month for 500,000 characters.
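
As a rough worked example of the multipliers above, the sketch below assumes that free models consume no quota, standard models consume 2 quota characters per input character, and premium models consume 4. That reading of "2x" and "4x" is an assumption, and the function names are hypothetical.

```python
# Assumed multipliers: quota characters consumed per character of input text.
TIER_MULTIPLIER = {"free": 0, "standard": 2, "premium": 4}


def quota_used(text_length, tier):
    """Return the number of quota characters a request would consume."""
    return text_length * TIER_MULTIPLIER[tier]


def max_text_length(quota, tier):
    """Return how many input characters fit in the quota (None if none is consumed)."""
    multiplier = TIER_MULTIPLIER[tier]
    return None if multiplier == 0 else quota // multiplier


print(quota_used(1_000, "standard"))       # 2000
print(quota_used(1_000, "premium"))        # 4000
print(max_text_length(15_000, "premium"))  # 3750
```

Under these assumptions, the 15,000 signup characters would cover about 7,500 characters of text on a standard model or 3,750 on a premium one.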

Yes. Upload a reference audio sample (5-30 seconds) to the voice cloning endpoint, then use the cloned voice ID in subsequent TTS requests. Models that support cloning include CosyVoice 2, Chatterbox, Fish Speech, and GPT-SoVITS.
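
Before uploading, it can help to check locally that a reference sample falls inside the 5-30 second window. The sketch below does this for WAV input with Python's standard `wave` module and then builds a TTS request body that uses the returned voice ID. The helper names, the `cosyvoice2` model identifier, and the idea that the cloned ID goes in the `voice` field are all assumptions; the cloning endpoint path is not given here, so the upload itself is left out.

```python
import io
import wave

MIN_SECONDS, MAX_SECONDS = 5, 30  # range stated in the answer above


def wav_duration_seconds(wav_bytes):
    """Return the duration of a WAV file given as bytes."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_valid_reference(wav_bytes):
    """Check that a reference sample is between 5 and 30 seconds long."""
    return MIN_SECONDS <= wav_duration_seconds(wav_bytes) <= MAX_SECONDS


def cloned_voice_payload(text, voice_id, model="cosyvoice2"):
    """Build a /v1/tts body using a cloned voice ID (field usage is assumed)."""
    return {"text": text, "model": model, "voice": voice_id, "format": "mp3"}
```

A typical flow would be: validate the sample, upload it, store the voice ID from the response, then call `cloned_voice_payload` for each later request.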

The free tier has basic rate limiting (3 requests per hour without an account). Paid plans have generous rate limits suitable for production applications.

WAV (uncompressed, highest quality), MP3 (compressed, smaller file sizes), OGG (open format), and FLAC (lossless compression). Specify the format in your request. The default is WAV at the model's native sample rate.
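
A small helper can keep the requested format, the output filename, and the media type in step. This is only a sketch: `describe_output` is a hypothetical name, and the MIME types are the ones commonly used for these containers, not values confirmed for this API's responses.

```python
# Commonly used MIME types for the four formats listed above.
SUPPORTED_FORMATS = {
    "wav": "audio/wav",
    "mp3": "audio/mpeg",
    "ogg": "audio/ogg",
    "flac": "audio/flac",
}


def describe_output(stem, audio_format="wav"):
    """Return (filename, mime_type) for a format, rejecting unsupported ones."""
    fmt = audio_format.lower()
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError(f"Unsupported format: {audio_format!r}")
    return f"{stem}.{fmt}", SUPPORTED_FORMATS[fmt]


print(describe_output("greeting", "mp3"))  # ('greeting.mp3', 'audio/mpeg')
```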

Yes. Combine our TTS API with a speech-to-text model and an LLM to build a complete voice assistant pipeline. Kokoro provides sub-second latency ideal for real-time conversation. CosyVoice 2 supports streaming output for even lower perceived response times.

CosyVoice 2 and Kokoro support streaming audio output, where audio chunks are delivered as they are generated. This reduces time-to-first-byte for real-time applications such as voice assistants and interactive experiences.
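
On the client side, consuming a streamed response comes down to handling each chunk as it arrives instead of waiting for the full body. The sketch below writes chunks to disk from any iterable of bytes; `write_chunks` is a hypothetical helper, and the fake stream stands in for the network.

```python
import os
import tempfile


def write_chunks(chunks, path):
    """Write audio chunks to disk as they arrive; return total bytes written."""
    total = 0
    with open(path, "wb") as f:
        for chunk in chunks:
            if chunk:  # skip empty keep-alive chunks
                f.write(chunk)
                total += len(chunk)
    return total


# With the requests library, the chunks could come from a streamed response:
#   resp = requests.post(url, json=payload, headers=headers, stream=True)
#   write_chunks(resp.iter_content(chunk_size=4096), "output.mp3")
# Whether the API needs an extra request field to enable streaming is not
# stated here, so check the API reference.

fake_stream = iter([b"abc", b"", b"defg"])  # stand-in for network chunks
demo_path = os.path.join(tempfile.gettempdir(), "tts_stream_demo.bin")
print(write_chunks(fake_stream, demo_path))  # 7
```

For live playback, the same loop would feed an audio player's buffer instead of a file.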

The API returns standard HTTP status codes. Use exponential backoff for 5xx errors and respect rate-limit responses. For mission-critical applications, add a queue with retry logic. Our API has high uptime, but resilient error handling is still recommended.
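
A minimal sketch of exponential backoff with jitter follows. It assumes that rate limiting is signalled with HTTP 429 (the standard status for too many requests, not confirmed for this API) and takes the request as a callable so the retry logic can be exercised without a network. `call_with_backoff` is a hypothetical helper.

```python
import random
import time


def call_with_backoff(send, max_attempts=5, base_delay=0.5, sleep=time.sleep):
    """Call send() until it returns a non-retryable status or attempts run out.

    send() must return (status_code, body). 5xx and 429 responses are retried.
    """
    for attempt in range(max_attempts):
        status, body = send()
        retryable = status == 429 or 500 <= status < 600
        if not retryable or attempt == max_attempts - 1:
            return status, body
        # Delay doubles on each attempt; jitter avoids synchronized retries.
        sleep(base_delay * (2 ** attempt) + random.uniform(0, base_delay))


# Simulated responses: two transient failures, then success.
responses = iter([(503, b""), (429, b""), (200, b"audio-bytes")])
status, body = call_with_backoff(lambda: next(responses), sleep=lambda s: None)
print(status)  # 200
```

In real use, `send` would wrap the POST to /v1/tts and return `(response.status_code, response.content)`.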

Yes. The /v1/voices and /v1/models endpoints return JSON lists of all available voices and models with their metadata (language support, quality ratings, speed ratings, and pricing tier). Use these to build dynamic model selectors in your application.
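
A dynamic model selector could filter the decoded /v1/models list along these lines. The field names (`id`, `tier`, `quality`) are a hypothetical response shape, not the documented schema; the sample values follow the model cards earlier on this page.

```python
def pick_models(models, tier=None, min_quality=0):
    """Filter a decoded /v1/models list by pricing tier and quality rating."""
    return [
        m["id"]
        for m in models
        if (tier is None or m.get("tier") == tier)
        and m.get("quality", 0) >= min_quality
    ]


# Hypothetical response shape, used only for illustration.
sample_models = [
    {"id": "kokoro", "tier": "free", "quality": 5},
    {"id": "piper", "tier": "free", "quality": 3},
    {"id": "bark", "tier": "standard", "quality": 4},
]
print(pick_models(sample_models, tier="free", min_quality=4))  # ['kokoro']
```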

Free models (Kokoro, Piper, VITS, MeloTTS) serve as a natural sandbox because they cost nothing. Test your integration with free models, then switch to premium models in production by changing the model parameter. No separate test environment is needed.
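
One way to make that switch is to choose the model from configuration rather than hard-coding it. In this sketch the `APP_ENV` variable, the `model_for_environment` helper, and the `sesame-csm` identifier are all assumptions; only `kokoro` appears in the Quick Start examples.

```python
import os

# Assumed mapping; "sesame-csm" is an illustrative ID, not a confirmed one.
MODEL_BY_ENV = {"development": "kokoro", "production": "sesame-csm"}


def model_for_environment(env=None):
    """Pick the TTS model for an environment, defaulting to the free model."""
    env = env or os.environ.get("APP_ENV", "development")
    return MODEL_BY_ENV.get(env, "kokoro")


payload = {"text": "Hello from my app!", "model": model_for_environment("production")}
print(payload["model"])  # sesame-csm
```

Falling back to the free model for unknown environments means a misconfigured deployment never spends paid characters by accident.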

Most of our models are open source and can be self-hosted. However, self-hosting requires substantial GPU resources (we use 4x NVIDIA Tesla P40 with 96GB of VRAM in total).

Ready to Build with Voice AI?

15,000 characters on signup, free models available, comprehensive documentation. Get your free API key and start building.