API Text-to-Speech pou Developers

Kreye aplikasyon ki pèmèt vwa ak API REST nou an. Ajoute tèks-nan-parole natirèl, klonaj vwa, pale-nan-tèks, ak pwosesis odyo nan aplikasyon ou yo, chatbots, asistans vwa, ak pwodwi SaaS. OpenAI-kompatib fòma, 24 + modèl, entègrasyon senp.

REST API Chatbots Aplikasyon Vokal Pwodwi SaaS Automatisation

Tcheke li kounye a

0/500
Gratis ak Kokoro, Piper, VITS, MeloTTS
Your generated audio will appear here
Pwodui
0:00 0:00
Telechaje
Ou renmen TTS.ai? Di zanmi ou yo!

Karakteristik API pou Developers

Tout sa ou bezwen pou bati aplikasyon ki ka pale

Simple REST API

One POST request to generate speech. JSON request, audio response. Works with any programming language that supports HTTP.

Konpatib ak OpenAI

Drop-an ranplasman pou OpenAI TTS API. Switch ou base_url ak kle API - kòd ki egziste deja travay imedyatman.

24+ modèl ki disponib

Accéder chak modèl atravè yon sèl API. Switch modèl pa chanje yon paramèt. Konpare bon jan kalite, vitès, ak pri.

Sub-second Latency

Kokoro jenere odyo nan mwens pase 1 segonn. Perfektè pou chatbots tan reyèl, asistan vwa, ak aplikasyon interactive.

API klonaj vwa

Klone nenpòt vwa soti nan yon echantiyon son kout via API. Itilize vwa klone pou tout jenerasyon kap vini yo.

Divès fòma

Sortie kòm WAV, MP3, OGG, oswa FLAC. Choisir sample rate ak bit profondeur. Streaming audio sipò pou apps tan reyèl.

Pi bon Models pou Developer Entègrasyon

Chwazi modèl la dwa pou aplikasyon w lan

KokoroKokoro

Free

Lightweight 82M parameter model delivering studio-quality speech with blazing-fast inference.

Fast 5/5

Pi bon pou: Pi vit modèl - sub-dezyèm latency, ideyal pou aplikasyon an tan reyèl ak chatbots

Eseye Kokoro

CosyVoice 2CosyVoice 2

Standard

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Medium 5/5 Klonaj Vokal

Pi bon pou: Streaming TTS ak klonaj vwa pou aplikasyon asistan vwa

Eseye CosyVoice 2

Sesame CSMSesame CSM

Premium

Conversational speech model generating natural dialogue with appropriate timing and emotion.

Slow 5/5

Pi bon pou: AI konvèsatif ak tan natirèl pou chatbot ak asistan vwa

Eseye Sesame CSM

PiperPiper

Free

A fast, local neural text to speech system optimized for Raspberry Pi and embedded devices.

Fast 3/5

Pi bon pou: Free, CPU-only model for high-volume applications with zero credit cost

Eseye Piper

BarkBark

Standard

Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.

Slow 4/5

Pi bon pou: Kreyasyon son ak efè son pou aplikasyon kreyatif ak distraksyon

Eseye Bark

Kijan Pou Entègrasyon TTS API

Soti nan enskripsyon pou premye apèl API nan mwens pase 5 minit

1

Jwenn Chèn API ou

Enskri pou gratis epi jenere yon kle API soti nan tablodbò kont ou. 50 kredi enkli.

2

Fè premye apèl ou

POST to /v1/tts with text, model, and voice. Get audio bytes back. Under 5 lines of code.

3

Chwazi modèl ou

Teste modèl diferan pou ka ou itilize. Konpare vitès, kalite, ak pri pou chak jenerasyon.

4

Ship to Production

Scale ak pay-as-you-go kredi. Pa gen limit pousantaj sou plan peye. Monitè utilisation nan tablodbò ou.

Quick Start Kòd Egzamen

Intégrer TTS.ai nan nenpòt lang ak REST API nou an

Python Populè
import requests

response = requests.post(
    "https://api.tts.ai/v1/tts",
    json={
        "text": "Hello from my app!",
        "model": "kokoro",
        "voice": "af_heart",
        "format": "mp3"
    },
    headers={
        "Authorization": "Bearer sk-tts-xxx"
    }
)

with open("output.mp3", "wb") as f:
    f.write(response.content)
JavaScript (Node.js) Node.js
const response = await fetch(
    "https://api.tts.ai/v1/tts",
    {
        method: "POST",
        headers: {
            "Content-Type": "application/json",
            "Authorization": "Bearer sk-tts-xxx"
        },
        body: JSON.stringify({
            text: "Hello from my app!",
            model: "kokoro",
            voice: "af_heart",
            format: "mp3"
        })
    }
);

const audio = await response.blob();
cURL Univèsèl
curl -X POST https://api.tts.ai/v1/tts \
  -H "Authorization: Bearer sk-tts-xxx" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Hello from my app!",
    "model": "kokoro",
    "voice": "af_heart",
    "format": "mp3"
  }' \
  --output output.mp3
OpenAI-kompatib fòma Drop-in
# Works with OpenAI client library
from openai import OpenAI

client = OpenAI(
    api_key="sk-tts-xxx",
    base_url="https://api.tts.ai/v1"
)

response = client.audio.speech.create(
    model="kokoro",
    voice="af_heart",
    input="Hello from my app!"
)

response.stream_to_file("output.mp3")

Ki sa ki Developers bati ak TTS.ai

Modèl ak aplikasyon pou integrasyon komen

AI Chatbots & Asistans

Ajoute pwodiksyon vwa a chatbot ou a oswa asistan AI. Pipe repons LLM via TTS pou entèfas ki pèmèt vwa. Kokoro bay sub-dezyèm latency pou konvèsasyon an tan reyèl. Sesame CSM jenere pale konvèsasyon ak tan natirèl.

  • LLM response to speech pipelineComment
  • Sub-second latency with Kokoro
  • Konvèsasyon ak Sesame CSM
  • Streaming audio output

Aplikasyon mobil ak vwa

Kreye aplikasyon mobil ki pèmèt vwa, zouti aksè, aplikasyon lekti, ak platfòm pou aprann lang. REST API nou an travay ak nenpòt framework mobil.Téléchargez fichiers audio ou stream dirèkteman nan kliyan an.

  • Reaksyon natif natal, Flutter, Swift, Kotlin
  • Aplikasyon aksè ak lekti
  • Platfòm pou aprann lang
  • Kreyasyon kontni odyo

Pwodwi SaaS

Ajoute TTS, STT, klonaj vwa, ak pwosesis odyo kòm karakteristik nan platfòm ou. Itilize API nou an kòm backend vwa ou san yo pa jere enfrastrikti GPU.

  • Fonksyonèlite vwa étiquettes blan
  • Pa gen enfrastrikti GPU nesesè
  • Pay-per-use pri
  • 24 + modèl yo ofri itilizatè ou yo

Automation Pipelines

Entègrasyon jenerasyon vwa nan pipelines CI / CD, automatisation kontni, ak batch workflows pwosesis.Jenerasyon milye de dosye odyo soti nan done spreadsheet, automatisation pwodiksyon podcast, oswa bati pipelines lokalizasyon kontni.

  • Pwosesis batch via API
  • Konpayi lokalizasyon kontni
  • Integrasyon CI/CD
  • Spreadsheet to audio automation

Espesifikasyon API

Konpoze pou aplikasyon pou pwodiksyon

24+

Modèles TTS

100+

Vokal

30+

Lang

<1s

Latency (Kokoro)

Kesyon ki poze souvan

Kesyon komen sou TTS.ai Developer API

Yes. Our API follows the OpenAI audio speech format. If you are using the OpenAI Python or JavaScript client library, you can switch to TTS.ai by changing the base_url and api_key parameters. Your existing code works without modification.

Kokoro bay son an nan mwens pase 1 segonn pou fraz tipik. CosyVoice 2 sipòte pwodiksyon streaming pou menm pi ba latency pèsepte. Pou chatbots ak asistans vwa, tan total de toune a se anjeneral 1-3 segonn depann de longè tèks la ak chwa modèl.

Modèles gratis (Kokoro, Piper, VITS, MeloTTS) coûtent zéro crédits. Modèles standard coûtent 2 crédits par 1,000 caractères. Modèles Premium coûtent 4 crédits par 1,000 caractères. Inscrivez-vous gratuitement avec 50 crédits. Plans commencent à $9/mois pour 500 crédits.

Wi. Telechaje yon echantiyon son referans (5-30 segonn) nan pwent fen klonaj vwa a, lè sa a itilize ID vwa klone a nan demann TTS ki vin apre yo. Modèles ki sipòte klonaj gen ladan CosyVoice 2, Chatterbox, Fish Speech, ak GPT-SoVITS.

Nivo gratis gen limit debaz (3 demann pou chak èdtan san yon kont). Plan ki peye yo gen limit pousantaj generous ki apwopriye pou aplikasyon pou pwodiksyon.Kontakte nou pou kondisyon depase nivo enterè.

WAV (pa-kouchye, pi bon kalite), MP3 (kouchye, fichye pi piti), OGG (format louvri), ak FLAC (kouchye san pèt). Espesifike fòma a nan demann ou an. Par défaut se WAV ak frekans sampling natif modèl la.

Wi. Konbine TTS API nou an ak yon modèl pale-a-tèks ak yon LLM pou konstwi yon pipeline asistans vwa konplè. Kokoro bay yon latens sub-segondè ideyal pou konvèsasyon an tan reyèl. CosyVoice 2 sipòte pwodiksyon streaming pou tan repons ki pi ba.

CosyVoice 2 ak Kokoro sipòte emisyon odyo sou entènèt kote moso odyo yo bay lè yo kreye. Sa diminye tan pou premye byte pou aplikasyon tan reyèl tankou asistan vwa ak eksperyans entèaktif.

API a retounen kòd estati HTTP standard. Implement exponential backoff pou erè 5xx ak repons limit pousantaj. Pou aplikasyon misyon-kritik, ajoute yon queue ak retry logic. Our API has high uptime but resilient error handling is always recommended.

Yes. The /v1/voices and /v1/models endpoints return JSON lists of all available voices and models with their metadata (language support, quality ratings, speed ratings, and pricing tier). Use these to build dynamic model selectors in your application.

Modèl gratis (Kokoro, Piper, VITS, MeloTTS) sèvi kòm yon sandbox efikas depi yo koute zewo kredi. Teste entègrasyon ou ak modèl gratis, Lè sa a, chanje nan modèl premium nan pwodiksyon an pa chanje paramèt modèl la. Pa gen okenn anviwònman tès separe ki nesesè.

Pifò nan modèl nou yo gen sous louvri e yo ka òganize tèt yo. Sepandan, òganize tèt yo mande pou resous GPU enpòtan (nou itilize 4x NVIDIA Tesla P40 ak 96GB VRAM total).
5.0/5 (1)

Èske w pare pou konstwi ak Voye AI?

50 kredi sou enskripsyon, modèl gratis ki disponib, dokimantasyon konplè, 24/7 sipò, 24/7 sipò, 24/7 sipò.