API Documentation

Integrate TTS.ai into your applications with our REST API. The OpenAI-compatible format makes migration easy.

REST API, OpenAI-compatible, JSON responses, streaming support

Overview

The TTS.ai API provides programmatic access to all platform features: text-to-speech, speech-to-text, voice cloning, audio enhancement, and more. The API follows standard REST conventions with JSON request and response bodies.

Get Started

Get your API key from Account Settings. Available on Pro and Enterprise plans.

Base URL

https://api.tts.ai/v1/

Authentication

Bearer token in the Authorization header

Authentication

Free tier: no API key required. Anonymous POSTs to /v1/tts/ work without credentials, up to 5,000 characters per day per IP, using one of our free models (piper, vits, melotts, kokoro). Sign up for a free account to get 15,000 bonus characters and access to premium models.

For premium models and higher rate limits, authenticate with a Bearer token in the Authorization header.

HTTP header
Authorization: Bearer sk-tts-your-api-key-here

Keep your API key secret. Do not expose it in client-side code, public repositories, or logs. Rotate your key regularly from your account settings.
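
One way to keep the key out of source code is to read it from the environment when building request headers. The sketch below is illustrative only: the variable name TTS_AI_API_KEY and the helper auth_headers are assumptions, not part of the API. When no key is available it omits the Authorization header, which matches the anonymous free-tier behavior described above.

```python
import os


def auth_headers(api_key=None):
    """Build JSON request headers, adding a Bearer token when a key is available."""
    # TTS_AI_API_KEY is a hypothetical environment variable name.
    key = api_key or os.environ.get("TTS_AI_API_KEY")
    headers = {"Content-Type": "application/json"}
    if key:
        headers["Authorization"] = f"Bearer {key}"
    return headers
```

The returned dictionary can be passed as the headers argument of whichever HTTP client you use.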

SDKs

Official SDKs make it easy to integrate TTS.ai into your application. Both are open source on GitHub.

Python

pip install ttsai
from tts_ai import TTSClient

client = TTSClient(api_key="sk-tts-...")
audio = client.generate(
    text="Hello world!",
    model="kokoro"
)
client.save(audio, "output.wav")

JavaScript / Node.js

npm install @ttsainpm/ttsai
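const { TTSClient } = require('@ttsainpm/ttsai');

const client = new TTSClient({
  apiKey: 'sk-tts-...'
});

// CommonJS modules do not allow top-level await, so wrap the calls.
async function main() {
  const audio = await client.generate({
    input: 'Hello world!',
    model: 'kokoro'
  });
  await client.saveToFile(audio, 'output.wav');
}

main().catch(console.error);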

Base URL

Base URL: https://api.tts.ai/v1/

All endpoints are relative to this URL. For example, the TTS endpoint is:

POST https://api.tts.ai/v1/tts/

Rate Limits

API rate limits vary by plan:

Plan | Requests/minute | Concurrent | Max text length
Free | 10 | 2 | 500 characters
Starter | 30 | 3 | 1,000,000 characters
Pro | 60 | 5 | 1,000,000 characters
Enterprise | 300 | 20 | 50,000 characters

Rate limit headers are included in every response: X-RateLimit-Limit, X-RateLimit-Remaining, X-RateLimit-Reset.
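
A client can use these headers to pause before it hits the limit. The following is a minimal sketch, not documented behavior: it assumes X-RateLimit-Reset is a Unix timestamp in seconds, which the text above does not confirm, and seconds_until_retry is a hypothetical helper name.

```python
import time


def seconds_until_retry(headers, now=None):
    """Return how long to wait before sending the next request."""
    remaining = int(headers.get("X-RateLimit-Remaining", "1"))
    if remaining > 0:
        return 0.0
    # Assumption: X-RateLimit-Reset is a Unix timestamp in seconds.
    reset_at = float(headers.get("X-RateLimit-Reset", "0"))
    current = time.time() if now is None else now
    return max(0.0, reset_at - current)
```

If the reset header turns out to be a number of seconds remaining rather than a timestamp, return reset_at directly instead of subtracting the current time.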

Credit Costs

Service | Cost | Unit
TTS (free models: Piper, VITS, MeloTTS) | 1,000 characters | per 1,000 characters
TTS (standard models: Kokoro, CosyVoice 2, etc.) | 2,000 characters | per 1,000 characters
TTS (premium models: Tortoise, Chatterbox, etc.) | 4,000 characters | per 1,000 characters
Speech to Text | 2,000 characters | per minute of audio
Voice Cloning | 4,000 characters | per 1,000 characters
Voice Conversion | 3,000 characters | per minute of audio
Audio Enhancement | 2,000 characters | per minute of audio
Vocal Removal / Stem Splitting | 3,000-4,000 characters | per minute of audio
Speech Translation | 5,000 characters | per minute of audio
Voice Chat | 3,000 characters | per turn
Key & BPM Finder | Free | --
Audio Converter | Free | --

Text to Speech

POST /v1/tts/

Convert text to speech audio. Returns the audio file in the requested format.

Request Parameters

Parameter | Type | Required | Description
model | string | No | Model ID (e.g., kokoro, chatterbox, piper). If omitted, a model that supports the requested language is selected automatically: kokoro for en/ja/zh/ko/fr/de/it/pt/es/hi/ru, piper for the other supported languages (ar/pl/nl/cs/da/fi/el/hu/tr/uk/vi/etc.).
text | string | Yes | Text to convert to speech. Per-request cap: 500 characters (anonymous), 5,000 (free tier), 1,000,000 (paid plans). Long inputs are auto-chunked server-side.
voice | string | Yes | Voice ID (use /v1/voices/ to list available voices)
format | string | No | Output format: mp3 (default), wav, flac, ogg
speed | float | No | Speech speed multiplier. Default: 1.0. Range: 0.5 to 2.0
language | string | No | Language code (e.g., en, es). Auto-detected if omitted.
instructions | string | No | Style / delivery cues (≤500 characters). e.g. \
pronunciations | object or array | No | Per-request pronunciation overrides. Either {\
stream | boolean | No | Enable streaming response. Default: false
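
Checking a request against the documented ranges before sending it avoids spending a round trip on an invalid payload. The sketch below encodes only the limits listed in the table (formats, speed range, instructions length); build_tts_payload is a hypothetical helper, and the server remains the final authority on validation.

```python
ALLOWED_FORMATS = {"mp3", "wav", "flac", "ogg"}


def build_tts_payload(text, voice, model=None, audio_format="mp3", speed=1.0,
                      language=None, instructions=None, stream=False):
    """Validate arguments against the documented limits and return a JSON-ready dict."""
    if not text:
        raise ValueError("text is required")
    if not voice:
        raise ValueError("voice is required")
    if audio_format not in ALLOWED_FORMATS:
        raise ValueError(f"unsupported format: {audio_format}")
    if not 0.5 <= speed <= 2.0:
        raise ValueError("speed must be between 0.5 and 2.0")
    if instructions is not None and len(instructions) > 500:
        raise ValueError("instructions must be at most 500 characters")

    payload = {"text": text, "voice": voice, "format": audio_format,
               "speed": speed, "stream": stream}
    # Optional fields are left out entirely so server-side defaults apply.
    for key, value in (("model", model), ("language", language),
                       ("instructions", instructions)):
        if value is not None:
            payload[key] = value
    return payload
```

Leaving model and language out of the payload lets the automatic model selection and language detection described in the table take effect.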

Example Request

cURL
curl -X POST https://api.tts.ai/v1/tts/ \
  -H "Authorization: Bearer sk-tts-your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kokoro",
    "text": "Hello from TTS.ai! This is a test.",
    "voice": "af_bella",
    "format": "mp3"
  }' \
  --output output.mp3

SSML Tags

Wrap numbers, dates, currency amounts, phone numbers, and acronyms in <say-as interpret-as="..."> tags.

interpret-as | Input | Spoken as
cardinal | 1234 | one thousand two hundred thirty-four
ordinal | 21 | twenty-first
date | 1999-12-31 | December thirty-first, nineteen ninety-nine
time | 14:30 | two thirty PM
telephone | +1-555-867-5309 | plus one five five five eight six seven…
currency | $1,234.56 | one thousand two hundred thirty-four dollars and fifty-six cents
spell-out | NASA | N A S A

The default date format is mdy for English and dmy elsewhere; override it with the format attribute.

Example
{
  "model": "kokoro",
  "voice": "af_bella",
  "text": "Your appointment is on <say-as interpret-as=\"date\">2026-04-26</say-as> at <say-as interpret-as=\"time\">14:30</say-as>. Please call <say-as interpret-as=\"telephone\">+1-555-867-5309</say-as> if you need to reschedule."
}
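
Building these tags by hand is error-prone once values contain characters such as & or <. The helper below is a small sketch that wraps a value in a say-as tag and XML-escapes it. The name say_as is hypothetical, and whether the API expects escaped entities inside the text field is an assumption.

```python
from xml.sax.saxutils import escape, quoteattr

# Values taken from the interpret-as table above.
INTERPRET_AS = {"cardinal", "ordinal", "date", "time",
                "telephone", "currency", "spell-out"}


def say_as(value, interpret_as):
    """Wrap value in a <say-as> tag, escaping XML special characters."""
    if interpret_as not in INTERPRET_AS:
        raise ValueError(f"unknown interpret-as value: {interpret_as}")
    return f"<say-as interpret-as={quoteattr(interpret_as)}>{escape(value)}</say-as>"


text = f"Your appointment is on {say_as('2026-04-26', 'date')} at {say_as('14:30', 'time')}."
```

Serializing the payload with json.dumps then takes care of escaping the double quotes, as in the example request body above.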

Response

The TTS endpoint queues your request and returns a JSON response with a job UUID. You then poll for the result.

Step 1: Submit request

Response (JSON)
{
  "uuid": "77b71db532874ce98e84a69a2d740d4c",
  "job_id": "f21316bb-aefa-480d-8523-701d1e3184ce",
  "status": "queued",
  "credits_used": 11,
  "credits_remaining": 15000
}

Step 2: Poll for result

GET /v1/speech/results/?uuid=<job_uuid>

Poll this endpoint every 1-2 seconds until status is completed or failed.

Polling response (completed)
{
  "status": "completed",
  "result_url": "https://api.tts.ai/static/downloads/77b71db5.../output.mp3"
}
Polling response (still processing)
{
  "status": "processing"
}

Step 3: Download audio

Fetch the result_url from the completed response to download the audio file.

Full example

Python
import requests, time

API_KEY = "sk-tts-your-key"
BASE = "https://api.tts.ai"

# 1. Submit TTS request
resp = requests.post(f"{BASE}/v1/tts/", json={
    "model": "kokoro",
    "text": "Hello from TTS.ai!",
    "voice": "af_bella"
}, headers={"Authorization": f"Bearer {API_KEY}"})
resp.raise_for_status()  # stop early on HTTP error responses
data = resp.json()
uuid = data["uuid"]

# 2. Poll for result
while True:
    result = requests.get(f"{BASE}/v1/speech/results/",
        params={"uuid": uuid}).json()
    if result["status"] == "completed":
        # 3. Download audio
        audio = requests.get(result["result_url"])
        with open("output.mp3", "wb") as f:
            f.write(audio.content)
        break
    elif result["status"] == "failed":
        raise Exception(result.get("error", "Generation failed"))
    time.sleep(1.5)

Streaming alternative: For supported models (Kokoro, MeloTTS), use POST /v1/tts/stream/ for real-time Server-Sent Events (SSE) streaming — no polling needed.
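
The payload carried by each streaming event is not described here, so the sketch below covers only the generic Server-Sent Events framing: it collects the data: lines of each event and yields them once a blank line ends the event. iter_sse_data is a hypothetical helper; how to decode each payload (for example as JSON or as encoded audio) is left open.

```python
def iter_sse_data(lines):
    """Yield the data payload of each Server-Sent Event from an iterable of text lines."""
    buffer = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if buffer:
                yield "\n".join(buffer)
                buffer = []
        elif line.startswith("data:"):
            value = line[5:]
            # The SSE format strips one optional space after the colon.
            if value.startswith(" "):
                value = value[1:]
            buffer.append(value)
        # Other fields (event:, id:, retry:) and ":" comments are ignored here.
```

With the requests library, the lines could come from a streamed response, for example response.iter_lines(decode_unicode=True).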

Speech to Text

POST /v1/stt/

Transcribe audio to text. Supports 99 languages with auto-detection.

Request Parameters (multipart/form-data)

Parameter | Type | Required | Description
file | file | Yes | Audio file (MP3, WAV, FLAC, OGG, M4A, MP4, WebM). Max 100MB.
model | string | No | STT model: whisper (default), faster-whisper, sensevoice
language | string | No | Language code. Use auto for automatic detection (default).
timestamps | boolean | No | Include word-level timestamps. Default: false
diarize | boolean | No | Enable speaker diarization. Default: false

Response

JSON response
{
  "text": "Hello, this is a transcription test.",
  "language": "en",
  "duration": 3.5,
  "segments": [
    {
      "start": 0.0,
      "end": 1.8,
      "text": "Hello, this is",
      "speaker": "SPEAKER_00"
    },
    {
      "start": 1.8,
      "end": 3.5,
      "text": "a transcription test.",
      "speaker": "SPEAKER_00"
    }
  ]
}
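
When the response includes speaker labels, consecutive segments from the same speaker are often easier to read as a single turn. The sketch below merges them using only the fields shown in the sample response; speaker_turns is a hypothetical helper, and segments without a speaker field are grouped under "UNKNOWN".

```python
def speaker_turns(response):
    """Merge consecutive segments spoken by the same speaker into turns."""
    turns = []
    for seg in response.get("segments", []):
        speaker = seg.get("speaker", "UNKNOWN")
        if turns and turns[-1]["speaker"] == speaker:
            # Same speaker as the previous segment: extend the current turn.
            turns[-1]["text"] += " " + seg["text"]
            turns[-1]["end"] = seg["end"]
        else:
            turns.append({"speaker": speaker, "start": seg["start"],
                          "end": seg["end"], "text": seg["text"]})
    return turns
```

Applied to the sample response above, this produces one turn for SPEAKER_00 spanning 0.0 to 3.5 seconds.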

Voice Cloning

POST /v1/tts/clone/

Generate speech in a cloned voice. Upload a reference audio clip and the text to speak.

Request Parameters (multipart/form-data)

Parameter | Type | Required | Description
reference_audio | file | Yes | Reference audio of the voice to clone (10-30 seconds recommended). Max 20MB.
text | string | Yes | Text to speak in the cloned voice.
model | string | No | Cloning model: chatterbox (default), cosyvoice2, gpt-sovits
format | string | No | Output format: mp3 (default), wav, flac
language | string | No | Target language code. Must be supported by the selected model.

Response

Returns the audio file as binary data, like the TTS endpoint.

Voice Conversion

POST /v1/voice-convert/

Convert audio so that it sounds like a different voice. Upload the source audio and choose a target voice.

Request Parameters (multipart/form-data)

Parameter | Type | Required | Description
file | file | Yes | Source audio file (MP3, WAV, FLAC). Max 50MB.
target_voice | string | Yes | Target voice ID to convert to (use /v1/voices/ to list available voices)
model | string | No | Voice conversion model: openvoice (default), knn-vc
format | string | No | Output format: wav (default), mp3, flac

Example Request

cURL
curl -X POST https://api.tts.ai/v1/voice-convert/ \
  -H "Authorization: Bearer sk-tts-your-key" \
  -F "file=@source_audio.mp3" \
  -F "target_voice=af_bella" \
  -F "model=openvoice" \
  -o converted.wav

Response

Returns the converted audio file as binary data.

Speech Translation

POST /v1/speech-translate/

Translate spoken audio from one language to another. Combines speech-to-text, translation, and text-to-speech in a single call.

Request Parameters (multipart/form-data)

Parameter | Type | Required | Description
file | file | Yes | Source audio file. Max 100MB.
target_language | string | Yes | Target language code (e.g., es, fr, de, ja)
voice | string | No | Voice ID for the translated speech output
preserve_voice | boolean | No | Try to preserve the original speaker's voice characteristics. Default: false

Response

JSON response
{
  "original_text": "Hello, how are you?",
  "translated_text": "Hola, como estas?",
  "source_language": "en",
  "target_language": "es",
  "audio_url": "https://api.tts.ai/v1/results/translate_abc123.mp3",
  "credits_used": 5
}

Speech to Speech

POST /v1/speech-to-speech/

Transform the style, emotion, or delivery of speech while preserving its content. Useful for adjusting tone, pacing, and expressiveness.

Request Parameters (multipart/form-data)

Parameter | Type | Required | Description
file | file | Yes | Audio file. Max 50MB.
voice | string | Yes | Target voice ID for the output speech
model | string | No | Model: openvoice (default), chatterbox
emotion | string | No | Target emotion: neutral, happy, sad, angry, excited
speed | float | No | Speed adjustment. Default: 1.0. Range: 0.5 to 2.0

Response

Returns the transformed audio file as binary data.

Audio Tools

Audio processing endpoints for enhancement, vocal removal, stem separation, and more.

POST /v1/audio/enhance/

Enhance audio quality: denoising, clarity improvement, super-resolution.

file | file | Audio file to enhance
denoise | boolean | Apply noise reduction (default: true)
enhance_clarity | boolean | Improve speech clarity (default: true)
super_resolution | boolean | Upscale audio quality (default: false)
strength | integer | 1-3 (light, medium, strong). Default: 2

POST /v1/audio/separate/

Separate vocals from instrumentals (vocal removal) or split the audio into stems.

file | file | Audio file to separate
model | string | demucs (default) or spleeter
stems | integer | Number of stems: 2, 4, 5, or 6 (default: 2)
format | string | Output format: wav, mp3, flac

POST /v1/audio/dereverb/

Remove echo and reverb from audio recordings.

file | file | Audio file to process
type | string | echo or reverb (default: both)
intensity | integer | 1-5 (default: 3)

POST /v1/audio/analyze/ (Free)

Analyze audio to detect its key, BPM, and time signature.

file | file | Audio file to analyze

Response
{
  "key": "C",
  "scale": "Major",
  "bpm": 120.0,
  "time_signature": "4/4",
  "camelot": "8B",
  "compatible_keys": ["C Major", "G Major", "F Major", "A Minor"]
}

POST /v1/audio/convert/ (Free)

Convert audio between formats.

file | file | Audio file to convert
format | string | Target format: mp3, wav, flac, ogg, m4a, aac
bitrate | integer | Output bitrate in kbps: 64, 128, 192, 256, 320
sample_rate | integer | Sample rate: 22050, 44100, 48000
channels | string | mono or stereo

Voice Chat

POST /v1/voice-chat/

Send audio or text and receive an AI response with synthesized speech.

Request Parameters (multipart/form-data or JSON)

Parameter | Type | Required | Description
audio | file | No* | Input audio (either audio or text is required)
text | string | No* | Input text (either audio or text is required)
voice | string | No | Voice for the AI response. Default: af_bella
tts_model | string | No | TTS model used for the response. Default: kokoro
system_prompt | string | No | System prompt for the AI
conversation_id | string | No | Continue an existing conversation

Response

JSON response
{
  "conversation_id": "conv_abc123",
  "user_text": "What is the capital of France?",
  "ai_text": "The capital of France is Paris.",
  "audio_url": "https://api.tts.ai/v1/audio/tmp/resp_xyz.mp3",
  "credits_used": 3
}
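
A multi-turn conversation presumably works by sending the conversation_id from one response back with the next request. The sketch below builds the fields for a text turn under that assumption; next_turn_fields is a hypothetical helper, and the defaults mirror the parameter table above.

```python
def next_turn_fields(user_text, previous_response=None,
                     voice="af_bella", tts_model="kokoro"):
    """Build request fields for a text turn, continuing a conversation if one exists."""
    if not user_text:
        raise ValueError("user_text is required when no audio is sent")
    fields = {"text": user_text, "voice": voice, "tts_model": tts_model}
    if previous_response is not None:
        # Reuse the ID returned by the previous turn to keep the context.
        fields["conversation_id"] = previous_response["conversation_id"]
    return fields
```

The first turn omits conversation_id; every later turn passes the most recent response so the same ID is reused.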

Batch TTS

POST /v1/tts/batch/

Submit multiple texts for batch TTS generation. Optionally, receive a webhook callback when the whole batch is complete.

Parameters

Parameter | Type | Description
texts | array | Array of objects: {text, model, voice}. Max 50 items.
webhook_url | string | Optional URL to POST the results to when the batch completes.

Response

JSON response
{
  "batch_id": "abc123",
  "total": 3,
  "completed": 0,
  "status": "processing"
}

Poll for progress with GET /v1/tts/batch/result/?batch_id=abc123
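
Because a batch accepts at most 50 items, longer lists have to be split across several requests. The sketch below chunks a list of texts into request bodies of the documented shape; build_batches is a hypothetical helper, and using the same model and voice for every item is a simplification.

```python
MAX_BATCH_ITEMS = 50  # documented per-batch limit


def build_batches(texts, model, voice, webhook_url=None):
    """Split texts into batch request bodies of at most MAX_BATCH_ITEMS items each."""
    items = [{"text": t, "model": model, "voice": voice} for t in texts]
    payloads = []
    for start in range(0, len(items), MAX_BATCH_ITEMS):
        payload = {"texts": items[start:start + MAX_BATCH_ITEMS]}
        if webhook_url:
            payload["webhook_url"] = webhook_url
        payloads.append(payload)
    return payloads
```

Each returned payload would be sent as its own POST to /v1/tts/batch/ and tracked by its own batch_id.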

Voice Embedding

POST /v1/voice-embed/

Pre-compute a voice embedding from reference audio. Use the returned embed_id in later voice-cloning requests for near-instant generation.

Parameters

Parameter | Type | Description
file | file | Reference audio file (WAV, MP3, FLAC).
model | string | Cloning model (default: chatterbox). Supported: chatterbox, cosyvoice2, openvoice, gpt-sovits, spark, indextts2, qwen3-tts.

Response

JSON response
{
  "embed_id": "emb_abc123",
  "model": "chatterbox",
  "duration_ms": 450
}

Health Check

GET /v1/health/

Check GPU server status, loaded models, and queue size. No authentication required. Cached for 30 seconds.

Response

JSON response
{
  "status": "online",
  "latency_ms": 45,
  "queue_size": 3,
  "models_loaded": ["kokoro", "chatterbox", "cosyvoice2"]
}

List Models

GET /v1/models/

Returns a list of all available models with their capabilities.

Response

JSON response
{
  "models": [
    {
      "id": "kokoro",
      "name": "Kokoro",
      "type": "tts",
      "tier": "standard",
      "languages": ["en", "ja", "ko", "zh", "fr"],
      "supports_cloning": false,
      "supports_streaming": true,
      "credits_per_1k_chars": 2
    },
    {
      "id": "chatterbox",
      "name": "Chatterbox",
      "type": "tts",
      "tier": "premium",
      "languages": ["en"],
      "supports_cloning": true,
      "supports_streaming": true,
      "credits_per_1k_chars": 4
    }
  ]
}
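
The capability flags in this response make it possible to choose a model programmatically. As an illustrative sketch, the function below picks the model with the lowest credits_per_1k_chars among those that support a language and any required features. pick_model is a hypothetical helper that relies only on the fields shown in the sample.

```python
def pick_model(catalog, language, need_cloning=False, need_streaming=False):
    """Return the ID of the cheapest model that meets the requirements, or None."""
    candidates = [
        m for m in catalog["models"]
        if language in m["languages"]
        and (m["supports_cloning"] or not need_cloning)
        and (m["supports_streaming"] or not need_streaming)
    ]
    if not candidates:
        return None
    return min(candidates, key=lambda m: m["credits_per_1k_chars"])["id"]
```

With the two models in the sample response, an English request without cloning resolves to kokoro, while requiring cloning resolves to chatterbox.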

List Voices

GET /v1/voices/

Returns a list of all available voices, optionally filtered by model or language.

Query Parameters

Parameter | Type | Description
model | string | Filter by model (e.g., kokoro)
language | string | Filter by language (e.g., en)
gender | string | Filter by gender: male, female, neutral

Response

JSON response
{
  "voices": [
    {
      "id": "af_bella",
      "name": "Bella",
      "model": "kokoro",
      "language": "en",
      "gender": "female",
      "preview_url": "https://api.tts.ai/v1/voices/preview/af_bella.mp3"
    }
  ],
  "total": 142
}
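
The filters are ordinary query-string parameters, so the request URL can be assembled with the standard library. This is a small sketch; voices_url is a hypothetical helper, and filters left as None are omitted from the query.

```python
from urllib.parse import urlencode

BASE_URL = "https://api.tts.ai/v1/"


def voices_url(model=None, language=None, gender=None):
    """Build the /v1/voices/ URL with only the filters that were provided."""
    filters = {"model": model, "language": language, "gender": gender}
    query = urlencode({k: v for k, v in filters.items() if v is not None})
    return BASE_URL + "voices/" + (f"?{query}" if query else "")
```

For example, voices_url(model="kokoro", language="en") yields https://api.tts.ai/v1/voices/?model=kokoro&language=en.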

Subtitles (SRT / VTT) (New)

GET /v1/speech/subtitles/?uuid=<job_uuid>&format=srt|vtt&download=1

Generate synchronized subtitles for a completed TTS job. Runs Whisper alignment over the audio and returns SRT or WebVTT. The result is cached on disk, so a second request for the same uuid is served from disk.

Query Parameters

Parameter | Required | Description
uuid | Yes | Job UUID returned by /v1/tts/ or /v1/voice-clone/.
format | No | srt (default) or vtt.
download | No | Set to 1 to send Content-Disposition: attachment so the browser saves the file instead of displaying it.
language | No | Language hint for the alignment model (auto-detected if omitted).

cURL
curl "https://api.tts.ai/v1/speech/subtitles/?uuid=$UUID&format=srt&download=1" -o subtitles.srt

Pronunciation Dictionary (New)

GET POST DELETE /api/v1/pronunciations/

Tell the TTS engine how to pronounce specific words. Saved entries are applied automatically to every TTS request you make. Limit: 200 entries per account.

Request Parameters (POST)

Parameter | Type | Description
word | string | The word to replace (e.g., GIF, Anthropic). Matched on word boundaries.
replacement | string | How to spell it out for the model (e.g., jiff, ann THROP ick).
language | string | Optional ISO code. Empty = applies to all languages.
case_sensitive | boolean | false by default. Matches case exactly when true.

cURL
# Save an entry
curl -X POST https://tts.ai/api/v1/pronunciations/ \
  -H "Authorization: Bearer sk-tts-..." \
  -H "Content-Type: application/json" \
  -d '{"word": "GIF", "replacement": "jiff"}'

# List your entries
curl https://tts.ai/api/v1/pronunciations/ -H "Authorization: Bearer sk-tts-..."

# Delete entry by id
curl -X DELETE "https://tts.ai/api/v1/pronunciations/?id=42" -H "Authorization: Bearer sk-tts-..."

You can also pass per-request overrides without saving them: include pronunciations on any /v1/tts/ call as a single object or an array (see the TTS endpoint parameters).
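
To preview how an entry will affect a piece of text before saving it, the substitution can be approximated locally. The sketch below is only an illustration of whole-word, optionally case-sensitive replacement. The server's actual matching rules may differ, and apply_pronunciations is a hypothetical helper.

```python
import re


def apply_pronunciations(text, entries):
    """Apply whole-word replacements locally (an approximation of server behavior)."""
    for entry in entries:
        flags = 0 if entry.get("case_sensitive", False) else re.IGNORECASE
        pattern = r"\b" + re.escape(entry["word"]) + r"\b"
        # A function replacement keeps backslashes in the replacement literal.
        text = re.sub(pattern, lambda _m, r=entry["replacement"]: r, text, flags=flags)
    return text
```

With the entry {"word": "GIF", "replacement": "jiff"}, both "gif" and "GIF" are replaced, while "gifts" is left untouched because the match stops at word boundaries.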

Qore Maqaal cusub

Ku rid