Documentation
Integrate TTS.ai into your applications using the REST API.
Overview
The TTS.ai API provides programmatic access to all platform features: text-to-speech synthesis, speech-to-text transcription, voice cloning, audio enhancement, and more. The API uses standard REST conventions with JSON request/response bodies.
API Key
Get your API key from your account settings. Available on the Pro and Enterprise plans.
Base URL
https://api.tts.ai/v1/
Authentication
Bearer token via the Authorization header
Authentication
All API requests require authentication via a Bearer token in the Authorization header.
Authorization: Bearer sk-tts-your-api-key-here
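As a minimal sketch of the header above, the following Python builds an authenticated JSON request with the standard library only. The helper name `build_request` and the `TTS_API_KEY` environment variable are assumptions made for illustration and are not part of the TTS.ai API; the request is constructed but never sent.

```python
import json
import os
import urllib.request

BASE_URL = "https://api.tts.ai/v1/"


def build_request(path, payload, api_key):
    """Build (but do not send) a JSON POST request carrying a Bearer token."""
    return urllib.request.Request(
        BASE_URL + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# TTS_API_KEY is a hypothetical variable name; keep real keys out of source code.
api_key = os.environ.get("TTS_API_KEY", "sk-tts-your-api-key-here")
request = build_request(
    "tts/", {"model": "kokoro", "text": "Hi", "voice": "af_bella"}, api_key
)
print(request.full_url)
```

Reading the key from the environment rather than hard-coding it is a design choice of this sketch, not a requirement stated by the API.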
Base URL
All endpoints are relative to this base URL. For example, the TTS endpoint resolves to https://api.tts.ai/v1/tts/.
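How a relative endpoint path combines with the base URL can be sketched with `urllib.parse.urljoin`. The helper name `endpoint` is hypothetical; the paths `tts/` and `stt/` are the ones used in the examples elsewhere in this document.

```python
from urllib.parse import urljoin

BASE_URL = "https://api.tts.ai/v1/"


def endpoint(path):
    """Resolve a relative endpoint path against the base URL."""
    # Strip a leading slash so the path stays under /v1/ instead of the host root.
    return urljoin(BASE_URL, path.lstrip("/"))


print(endpoint("tts/"))   # https://api.tts.ai/v1/tts/
print(endpoint("/stt/"))  # https://api.tts.ai/v1/stt/
```

Without the `lstrip`, `urljoin` would treat `/stt/` as an absolute path and drop the `/v1/` prefix.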
Rate Limits
Rate limits vary by plan:
| Plan | Requests/min | Concurrent | Max Text Length |
|---|---|---|---|
| Pro | 60 | 5 | 5,000 characters |
| Enterprise | 300 | 20 | 50,000 characters |
Rate limit headers are included in every response: X-RateLimit-Limit, X-RateLimit-Remaining, X-RateLimit-Reset.
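A client can use these three headers to decide when to pause. The sketch below parses them from a header mapping and computes a wait time. It assumes X-RateLimit-Reset is a Unix timestamp in seconds, which the documentation does not state; the names `RateLimit`, `parse_rate_limit`, and `seconds_to_wait` are illustrative only.

```python
import time
from dataclasses import dataclass


@dataclass
class RateLimit:
    limit: int      # X-RateLimit-Limit
    remaining: int  # X-RateLimit-Remaining
    reset: float    # X-RateLimit-Reset (ASSUMED: Unix timestamp in seconds)


def parse_rate_limit(headers):
    """Read the three rate limit headers from a response header mapping."""
    return RateLimit(
        limit=int(headers["X-RateLimit-Limit"]),
        remaining=int(headers["X-RateLimit-Remaining"]),
        reset=float(headers["X-RateLimit-Reset"]),
    )


def seconds_to_wait(rate, now=None):
    """Return how long to pause before the next request (0 if quota remains)."""
    if rate.remaining > 0:
        return 0.0
    now = time.time() if now is None else now
    return max(0.0, rate.reset - now)


sample = {
    "X-RateLimit-Limit": "60",
    "X-RateLimit-Remaining": "0",
    "X-RateLimit-Reset": "1000",
}
print(seconds_to_wait(parse_rate_limit(sample), now=970.0))  # 30.0
```

If the reset header turns out to carry a number of seconds until reset instead, `seconds_to_wait` would simply return `rate.reset` when no quota remains.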
Credit Costs
| Service | Cost | Unit |
|---|---|---|
| TTS (Free models: Piper, VITS, MeloTTS) | 1 credit | per 1,000 characters |
| TTS (Standard models: Kokoro, CosyVoice 2, etc.) | 2 credits | per 1,000 characters |
| TTS (Premium models: Tortoise, Chatterbox, etc.) | 4 credits | per 1,000 characters |
| Speech to Text | 2 credits | per minute of audio |
| Voice Cloning | 4 credits | per 1,000 characters |
| Voice Changer | 3 credits | per minute of audio |
| Audio Enhancement | 2 credits | per minute of audio |
| Vocal Removal / Stem Splitting | 3-4 credits | per minute of audio |
| Speech Translation | 5 credits | per minute of audio |
| Voice Chat | 3 credits | per turn |
| Key & BPM Finder | Free | -- |
| Audio Converter | Free | -- |
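The per-1,000-character TTS prices above can be turned into a rough cost estimate. The sketch below assumes that every started block of 1,000 characters is billed in full; the documentation does not spell out the rounding rule, although the sample TTS response in this document (a short Kokoro request reporting X-Credits-Used: 2) is consistent with it. The tier keys and the function name are illustrative.

```python
import math

# Credits per 1,000 characters, taken from the table above.
TTS_CREDITS_PER_1K = {"free": 1, "standard": 2, "premium": 4}


def estimate_tts_credits(text, tier):
    """Estimate TTS credits, ASSUMING each started 1,000-character block is billed."""
    blocks = math.ceil(len(text) / 1000)
    return blocks * TTS_CREDITS_PER_1K[tier]


print(estimate_tts_credits("Hello from TTS.ai! This is a test.", "standard"))  # 2
print(estimate_tts_credits("x" * 2500, "premium"))  # 12
```

Treat the result as an estimate and rely on the X-Credits-Used response header for the actual charge.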
Text to Speech
Convert text to speech audio.
Request Body
| Parameter | Type | Required | Description |
|---|---|---|---|
| model | string | Yes | Model ID (e.g., kokoro, chatterbox) |
| text | string | Yes | Text to convert to speech (max 5,000 characters for Pro, 50,000 for Enterprise) |
| voice | string | Yes | Voice ID (use the /v1/voices/ endpoint to list available voices) |
| format | string | No | Output format: mp3 (default), wav, ogg |
| speed | float | No | Speech speed multiplier. Default: 1.0 |
| language | string | No | Language code (e.g., en). Auto-detected if omitted. |
| stream | boolean | No | Stream the response. |
Example Request
curl -X POST https://api.tts.ai/v1/tts/ \
-H "Authorization: Bearer sk-tts-your-key" \
-H "Content-Type: application/json" \
-d '{
"model": "kokoro",
"text": "Hello from TTS.ai! This is a test.",
"voice": "af_bella",
"format": "mp3"
}' \
--output output.mp3
Response
Returns the audio file as binary data with appropriate Content-Type header (audio/mpeg, audio/wav, etc.).
Content-Type: audio/mpeg
Content-Length: 48256
X-Credits-Used: 2
X-Credits-Remaining: 498
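Checking a request locally before sending it avoids spending a round trip on a 400 response. The sketch below validates the required TTS fields and the per-plan text limits listed in the parameter table (5,000 characters for Pro, 50,000 for Enterprise). The function name, the `plan` argument, and the client-side checks themselves are assumptions of this example, not behavior of the API.

```python
MAX_TEXT_CHARS = {"pro": 5_000, "enterprise": 50_000}
OPTIONAL_FIELDS = {"format", "speed", "language", "stream"}


def build_tts_payload(model, text, voice, plan="pro", **optional):
    """Validate the required TTS fields and return a JSON-serializable payload."""
    for name, value in (("model", model), ("text", text), ("voice", voice)):
        if not value:
            raise ValueError(f"'{name}' is required")
    limit = MAX_TEXT_CHARS[plan]
    if len(text) > limit:
        raise ValueError(
            f"text has {len(text)} characters; the {plan} limit is {limit}"
        )
    unknown = set(optional) - OPTIONAL_FIELDS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    return {"model": model, "text": text, "voice": voice, **optional}


payload = build_tts_payload("kokoro", "Hello from TTS.ai!", "af_bella", format="mp3")
print(payload)
```

The returned dictionary can be passed directly as the JSON body of a POST to the TTS endpoint.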
Speech to Text
Transcribe audio to text. Supports 99 languages with automatic language detection.
Request Body (multipart/form-data)
| Parameter | Type | Required | Description |
|---|---|---|---|
| file | file | Yes | Audio file (MP3, WAV, FLAC, OGG, M4A, MP4, WebM). |
| model | string | No | STT model: whisper (default), faster-whisper, sensevoice |
| language | string | No | Language code of the audio. |
| timestamps | boolean | No | Include word-level timestamps. |
| diarize | boolean | No | Enable speaker diarization. |
Response
{
  "text": "Hello, this is a transcription test.",
  "language": "en",
  "duration": 3.5,
  "segments": [
    {
      "start": 0.0,
      "end": 1.8,
      "text": "Hello, this is",
      "speaker": "SPEAKER_00"
    },
    {
      "start": 1.8,
      "end": 3.5,
      "text": "a transcription test.",
      "speaker": "SPEAKER_00"
    }
  ]
}
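The `segments` array in this response can be turned into a readable transcript. The following sketch formats each segment with its time range and speaker label. The field names come from the sample response above; the helper names and the fallback label `UNKNOWN` are assumptions of this example.

```python
def format_timestamp(seconds):
    """Render seconds as MM:SS.s, for example 3.5 becomes '00:03.5'."""
    minutes, secs = divmod(seconds, 60)
    return f"{int(minutes):02d}:{secs:04.1f}"


def transcript_lines(result):
    """Turn the 'segments' array into one readable line per segment."""
    lines = []
    for seg in result.get("segments", []):
        # 'speaker' appears in the sample above; it may be absent without diarization.
        speaker = seg.get("speaker", "UNKNOWN")
        lines.append(
            f"[{format_timestamp(seg['start'])} - {format_timestamp(seg['end'])}] "
            f"{speaker}: {seg['text']}"
        )
    return lines


result = {
    "text": "Hello, this is a transcription test.",
    "segments": [
        {"start": 0.0, "end": 1.8, "text": "Hello, this is", "speaker": "SPEAKER_00"},
        {"start": 1.8, "end": 3.5, "text": "a transcription test.", "speaker": "SPEAKER_00"},
    ],
}
for line in transcript_lines(result):
    print(line)
```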
Voice Cloning
Generate speech in a cloned voice by uploading reference audio and text.
Request Body (multipart/form-data)
| Parameter | Type | Required | Description |
|---|---|---|---|
| reference_audio | file | Yes | Reference audio; 10-30 seconds recommended. Max 20MB. |
| text | string | Yes | Text to speak in the cloned voice. |
| model | string | No | Cloning model (e.g., chatterbox, cosyvoice2) |
| format | string | No | Output audio format. |
| language | string | No | Language code. |
Response
Returns the audio file as binary data, same as the TTS endpoint.
Voice Changer
Convert audio so that it sounds like a different voice.
Request Body (multipart/form-data)
| Parameter | Type | Required | Description |
|---|---|---|---|
| file | file | Yes | Source audio file (MP3, WAV, FLAC). |
| target_voice | string | Yes | Target voice ID to convert to (use /v1/voices/ to list available voices) |
| model | string | No | Voice conversion model, e.g., openvoice (default) |
| format | string | No | Output audio format (e.g., mp3). |
Example Request
curl -X POST https://api.tts.ai/v1/voice-convert/ \
-H "Authorization: Bearer sk-tts-your-key" \
-F "file=@source_audio.mp3" \
-F "target_voice=af_bella" \
-F "model=openvoice" \
-o converted.wav
Response
Returns the converted audio file as binary data.
Speech Translation
Translate spoken audio from one language to another. Combines speech recognition, translation, and text-to-speech in a single call.
Request Body (multipart/form-data)
| Parameter | Type | Required | Description |
|---|---|---|---|
| file | file | Yes | Source audio file in the original language. Max 100MB. |
| target_language | string | Yes | Target language code (e.g., es). |
| voice | string | No | Voice for the translated output. Auto-selected if omitted. |
| preserve_voice | boolean | No | Attempt to preserve the original speaker's voice. |
Response
{
  "original_text": "Hello, how are you?",
  "translated_text": "Hola, como estas?",
  "source_language": "en",
  "target_language": "es",
  "audio_url": "https://api.tts.ai/v1/results/translate_abc123.mp3",
  "credits_used": 5
}
Speech to Speech
Change the style, emotion, or delivery of speech while preserving its content.
Request Body (multipart/form-data)
| Parameter | Type | Required | Description |
|---|---|---|---|
| file | file | Yes | Source audio file. Max 50MB. |
| voice | string | Yes | Target voice for the output speech. |
| model | string | No | Speech-to-speech model. |
| emotion | string | No | Target emotion (e.g., neutral, sad). |
| speed | float | No | Speed adjustment multiplier. |
Response
Returns the transformed audio file as binary data.
Audio Tools
Audio processing endpoints for enhancement, vocal removal, stem splitting, and more.
Enhance audio quality: denoising, clarity improvement, super resolution.
| Parameter | Type | Description |
|---|---|---|
| file | file | Audio file to enhance |
| denoise | boolean | Apply noise reduction (default: true) |
| enhance_clarity | boolean | Improve speech clarity (default: true) |
| super_resolution | boolean | Upscale audio quality (default: false) |
| strength | integer | 1-3 (light, medium, strong). Default: 2 |
Separate vocals from instrumentals (vocal removal) or split audio into stems.
| Parameter | Type | Description |
|---|---|---|
| file | file | Audio file to separate |
| model | string | demucs (default) or spleeter |
| stems | integer | Number of stems: 2, 4, 5, or 6 (default: 2) |
| format | string | Output audio format. |
Remove echo and reverb from audio recordings.
| Parameter | Type | Description |
|---|---|---|
| file | file | Audio file to process |
| type | string | echo or reverb (default: both) |
| intensity | integer | 1-5 (default: 3) |
Analyze audio to detect key, BPM, and time signature.
| Parameter | Type | Description |
|---|---|---|
| file | file | Audio file to analyze |
{
  "key": "C",
  "scale": "Major",
  "bpm": 120.0,
  "time_signature": "4/4",
  "camelot": "8B",
  "compatible_keys": ["C Major", "G Major", "F Major", "A Minor"]
}
Convert audio between formats.
| Parameter | Type | Description |
|---|---|---|
| file | file | Audio file to convert |
| format | string | Target format (e.g., wav, m4a) |
| bitrate | integer | Output bitrate in kbps: 64, 128, 192, 256, 320 |
| sample_rate | integer | Sample rate in Hz: 22050, 44100, 48000 |
| channels | string | mono or stereo |
Voice Chat
Send audio or text and receive an AI response with synthesized speech.
Request Body (multipart/form-data or JSON)
| Parameter | Type | Required | Description |
|---|---|---|---|
| audio | file | No* | Audio input (either audio or text is required) |
| text | string | No* | Text input (either audio or text is required) |
| voice | string | No | Voice for the AI response. Default: af_bella |
| tts_model | string | No | TTS model for the response. Default: kokoro |
| system_prompt | string | No | Custom system prompt for the AI |
| conversation_id | string | No | Continue an existing conversation |
Response
{
  "conversation_id": "conv_abc123",
  "user_text": "What is the capital of France?",
  "ai_text": "The capital of France is Paris.",
  "audio_url": "https://api.tts.ai/v1/audio/tmp/resp_xyz.mp3",
  "credits_used": 3
}
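A multi-turn conversation works by passing the `conversation_id` from one response into the next request. The sketch below builds the request fields for each turn. The field names come from the parameter table and sample response above; the helper name `next_turn_fields` is hypothetical, and no request is sent.

```python
def next_turn_fields(user_text, previous_response=None, voice="af_bella", tts_model="kokoro"):
    """Build request fields for the next chat turn, reusing conversation_id if present."""
    fields = {"text": user_text, "voice": voice, "tts_model": tts_model}
    conversation_id = (previous_response or {}).get("conversation_id")
    if conversation_id:
        fields["conversation_id"] = conversation_id
    return fields


# First turn: no conversation exists yet, so no conversation_id is sent.
first = next_turn_fields("What is the capital of France?")

# Later turns: carry over the ID returned by the previous response.
reply = {
    "conversation_id": "conv_abc123",
    "ai_text": "The capital of France is Paris.",
    "credits_used": 3,
}
follow_up = next_turn_fields("And of Spain?", previous_response=reply)
print(follow_up["conversation_id"])  # conv_abc123
```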
List Models
Returns a list of all available models with their capabilities.
Response
{
  "models": [
    {
      "id": "kokoro",
      "name": "Kokoro",
      "type": "tts",
      "tier": "standard",
      "languages": ["en", "ja", "ko", "zh", "fr"],
      "supports_cloning": false,
      "supports_streaming": true,
      "credits_per_1k_chars": 2
    },
    {
      "id": "chatterbox",
      "name": "Chatterbox",
      "type": "tts",
      "tier": "premium",
      "languages": ["en"],
      "supports_cloning": true,
      "supports_streaming": true,
      "credits_per_1k_chars": 4
    }
  ]
}
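The capability fields in this response make it possible to pick a model programmatically. The sketch below filters TTS models by language and cloning support and orders them by cost. It relies only on the fields shown in the sample above; the function name `find_tts_models` is illustrative.

```python
def find_tts_models(models_response, language=None, needs_cloning=False):
    """Return matching TTS model IDs, cheapest first."""
    matches = []
    for model in models_response["models"]:
        if model["type"] != "tts":
            continue
        if language is not None and language not in model["languages"]:
            continue
        if needs_cloning and not model["supports_cloning"]:
            continue
        matches.append(model)
    matches.sort(key=lambda m: m["credits_per_1k_chars"])
    return [m["id"] for m in matches]


sample = {
    "models": [
        {"id": "kokoro", "type": "tts", "languages": ["en", "ja", "ko", "zh", "fr"],
         "supports_cloning": False, "credits_per_1k_chars": 2},
        {"id": "chatterbox", "type": "tts", "languages": ["en"],
         "supports_cloning": True, "credits_per_1k_chars": 4},
    ]
}
print(find_tts_models(sample, language="en"))       # ['kokoro', 'chatterbox']
print(find_tts_models(sample, needs_cloning=True))  # ['chatterbox']
print(find_tts_models(sample, language="ja"))       # ['kokoro']
```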
List Voices
Returns a list of all available voices, optionally filtered by model or language.
Query Parameters
| Parameter | Type | Description |
|---|---|---|
| model | string | Filter by model ID (e.g., kokoro) |
| language | string | Filter by language code (e.g., en) |
| gender | string | Filter by gender (e.g., male, female) |
Response
{
  "voices": [
    {
      "id": "af_bella",
      "name": "Bella",
      "model": "kokoro",
      "language": "en",
      "gender": "female",
      "preview_url": "https://api.tts.ai/v1/voices/preview/af_bella.mp3"
    }
  ],
  "total": 142
}
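The filters in the table above can be encoded as a query string. This sketch builds the URL with `urllib.parse.urlencode`, omitting any filter that is not set. The `/v1/voices/` path is taken from the parameter descriptions elsewhere in this document; that the endpoint is queried with GET and a standard query string is an assumption, and `voices_url` is a hypothetical helper.

```python
from urllib.parse import urlencode

VOICES_URL = "https://api.tts.ai/v1/voices/"


def voices_url(model=None, language=None, gender=None):
    """Build the voice listing URL, including only the filters that are set."""
    filters = (("model", model), ("language", language), ("gender", gender))
    params = {name: value for name, value in filters if value is not None}
    return f"{VOICES_URL}?{urlencode(params)}" if params else VOICES_URL


print(voices_url())
print(voices_url(model="kokoro", language="en"))
# https://api.tts.ai/v1/voices/?model=kokoro&language=en
```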
Code Examples
Text to Speech
import requests

API_KEY = "sk-tts-your-key"

# Text to Speech
response = requests.post(
    "https://api.tts.ai/v1/tts/",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={
        "model": "kokoro",
        "text": "Hello from TTS.ai!",
        "voice": "af_bella",
        "format": "mp3"
    }
)
response.raise_for_status()  # fail on an error response instead of saving it as audio

with open("output.mp3", "wb") as f:
    f.write(response.content)

print(f"Credits used: {response.headers.get('X-Credits-Used')}")
Speech to Text
# Speech to Text (reuses requests and API_KEY from the previous example)
with open("recording.mp3", "rb") as f:
    response = requests.post(
        "https://api.tts.ai/v1/stt/",
        headers={"Authorization": f"Bearer {API_KEY}"},
        files={"file": f},
        data={"model": "faster-whisper", "timestamps": "true"}
    )
response.raise_for_status()

result = response.json()
print(result["text"])
Voice Cloning
# Voice Cloning (reuses requests and API_KEY from the first example)
with open("reference.wav", "rb") as ref:
    response = requests.post(
        "https://api.tts.ai/v1/tts/clone/",
        headers={"Authorization": f"Bearer {API_KEY}"},
        files={"reference_audio": ref},
        data={
            "text": "This speech uses a cloned voice.",
            "model": "chatterbox"
        }
    )
response.raise_for_status()

with open("cloned_output.mp3", "wb") as f:
    f.write(response.content)
Text to Speech
const API_KEY = 'sk-tts-your-key';

// Text to Speech
const response = await fetch('https://api.tts.ai/v1/tts/', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${API_KEY}`,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    model: 'kokoro',
    text: 'Hello from TTS.ai!',
    voice: 'af_bella',
    format: 'mp3'
  })
});

const audioBlob = await response.blob();
const audioUrl = URL.createObjectURL(audioBlob);
const audio = new Audio(audioUrl);
audio.play();
Speech to Text
// Speech to Text
// audioFile is a File or Blob, for example from an <input type="file"> element
const formData = new FormData();
formData.append('file', audioFile);
formData.append('model', 'faster-whisper');

const response = await fetch('https://api.tts.ai/v1/stt/', {
  method: 'POST',
  headers: { 'Authorization': `Bearer ${API_KEY}` },
  body: formData
});

const result = await response.json();
console.log(result.text);
Text to Speech
# Text to Speech
curl -X POST https://api.tts.ai/v1/tts/ \
-H "Authorization: Bearer sk-tts-your-key" \
-H "Content-Type: application/json" \
-d '{"model":"kokoro","text":"Hello!","voice":"af_bella","format":"mp3"}' \
-o output.mp3
Speech to Text
# Speech to Text
curl -X POST https://api.tts.ai/v1/stt/ \
-H "Authorization: Bearer sk-tts-your-key" \
-F "file=@recording.mp3" \
-F "model=faster-whisper" \
-F "timestamps=true"
Voice Cloning
# Voice Cloning
curl -X POST https://api.tts.ai/v1/tts/clone/ \
-H "Authorization: Bearer sk-tts-your-key" \
-F "reference_audio=@reference.wav" \
-F "text=This uses a cloned voice." \
-F "model=chatterbox" \
-o cloned.mp3
Audio Enhancement
# Audio Enhancement
curl -X POST https://api.tts.ai/v1/audio/enhance/ \
-H "Authorization: Bearer sk-tts-your-key" \
-F "file=@noisy_audio.mp3" \
-F "denoise=true" \
-F "enhance_clarity=true" \
-o enhanced.mp3
Errors
All errors return a JSON response with an error field.
{
  "error": {
    "code": "insufficient_credits",
    "message": "You do not have enough credits for this request.",
    "credits_required": 4,
    "credits_available": 2
  }
}
| HTTP Status | Error Code | Description |
|---|---|---|
| 400 | bad_request | Check the error message for details. |
| 401 | unauthorized | Missing or invalid API key. |
| 402 | insufficient_credits | Not enough credits. Purchase more at /pricing/. |
| 403 | forbidden | Feature not available on your plan. |
| 404 | not_found | Model or voice not found. |
| 413 | file_too_large | Uploaded file exceeds the size limit. |
| 429 | rate_limited | Too many requests. Check the rate limit headers. |
| 500 | internal_error | Try again later. |
| 503 | model_loading | The model is loading. Retry in a few seconds. |
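A client typically retries only the transient failures in this table. The sketch below encodes one such policy with exponential backoff and formats the documented error object for logging. Treating 429, 500, and 503 as retryable is an assumption drawn from their descriptions, not a rule stated by the API; all function names and the backoff values are illustrative.

```python
# ASSUMPTION: these statuses are transient, based on the table's descriptions.
RETRYABLE = {429, 500, 503}


def should_retry(status, attempt, max_attempts=4):
    """Retry only transient statuses, and only while attempts remain."""
    return status in RETRYABLE and attempt < max_attempts


def backoff_seconds(attempt, base=1.0, cap=30.0):
    """Exponential backoff (1s, 2s, 4s, ...) capped at `cap`; `attempt` starts at 1."""
    return min(cap, base * 2 ** (attempt - 1))


def describe_error(body):
    """Format the documented error object as a single log line."""
    error = body.get("error", {})
    line = f"{error.get('code', 'unknown')}: {error.get('message', '')}"
    if "credits_required" in error:
        line += f" (need {error['credits_required']}, have {error.get('credits_available')})"
    return line


body = {
    "error": {
        "code": "insufficient_credits",
        "message": "You do not have enough credits for this request.",
        "credits_required": 4,
        "credits_available": 2,
    }
}
print(describe_error(body))
print(should_retry(402, attempt=1))  # False
print(should_retry(503, attempt=1), backoff_seconds(1))  # True 1.0
```

Client errors such as 400, 401, 402, 403, 404, and 413 are not retried here because repeating the same request would presumably fail the same way.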
Webhooks
For long-running tasks (stem splitting, batch processing), you can supply a webhook URL with the request. When the task completes, we will POST the results to your URL.
{
  "event": "task.completed",
  "task_id": "task_abc123",
  "status": "success",
  "result_url": "https://api.tts.ai/v1/results/task_abc123",
  "credits_used": 12,
  "created_at": "2025-01-15T10:30:00Z",
  "completed_at": "2025-01-15T10:30:45Z"
}
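A webhook receiver needs to parse this payload and decide what to do with the result. The sketch below extracts the fields shown in the sample and computes how long the task took. The field names come from the payload above; the helper names are hypothetical, and any HTTP server wiring or signature verification is outside the scope of this example.

```python
from datetime import datetime


def parse_utc(timestamp):
    """Parse an ISO 8601 timestamp ending in 'Z' into an aware datetime."""
    # Replace the 'Z' suffix so fromisoformat also works on Python versions before 3.11.
    return datetime.fromisoformat(timestamp.replace("Z", "+00:00"))


def summarize_webhook(payload):
    """Return a short summary for task.completed events, or None for other events."""
    if payload.get("event") != "task.completed":
        return None
    elapsed = parse_utc(payload["completed_at"]) - parse_utc(payload["created_at"])
    return {
        "task_id": payload["task_id"],
        "ok": payload["status"] == "success",
        "result_url": payload.get("result_url"),
        "credits_used": payload.get("credits_used"),
        "elapsed_seconds": elapsed.total_seconds(),
    }


payload = {
    "event": "task.completed",
    "task_id": "task_abc123",
    "status": "success",
    "result_url": "https://api.tts.ai/v1/results/task_abc123",
    "credits_used": 12,
    "created_at": "2025-01-15T10:30:00Z",
    "completed_at": "2025-01-15T10:30:45Z",
}
print(summarize_webhook(payload)["elapsed_seconds"])  # 45.0
```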
Ready to Build?
Get your API key and start integrating TTS.ai into your application.