API na rubutu zuwa magana ga masu haɓakawa

Yi amfani da REST API don ƙirƙirar aikace-aikacen magana. Ƙara rubutu na halitta zuwa magana, ƙirƙirar magana, magana zuwa rubutu, da sarrafa sauti zuwa aikace-aikacenka, chatbots, masu taimakon magana, da samfuran SaaS. OpenAI-compatible format, 20+ models, simple integration.

API na REST QDialogButtonBox Shiryoyin Ayuka na Sauti Kayan Aikin SaaS QDialogButtonBox

@ action

Free with Kokoro, Piper, VITS, MeloTTS
Za'a nuna sauti da ka samar a nan
@ action
QFileDialog
Yaushe kake son TTS.ai? Ka gaya wa abokanka!

QDialogButtonBox

Duk abin da kuke buƙata don ƙirƙirar shiryoyin ayuka masu karɓar magana

QPrintPreviewDialog

Tambayar POST daya don ƙirƙirar magana. Tambayar JSON, amsawar sauti. Yana aiki tare da kowane harshe na shirye-shirye da ke goyon bayan HTTP.

KCharselect unicode block name

Ɗaukar-a cikin mai mayewa ga OpenAI TTS API. Sauya maɓallinka na base_url da API — Ƙididdigar da ke akwai tana aiki da sauri.

24+ Models Available

Cire kowane nau'i ta hanyar API guda. Sauya nau'i ta hanyar canza paramita guda. Ka kwatanta inganci, gudu, da farashin.

KCharselect unicode block name

Kokoro na samar da sauti cikin sakan 1. Kyakkyawan abu ne ga masu magana da kai, masu taimakawa da magana, da kuma shiri-shiri masu tattaunawa.

QDialogButtonBox

@ action

@ item font

Fara fitarwa kamar WAV, MP3, OGG, ko FLAC. Zaɓi adadin misali da zurfin bita. Taimako na fitarwa na sauti ga shirye-shirye na lokaci-da-lokaci.

QDialogButtonBox

Zaɓi maɓallin da ya dace da sauri, inganci, da bukatun farashin shirin ayuka na ku

KokoroKokoro

Free

Lightweight 82M parameter model delivering studio-quality speech with blazing-fast inference.

Fast 5/5

Mafi kyawun ga: Mafi sauri model - sub-second latency, mafi kyau ga real-time aikace-aikace da chatbots

QDialogButtonBox Kokoro

CosyVoice 2CosyVoice 2

Standard

Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.

Medium 5/5 QShortcut

Mafi kyawun ga: Streaming TTS tare da ƙirƙirar sauti ga aikace-aikacen mai taimakon magana

QDialogButtonBox CosyVoice 2

Sesame CSMSesame CSM

Premium

Conversational speech model generating natural dialogue with appropriate timing and emotion.

Slow 5/5

Mafi kyawun ga: AI mai magana da lokaci na dabi'a ga mai magana da sauti da mai taimako

QDialogButtonBox Sesame CSM

PiperPiper

Free

A fast, local neural text to speech system optimized for Raspberry Pi and embedded devices.

Fast 3/5

Mafi kyawun ga: Free, CPU-only model for high-volume applications at zero cost

QDialogButtonBox Piper

BarkBark

Standard

Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.

Slow 4/5

Mafi kyawun ga: Yiwa sauti halitta tare da sakamako na sauti ga shiri-shiri na halitta da nishaɗi

QDialogButtonBox Bark

Yadda za a haɗa TTS API

Daga shiga zuwa kiran API na farko a cikin minti 5

1

QDialogButtonBox

Yi rajista don kyauta kuma ka samar da maɓallin API daga dashboard ɗin asusunka. 15,000 na haruffa an haɗa su.

2

Yi kira na farko

Post to /v1/tts with text, model, and voice. Get audio bytes back. Under 5 lines of code.

3

Zaɓi Nau'in Ka

Yi gwajin nau'ikan daban-daban don amfani da ka. Ka kwatanta sauri, inganci, da farashin kowace halitta.

4

@ action

Yi gwaji da alamomin biyan-a-kamar-ka-yi. Babu iyaka na farashin kan shirye-shiryen biyan kuɗi. Kula da amfani a cikin dashboard ɗinka.

@ action

Haɗa TTS.ai a cikin kowane harshe tare da API na REST

Python QShortcut
import requests

response = requests.post(
    "https://api.tts.ai/v1/tts",
    json={
        "text": "Hello from my app!",
        "model": "kokoro",
        "voice": "af_heart",
        "format": "mp3"
    },
    headers={
        "Authorization": "Bearer sk-tts-xxx"
    }
)

with open("output.mp3", "wb") as f:
    f.write(response.content)
JavaScript (Node.js) Node.js
const response = await fetch(
    "https://api.tts.ai/v1/tts",
    {
        method: "POST",
        headers: {
            "Content-Type": "application/json",
            "Authorization": "Bearer sk-tts-xxx"
        },
        body: JSON.stringify({
            text: "Hello from my app!",
            model: "kokoro",
            voice: "af_heart",
            format: "mp3"
        })
    }
);

const audio = await response.blob();
cURL QFontDatabase
curl -X POST https://api.tts.ai/v1/tts \
  -H "Authorization: Bearer sk-tts-xxx" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Hello from my app!",
    "model": "kokoro",
    "voice": "af_heart",
    "format": "mp3"
  }' \
  --output output.mp3
KCharselect unicode block name QPrintPreviewDialog
# Works with OpenAI client library
from openai import OpenAI

client = OpenAI(
    api_key="sk-tts-xxx",
    base_url="https://api.tts.ai/v1"
)

response = client.audio.speech.create(
    model="kokoro",
    voice="af_heart",
    input="Hello from my app!"
)

response.stream_to_file("output.mp3")

Abin da Masu Cigaban Ke Ginawa da TTS.ai

KCharselect unicode block name

QShortcut

Ƙara fitar da magana zuwa mai magana da kai ko mai taimakawa AI. Yi amsa LLM ta hanyar TTS ga maɓallan haɗi masu iya magana. Kokoro yana bayar da ƙarin sa'a don tattaunawa ta lokaci-da-lokaci. Sesame CSM yana samar da maganar tattaunawa tare da lokacin halitta.

  • Mai amsawa LLM zuwa maɓuɓɓugar magana
  • Sub-second latency with Kokoro
  • Zaɓuɓɓukan magana da Sesame CSM
  • Phonon:: MMF:: EffectFactory

KCharselect unicode block name

Build voice-enabled mobile apps, accessibility tools, reading apps, and language learning platforms. Our REST API works with any mobile framework. Download audio files or stream directly to the client.

  • React Native, Flutter, Swift, Kotlin
  • Shiryoyin Ayuka na Ciniki da karantawa
  • Manhajojin koyon harshe
  • KCharselect unicode block name

KCharselect unicode block name

White-label voice capabilities in your SaaS product. Add TTS, STT, voice cloning, and audio processing as features in your platform. Use our API as your voice backend without managing GPU infrastructure.

  • KCharselect unicode block name
  • Babu bukatar ginin GPU
  • Pay-per-use pricing
  • 20+ models don bayar da masu amfani da ku

QShortcut

Yi haɗin halittar magana cikin hanyoyin CI/CD, sarrafa abun ciki, da kuma ayyukan sarrafawa na baƙi. Yi halittar dubban fayilolin sauti daga bayanan spreadsheet, sarrafa samar da podcast, ko gina hanyoyin sarrafa abun ciki.

  • Phonon:: MMF:: EffectFactory
  • KCharselect unicode block name
  • QDialogButtonBox
  • Sheet zuwa audio automation

QDialogButtonBox

@ info: status

20+

KCharselect unicode block name

100+

QShortcut

30+

@ action

<1s

Latency (Kokoro)

Tambayar da ake yi da yawa

Tambayoyi masu yawa game da TTS.ai developer API

Na'am. API ɗinmu na bi tsarin maganar sauti na OpenAI. Idan kana amfani da ɗakin karatun mai amfani da OpenAI Python ko JavaScript, zaka iya canja zuwa TTS.ai ta hanyar canja paramita na base_url da api_key. Shirin ka na yanzu yana aiki ba tare da canjawa ba.

Kokoro na samar da sauti cikin sakan 1 ga kalmomi masu kama da juna. CosyVoice 2 na goyon bayan fitarwa ta hanyar gudu don rage jinkirin da aka gani. Ga bots na tattaunawa da masu taimakawa da magana, lokacin tafiyar da ke tafiyar da kai gaba ɗaya shine sakan 1-3 bisa ga tsawon rubutu da zaɓin maɓalli.

Free models (Kokoro, Piper, VITS, MeloTTS) ne gaba ɗaya free. Standard models amfani 2x alamomi ga 1K na rubutu. Premium models amfani 4x alamomi ga 1K na rubutu. Sign up free da 15,000 alamomi. Plans fara a $ 9 / watan ga 500,000 alamomi.

Na'am. Ka aiko da misalin sauti na alaƙa (dakika 5-30) zuwa maɓallin ƙarshe na ƙãga halittar magana, sa'an nan ka yi amfani da shaidar maganar da aka ƙãga halitta a cikin tambayoyin TTS masu zuwa. Nau'ukan da ke goyon bayan ƙãga halittar sun haɗa da CosyVoice 2, Chatterbox, Fish Speech, da GPT-SoVITS.

free tier has basic rate limiting (3 requests per hour without a account). Paid plans have generous rate limits suitable for production applications. Contact us for enterprise-level throughput requirements.

WAV (ba'a ƙuntata ba, mafi kyawun inganci), MP3 (an ƙuntata, fayiloli mafi ƙaranci), OGG (farashin siffar budewa), da FLAC (ƙuntata ba tare da asara ba). Ka ƙayyade siffar cikin tambayoyinka. Diff ɗin shi ne WAV a cikin adadin misali na asali na siffar.

Yes. Combine our TTS API with a speech-to-text model and a LLM to build a complete voice assistant pipeline. Kokoro provides sub-second latency ideal for real-time conversation. CosyVoice 2 supports streaming output for even lower perceived response times.

CosyVoice 2 da Kokoro suna goyon bayan fitarwa ta sauti mai gudu inda ake bayar da ɓangaren sauti kamar yadda aka halitta su. Wannan yana rage lokaci zuwa bayt na farko ga shiri-na-aiki na lokaci-da-kamfani kamar masu taimakon magana da dabaru masu tattaunawa.

API na mayar da alamun halin HTTP na gabaɗaya. Yi amfani da backoff na exponential ga kurakurai 5xx da amsawar iyaka. Ga aikace-aikace masu muhimmanci, ƙara wata ƙuri'a tare da sake kokarin lissafi. API namu yana da lokaci mai tsawo amma ana shawartar da kula da kurakurai masu ƙarfi.

Na'am. /v1/voices da /v1/models maɓallan ƙarshe suna mayar da jerin JSON na duk waƙoƙin da ake da su da kuma maɓallan tare da metadata na su (goyon bayan harshe, darajar inganci, darajar gudu, da maƙasudin farashi). Yi amfani da waɗannan don gina masu zaɓar maɓallan da suke canzawa a cikin shirin ayuka na ka.

@ action: inmenu

Mafi yawan samfuranmu suna da ma'ana mai budewa kuma ana iya yin su da kansu. Amma, yin kansu yana buƙatar albarkatun GPU masu mahimmanci (muna amfani da 4x NVIDIA Tesla P40 tare da 96GB VRAM gabaɗaya). API yana ba da zaɓi mai rahusa ba tare da kula da ginin ginin ba.
5.0/5 (1)

@ info

An shirya ka ka gina da AI na magana?

Ka samu maɓallin API kyauta kuma ka fara gina. 15,000 na haruffa a kan rajista, samfuran kyauta masu samuwa, takardun shaida masu zurfi.