Libreng AI > Teksto sa Pagsasalita
> 22+ open-source na mga modelo, 100+ boses, 32+ tl> wika. Walang account kinakailangan.
> Lahat ng kailangan mo para sa Voice AI
> 26 mga tool na pinalakas ng 24+ open-source AI modelo
> 22+ AI modelo ng boses
> Ang pinaka-komprehensibong koleksyon ng mga modelo ng open-source TTS sa isang platform
Kokoro Free
Kokoro is an 82 million parameter text-to-speech model that punches well above its weight class. Despite its tiny size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages including English, Japanese, Chinese, and Korean with a variety of expressive voices. It runs incredibly fast — generating audio nearly 100x faster than real-time on a GPU.
Pinakamahusay para sa: High-quality TTS with minimal latency, streaming applications
> Subukan ang Libre
Piper Free
Piper is a lightweight text-to-speech engine developed by Rhasspy that uses VITS and larynx architectures. It runs entirely on CPU, making it ideal for edge devices, home automation, and applications requiring offline TTS. With over 100 voices across 30+ languages, Piper delivers natural-sounding speech at real-time speeds even on a Raspberry Pi 4.
Pinakamahusay para sa: Quick previews, accessibility, and embedded applications
> Subukan ang Libre
VITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.
Pinakamahusay para sa: General-purpose text-to-speech with natural prosody
> Subukan ang Libre
MeloTTS Free
MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.
Pinakamahusay para sa: > Production application na nangangailangan ng mabilis, multilingual TTS
> Subukan ang Libre
Bark Standard
Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.
Tagabuo: Suno · Lisensya: MIT
Subukan ito
Bark Small Standard
Lighter version of Bark with faster inference and lower memory usage.
Tagabuo: Suno · Lisensya: MIT
Subukan ito
CosyVoice 2 Standard
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Tagabuo: Alibaba (Tongyi Lab) · Lisensya: Apache 2.0
Subukan ito
Dia TTS Standard
Multi-speaker dialog generation model na lumilikha ng mga natural na pag-uusap sa pagitan ng mga nagsasalita.
Tagabuo: Nari Labs · Lisensya: Apache 2.0
Subukan ito
Parler TTS Standard
Describe the voice you want in natural language and Parler generates matching speech.
Tagabuo: Hugging Face · Lisensya: Apache 2.0
Subukan ito
IndexTTS-2 Standard
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Tagabuo: Index Team · Lisensya: Apache 2.0
Subukan ito
Spark TTS Standard
Voice cloning TTS with controllable emotion and speaking style via prompts.
Tagabuo: SparkAudio · Lisensya: Apache 2.0
Subukan ito
GPT-SoVITS Standard
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Tagabuo: RVC-Boss · Lisensya: MIT
Subukan ito
Orpheus Standard
Human-level emotional TTS model trained on 100K hours of speech data.
Tagabuo: Canopy Labs · Lisensya: Llama 3.2 Community
Subukan ito
Qwen3 TTS Standard
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Tagabuo: Alibaba (Qwen) · Lisensya: Apache 2.0
Subukan ito
CosyVoice 2
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Wika: en, zh, ja, ko, fr, de, it, es
Clone ng boses
IndexTTS-2
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Wika: en, zh
Clone ng boses
Spark TTS
Voice cloning TTS with controllable emotion and speaking style via prompts.
Wika: en, zh
Clone ng boses
GPT-SoVITS
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Wika: en, zh, ja, ko
Clone ng boses
Chatterbox
Ang mga ito ay tinatawag na "zero-shot" voice cloning na may emotion control mula sa Resemble AI.
Wika: en
Clone ng boses
Tortoise TTS
Ang multi-voice text-to-speech ay nakatuon sa kalidad na may autoregressive architecture.
Wika: en
Clone ng boses
OpenVoice
> Instant boses cloning na may granular kontrol sa estilo, damdamin, at accent.
Wika: en, zh, ja, ko, fr, de, es, it
Clone ng boses
Qwen3 TTS
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Wika: en, zh, ja, ko, de, fr, ru, pt, es, it
Clone ng bosesDeveloper-unang API
> OpenAI-compatible REST API. Isang endpoint, 22+ modelo. Streaming suporta para sa real-time na mga application.
- tl> OpenAI-compatible format
- > Streaming TTS para sa real-time apps
- > Batch pagpoproseso para sa malaking trabaho
- > Webhook mga notification
import requests
response = requests.post(
"https://api.tts.ai/v1/tts/",
headers={"Authorization": "Bearer sk-tts-xxx"},
json={
"model": "kokoro",
"text": "Hello from TTS.ai!",
"voice": "af_bella",
}
)
with open("output.mp3", "wb") as f:
f.write(response.content)
> Simple, Transparent Pagpepresyo
> Magsimula nang libre. Scale habang lumalaki ka.
Libre
> 50 credits
- Kokoro, Piper, VITS, MeloTTS
- > 500 character na limitasyon
- >3gen/oras (walang account)
Simula
> 500 credits/buwan
- Lahat ng 22+ modelo
- > 5,000 character na limitasyon
- > Voice pag-clone
Pro
> 2,000 credits/buwan
- Lahat ng bagay sa Starter
- API pag-access
- > Priority pagpoproseso
Enterprise
> 10,000 credits/buwan
- Lahat ng bagay sa Pro
- Bulk API
- > Priority queue
Mga Madalas Itanong
tl> Simulan ang Paggamit ng AI Voice Ngayon
> Sumali sa mga tagalikha, developer, at mga negosyo na gumagamit ng TTS.ai