자유 AI 텍스트에서 음성으로Name
22개 이상의 오픈 소스 모델, 100개 이상의 음성, 32+ 계정이 필요하지 않습니다.
음성 AI에 필요한 모든 것
24개 이상의 오픈 소스 AI 모델을 기반으로 한 26개의 도구
22+ AI 음성 모델
하나의 플랫폼에서 가장 포괄적인 오픈 소스 TTS 모델 컬렉션
Kokoro Free
Kokoro is an 82 million parameter text-to-speech model that punches well above its weight class. Despite its tiny size, it produces remarkably natural and expressive speech. Kokoro supports multiple languages including English, Japanese, Chinese, and Korean with a variety of expressive voices. It runs incredibly fast — generating audio nearly 100x faster than real-time on a GPU.
가장 적합한 곳: High-quality TTS with minimal latency, streaming applications
무료로 사용해 보기
Piper Free
Piper is a lightweight text-to-speech engine developed by Rhasspy that uses VITS and larynx architectures. It runs entirely on CPU, making it ideal for edge devices, home automation, and applications requiring offline TTS. With over 100 voices across 30+ languages, Piper delivers natural-sounding speech at real-time speeds even on a Raspberry Pi 4.
가장 적합한 곳: Quick previews, accessibility, and embedded applications
무료로 사용해 보기
VITS Free
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. It adopts variational inference augmented with normalizing flows and an adversarial training process, achieving a significant improvement in naturalness.
가장 적합한 곳: General-purpose text-to-speech with natural prosody
무료로 사용해 보기
MeloTTS Free
MeloTTS by MyShell.ai is a multilingual TTS library supporting English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. It is extremely fast, processing text at near real-time speed on CPU alone. MeloTTS is designed for production use and supports both CPU and GPU inference.
가장 적합한 곳: 빠른 다국어 TTS가 필요한 프로덕션 애플리케이션
무료로 사용해 보기
Bark Standard
Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.
개발자: Suno · 라이선스: MIT
시도해 보세요
Bark Small Standard
Lighter version of Bark with faster inference and lower memory usage.
개발자: Suno · 라이선스: MIT
시도해 보세요
CosyVoice 2 Standard
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
개발자: Alibaba (Tongyi Lab) · 라이선스: Apache 2.0
시도해 보세요
Parler TTS Standard
Describe the voice you want in natural language and Parler generates matching speech.
개발자: Hugging Face · 라이선스: Apache 2.0
시도해 보세요
IndexTTS-2 Standard
Zero-shot TTS with fine-grained emotion control and high expressiveness.
개발자: Index Team · 라이선스: Apache 2.0
시도해 보세요
Spark TTS Standard
Voice cloning TTS with controllable emotion and speaking style via prompts.
개발자: SparkAudio · 라이선스: Apache 2.0
시도해 보세요
GPT-SoVITS Standard
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
개발자: RVC-Boss · 라이선스: MIT
시도해 보세요
Orpheus Standard
Human-level emotional TTS model trained on 100K hours of speech data.
개발자: Canopy Labs · 라이선스: Llama 3.2 Community
시도해 보세요
Qwen3 TTS Standard
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
개발자: Alibaba (Qwen) · 라이선스: Apache 2.0
시도해 보세요
CosyVoice 2
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
언어: en, zh, ja, ko, fr, de, it, es
음성 복제
IndexTTS-2
Zero-shot TTS with fine-grained emotion control and high expressiveness.
언어: en, zh
음성 복제
Spark TTS
Voice cloning TTS with controllable emotion and speaking style via prompts.
언어: en, zh
음성 복제
GPT-SoVITS
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
언어: en, zh, ja, ko
음성 복제
OpenVoice
Instant voice cloning with granular control over style, emotion, and accent.
언어: en, zh, ja, ko, fr, de, es, it
음성 복제
Qwen3 TTS
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
언어: en, zh, ja, ko, de, fr, ru, pt, es, it
음성 복제개발자 우선 API
OpenAI 호환 REST API. 하나의 엔드포인트, 22개 이상의 모델. 실시간 애플리케이션을 위한 스트리밍 지원.
- OpenAI 호환 형식
- 실시간 앱을 위한 TTS 스트리밍
- 대규모 작업을 위한 일괄 처리
- 웹훅 알림
import requests
response = requests.post(
"https://api.tts.ai/v1/tts/",
headers={"Authorization": "Bearer sk-tts-xxx"},
json={
"model": "kokoro",
"text": "Hello from TTS.ai!",
"voice": "af_bella",
}
)
with open("output.mp3", "wb") as f:
f.write(response.content)
간단하고 투명한 가격 정책
무료로 시작하고 성장함에 따라 확장하세요.