AI Voice Library
Browse, preview, and compare 100+ AI voices across 24+ models. Find the perfect voice for your project.
101 voices found
필터와 일치하는 음성이 없습니다. 검색 기준을 조정해 보십시오.
Voices by AI Model
Each TTS model has its own set of voices with unique characteristics. Some models support voice cloning, allowing you to use any voice as a reference.
Transformer-based text-to-audio model that generates realistic speech, music, and sound effects.
Chatterbox
1 voices
최고급
모델 시도
State-of-the-art zero-shot voice cloning with emotion control from Resemble AI.
Default
English
CosyVoice 2
5 voices
표준
모델 시도
Alibaba's scalable streaming TTS with human-parity naturalness and near-zero latency.
Multi-speaker dialog generation model that creates natural conversations between speakers.
GPT-SoVITS
1 voices
표준
모델 시도
Few-shot voice cloning TTS that replicates any voice from just 5 seconds of audio.
Default
Chinese
IndexTTS-2
1 voices
표준
모델 시도
Zero-shot TTS with fine-grained emotion control and high expressiveness.
Default
EnglishLightweight 82M parameter model delivering studio-quality speech with blazing-fast inference.
High-quality multilingual text-to-speech that runs on CPU with minimal latency.
Instant voice cloning with granular control over style, emotion, and accent.
Default
English
Parler TTS
1 voices
표준
모델 시도
Describe the voice you want in natural language and Parler generates matching speech.
Default
EnglishA fast, local neural text to speech system optimized for Raspberry Pi and embedded devices.
Alibaba's multilingual TTS with voice cloning, preset voices, and voice design from text.
Voice cloning TTS with controllable emotion and speaking style via prompts.
Default
English
StyleTTS 2
1 voices
최고급
모델 시도
Human-level text-to-speech through style diffusion and adversarial training.
Default
English
Tortoise TTS
1 voices
최고급
모델 시도
Multi-voice text-to-speech focused on quality with autoregressive architecture.
Random
EnglishAI 음성 이해
Voice Quality Tiers
TTS.ai offers voices across three quality tiers. Free-tier voices from Piper, VITS, and MeloTTS deliver fast, good-quality synthesis at no cost. Standard-tier voices from models like Kokoro and CosyVoice 2 offer more natural prosody and emotion. Premium-tier voices from OpenVoice, Chatterbox, and StyleTTS 2 provide the most realistic, human-like speech available in open-source TTS.
다국어 음성
Many voices support multiple languages. Some models like CosyVoice 2 and GPT-SoVITS support cross-lingual synthesis, where a voice trained in one language can speak naturally in another. The language filter above lets you find voices that natively support your target language, ensuring the best pronunciation and intonation.
Voice Cloning
Some models support voice cloning, which means you can use any voice as a reference to create speech that sounds like that person. Upload a short audio sample (10-30 seconds) and the model will adapt to match the voice characteristics. Models that support cloning include GPT-SoVITS, CosyVoice 2, and Chatterbox.
올바른 목소리 선택
최상의 음성은 사용 사례에 따라 달라집니다. 오디오북과 팟캐스트의 경우 자연스러운 음조를 갖춘 프리미엄 음성을 사용하세요. 게임 캐릭터의 경우 모델 간 다양한 음성을 탐색하세요. 접근성과 화면 리더의 경우 명확하고 빠른 음성을 선택하세요. 빠른 프로토타이핑의 경우 무료 계층 음성은 크레딧 비용 없이 즉각적인 결과를 제공합니다. 선택하기 전에 재생 버튼을 사용하여 각 음성을 미리 볼 수 있습니다.
자주 묻는 질문
음성 녹음, 향상 및 변환
Preview any voice, then use it directly in Text to Speech. Sign up free and get 50 credits to try premium voices.