MOSS-TTSD

Default (Chinese)

Standard, Chinese, Neutral, MOSS-TTSD

Default (Chinese) is a neutral AI voice powered by the MOSS-TTSD text-to-speech model. This Standard-tier voice speaks Chinese and delivers studio-quality speech synthesis, with moderate generation speed and a quality rating of 5/5. It is well suited to podcasts, audiobooks, dubbed dialogue, and conversational content with multiple voices. The MOSS-TTSD engine is developed by OpenMOSS and released under the Apache 2.0 license, which permits commercial use. Key capabilities include multi-speaker dialogue with up to 5 speakers, up to 60 minutes of coherent audio, voice cloning, and support for 20 languages. To clone a voice, upload a short audio sample; the resulting custom voice retains the same quality characteristics.

MOSS-TTSD Model Information

Model: MOSS-TTSD
Developer: OpenMOSS
Quality: 5/5
Speed: Medium
License: Apache 2.0
Cloning: Supported
Tier: Standard (2x characters)
Parameters: 7B
Architecture: MOSS-TTS-Delay + dialogue continuation head
Year: 2026

Best Use Cases for Default (Chinese)

Recommended applications based on this voice's characteristics

Audiobooks & Narration

Use Default (Chinese) to narrate long-form content with natural prosody and expression.

Video Voiceovers

Add professional narration to YouTube videos, ads, and social media content.

Podcasts & Broadcasting

Studio-quality output suitable for podcasts, radio, and professional broadcasting.

Custom Brand Voice

Clone this voice style with your own audio to create a unique branded TTS voice.

More MOSS-TTSD Voices

Other voices from the same TTS model

Default Speaker

English Neutral

Frequently Asked Questions

MOSS-TTSD v1.0 from OpenMOSS is a 7B dialogue text-to-speech model that continues conversations from a short audio prompt. It supports up to 5 simultaneous speakers via [S1]/[S2] tags, zero-shot voice cloning from 3-10 seconds of reference audio, and up to 60 minutes of coherent multi-turn dialogue across 20 languages. It is distinct from MOSS-TTS: TTSD is specialized for podcast, audiobook, and dubbing workflows.
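As an illustration of the speaker tags mentioned above, here is a minimal Python sketch that joins dialogue turns into one tagged script. The helper name `build_dialogue_script` is hypothetical, and the exact layout (tags placed directly before each turn, no separators) is an assumption; check the model documentation for the format it actually expects.

```python
MAX_SPEAKERS = 5  # speaker limit stated in the description above


def build_dialogue_script(turns):
    """Join (speaker_number, text) pairs into one string with [S1]..[S5] tags.

    Hypothetical helper: the tag layout used here is an assumption.
    """
    parts = []
    for speaker, text in turns:
        if not 1 <= speaker <= MAX_SPEAKERS:
            raise ValueError(f"speaker must be in 1..{MAX_SPEAKERS}, got {speaker}")
        parts.append(f"[S{speaker}]{text.strip()}")
    return "".join(parts)


script = build_dialogue_script([
    (1, "你好，欢迎收听本期节目。"),
    (2, "谢谢，很高兴来到这里。"),
])
print(script)
```

Validating the speaker number up front keeps a script from silently exceeding the 5-speaker limit.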

MOSS-TTSD was developed by OpenMOSS and is released under the Apache 2.0 license, which permits commercial use of generated audio.

MOSS-TTSD supports 20 languages, including English, Chinese, German, Spanish, French, Japanese, Italian, and Korean.

MOSS-TTSD is in the Standard tier, which costs 2 credits per 1,000 characters. You can preview any MOSS-TTSD voice for free before generating full audio.
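For a rough sense of cost, the rate above can be turned into a quick estimate. This sketch assumes credits scale linearly with character count and that characters are counted the way Python's `len` counts them; the page does not say how billing rounds or counts characters, so treat the result as an approximation. The function name `estimate_credits` is hypothetical.

```python
CREDITS_PER_1000_CHARS = 2  # Standard-tier rate stated above


def estimate_credits(text):
    """Approximate credit cost, assuming linear proration with no rounding."""
    return len(text) * CREDITS_PER_1000_CHARS / 1000


sample = "a" * 1500
print(estimate_credits(sample))
```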

MOSS-TTSD has moderate generation speed. Generation typically takes a few seconds depending on text length.

MOSS-TTSD is rated 5/5 for audio quality on TTS.ai. It delivers studio-grade, human-like speech.

Yes, MOSS-TTSD supports zero-shot voice cloning. Upload 5-30 seconds of reference audio to create a custom voice.

Yes, MOSS-TTSD is specifically recommended for podcasts, audiobooks, dubbed dialogue, and conversational content with multiple voices. Its support for multi-speaker dialogue with up to 5 speakers and up to 60 minutes of coherent audio makes it an excellent choice for this use case.

Yes, MOSS-TTSD is licensed under Apache 2.0, which allows commercial use. Audio generated with MOSS-TTSD voices can be used in videos, podcasts, apps, games, and any other commercial project.

Yes, all voices on TTS.ai use commercially-licensed open-source models (MIT, Apache 2.0). The generated audio is yours to use in videos, podcasts, apps, games, and any other commercial application.

Send a POST request to /api/v1/tts/ with the model name and voice ID. See our API Documentation page for code examples in Python, JavaScript, Go, and cURL.
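The request described above might be assembled as in the following Python sketch, which uses only the standard library. Only the `/api/v1/tts/` path and the POST method come from this page. The base URL, the JSON field names (`model`, `voice`, `text`), the Bearer-token header, and the model and voice ID values are all assumptions for illustration; the API Documentation page is the authority on the real request shape.

```python
import json
import urllib.request


def build_tts_request(text, model, voice, api_key,
                      base_url="https://example.invalid"):
    """Build (but do not send) a POST request for the TTS endpoint.

    Assumed, not confirmed by this page: base_url, the payload field
    names, and the Authorization header scheme.
    """
    payload = {"model": model, "voice": voice, "text": text}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/api/v1/tts/",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


# Placeholder model and voice IDs; look up the real values in the docs.
req = build_tts_request("你好，世界", model="moss-ttsd",
                        voice="default-chinese", api_key="YOUR_API_KEY")
print(req.get_method(), req.full_url)

# Sending is left commented out because the endpoint details are assumed:
# with urllib.request.urlopen(req) as resp:
#     audio_bytes = resp.read()
```

Building the request separately from sending it makes it easy to inspect the URL and payload before spending credits.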

Yes, click the play button on this page to hear a sample. You can also type custom text on the Text to Speech page and generate a free preview with any voice.

Try Default (Chinese) Now

Type any text and hear it spoken by Default (Chinese). Free to use.