MOSS-TTSD

Default (Chinese)

Default Chinese Neutral MOSS-TTSD

Default (Chinese) is a neutral AI voice powered by the MOSS-TTSD text-to-speech model. This Standard-tier voice speaks Chinese and delivers studio-quality speech synthesis. With moderate generation speed and a quality rating of 5/5, Default (Chinese) is well suited to podcasts, audiobooks, dubbed dialogue, and conversational content with multiple voices. The MOSS-TTSD engine was developed by OpenMOSS and is released under the Apache 2.0 license, which makes it safe for commercial use. Key features include multi-speaker dialogue, support for up to 5 speakers, and up to 60 minutes of coherent audio. MOSS-TTSD also supports voice cloning: upload a short audio sample to create a custom voice that keeps the same tonal characteristics.


MOSS-TTSD Model Information

Model: MOSS-TTSD
Developer: OpenMOSS
Quality: 5/5
Speed: Medium
License: Apache 2.0
Cloning: Supported
Tier: Standard (2x credits)
Parameters: 7B
Architecture: MOSS-TTS-Delay + dialogue continuation head
Year: 2026

Best Use Cases for Default (Chinese)

Recommended uses based on this voice's characteristics

Audiobooks and narration

Use Default (Chinese) to narrate long-form content with natural prosody and expression.

Video voiceovers

Add high-quality voiceover to YouTube videos, ads, and social media content.

Podcasts and broadcast

Studio-quality output suitable for podcasts, radio, and professional broadcasting.

Custom brand voice

Voice library

More MOSS-TTSD Voices

Other voices from the same TTS model

Default Speaker

English Neutral

Frequently Asked Questions

MOSS-TTSD v1.0 from OpenMOSS is a 7B dialogue text-to-speech model that continues conversations from a short audio prompt. It supports up to 5 simultaneous speakers via [S1]/[S2] tags, zero-shot voice cloning from 3-10 seconds of reference audio, and up to 60 minutes of coherent multi-turn dialogue across 20 languages. It is distinct from MOSS-TTS: TTSD is specialized for podcast, audiobook, and dubbing workflows.
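As a rough illustration of the [S1]/[S2] speaker tags mentioned above, the sketch below turns a list of turns into a tagged script and enforces the 5-speaker limit. The helper name `format_dialogue` and the one-turn-per-line layout are assumptions made for this example; the exact script layout the model expects is not described on this page.

```python
MAX_SPEAKERS = 5  # speaker limit stated in the description above


def format_dialogue(turns):
    """Join (speaker_number, text) pairs into a single [S1]/[S2]-tagged script.

    Hypothetical helper: only the [S<n>] tag style comes from the page;
    the newline-separated layout is an assumption.
    """
    speakers = {speaker for speaker, _ in turns}
    if any(not isinstance(s, int) or s < 1 for s in speakers):
        raise ValueError("speaker numbers must be positive integers")
    if len(speakers) > MAX_SPEAKERS:
        raise ValueError(f"at most {MAX_SPEAKERS} speakers are supported")
    # One tagged turn per line; surrounding whitespace in each turn is trimmed.
    return "\n".join(f"[S{speaker}]{text.strip()}" for speaker, text in turns)


if __name__ == "__main__":
    print(format_dialogue([(1, "Welcome back to the show."), (2, "Glad to be here.")]))
```

Keeping turns as structured pairs until the last step makes it easy to check the speaker count before any audio is generated.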

MOSS-TTSD was developed by OpenMOSS and is released under the Apache 2.0 license, which permits commercial use of generated audio.

MOSS-TTSD supports 20 languages, including English, Chinese, German, Spanish, French, Japanese, Italian, and Korean.

MOSS-TTSD is in the Standard tier — 2 credits per 1,000 characters. You can preview any MOSS-TTSD voice for free before generating full audio.
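To make the pricing concrete, here is a small estimate based on the quoted rate of 2 credits per 1,000 characters. It assumes pro-rata billing and that every character counts, including spaces and punctuation; the page does not say how partial thousands are rounded, so treat the result as an approximation.

```python
CREDITS_PER_1000_CHARS = 2  # Standard-tier rate quoted above


def estimate_credits(text, rate=CREDITS_PER_1000_CHARS):
    """Return a pro-rata credit estimate for synthesizing `text`.

    Assumption: no rounding and no minimum charge are applied.
    """
    return len(text) * rate / 1000


if __name__ == "__main__":
    script = "a" * 1500  # stand-in for a 1,500-character script
    print(estimate_credits(script))
```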

MOSS-TTSD has moderate generation speed. Generation typically takes a few seconds depending on text length.

MOSS-TTSD is rated 5/5 for audio quality on TTS.ai. It delivers studio-grade, human-like speech.

Yes, MOSS-TTSD supports zero-shot voice cloning. Upload 5-30 seconds of reference audio to create a custom voice.
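Before uploading a clip, it can help to confirm that it falls inside the 5-30 second range quoted above. The sketch below does this for uncompressed WAV files using Python's standard `wave` module. The function names are hypothetical, and other formats such as MP3 would need a different reader.

```python
import wave

MIN_SECONDS, MAX_SECONDS = 5.0, 30.0  # upload range quoted above


def wav_duration_seconds(source):
    """Return the duration of a WAV file given a path or a binary file object."""
    with wave.open(source, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_valid_reference(source):
    """Check that a WAV clip is within the quoted reference-audio range."""
    return MIN_SECONDS <= wav_duration_seconds(source) <= MAX_SECONDS
```

Call `is_valid_reference("sample.wav")` with the path to your own clip; a result of `False` means the clip should be trimmed or extended before upload.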

Yes, MOSS-TTSD is specifically recommended for podcasts, audiobooks, dubbed dialogue, and conversational content with multiple voices. Its multi-speaker dialogue (up to 5 speakers) and up to 60 minutes of coherent audio make it an excellent choice for this use case.

Yes, MOSS-TTSD is licensed under Apache 2.0, which allows commercial use. Audio generated with MOSS-TTSD voices can be used in videos, podcasts, apps, games, and any other commercial project.

Yes, all voices on TTS.ai use open-source models licensed for commercial use (MIT, Apache 2.0). The audio you generate is yours to use in videos, podcasts, apps, games, and any other commercial project.

Send a POST request to /api/v1/tts/ with the model name and voice ID. See our API Documentation page for code examples in Python, JavaScript, Go, and cURL.
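The request described above might look something like the following sketch. Only the POST method and the /api/v1/tts/ path come from this page. The host, the JSON field names (`model`, `voice`, `text`), the example IDs, and the bearer-token header are all assumptions for illustration, so check the API Documentation page for the real schema.

```python
import json
import urllib.request

BASE_URL = "https://tts.ai"  # assumed host; only the path below is given on this page


def build_tts_request(text, model="moss-ttsd", voice="default-chinese",
                      api_key="YOUR_API_KEY"):
    """Build (but do not send) a POST request for the TTS endpoint.

    The model and voice IDs, field names, and auth header are hypothetical.
    """
    payload = {"model": model, "voice": voice, "text": text}
    return urllib.request.Request(
        url=BASE_URL + "/api/v1/tts/",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_tts_request("你好，世界")
    print(req.get_method(), req.full_url)
    # To send it: urllib.request.urlopen(req). The response format is not
    # described on this page, so handling it is left out of this sketch.
```

Building the request separately from sending it keeps the payload easy to inspect before any credits are spent.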

Yes, click the play button on this page to listen to a sample. You can also type custom text on the Text to Speech page and generate a free preview with any voice.

Try Default (Chinese) Now

Type any text and hear it spoken by Default (Chinese). It is free to use.