MOSS-TTSD

Default Speaker

Emiselweyo IsiNgesi Neutral MOSS-TTSD

{igama} yi neutral AI yelizwi elinamandla eMOSS-TTSD umbhalo-kwi-speech model. Eli standard-level ilizwi lithetha IsiNgesi kwaye linika studio-quality speech synthesis. Nge phezulu unikezelo lwesantya kunye nomgangatho womgangatho we 5/5, Default Speaker ulungele podcasts, audiobooks, dubbed dialogue, conversational content with multiple voices. I-MOSS-TTSD injini iphuhliswe ngu OpenMOSS under the Apache 2.0 license, iyenza ikhuseleke kwimisebenzi yentengiso. Iinkqubo eziphambili ziquka: {iimpawu}. Imodeli MOSS-TTSD ixhasa ukuklonya kwesandi — ukufaka isampuli yesandi efutshane ukwenza isandi esizikhethelayo esigcina iimpawu zomgangatho ofanayo.

Akukho manqaku

MOSS-TTSDUlwazi lwemodeli

Imodeli MOSS-TTSD
Umbhekisi phambili OpenMOSS
Umgangatho
Isantya I-Media
Ilayisensi Apache 2.0
Ukuklona Ixhaswe
I-Tier Eqhelekileyo (2x uphawu)
Iiparamitha 7B
Uyilo lwezindlu MOSS-TTS-Delay + dialogue continuation head
Iminyaka 2026

Iinkqubo ezilungileyo zokusetyenziswa Default Speaker

Iinkqubo ezicetyiswayo ezisekelwe kwiimpawu zalo msindo

Iincwadi ezinesandi & Uxwebhu

Sebenzisa i {igama} ukuchaza imixholo yefom ende nge-prosody eqhelekileyo ne-expression.

Ividiyo

Yongeza ukuthetha okuzimeleyo kwiividiyo zeYouTube, iintengiso, kunye nemixholo yemidiya yoluntu.

Ipodcasts & Ukusasazwa

Imveliso elungileyo yestudio elungele iipodcasts, umculo, kunye nokusasazwa okuzimeleyo.

Igama leqela lenjongo ethile

Uhlobo lwesandi

I-More MOSS-TTSD IiNkokheli

Ezinye iingoma zemodeli efanayo ye-TTS

Default (Chinese)

IsiTshayina Neutral

Imibuzo ebuzwa rhoqo

MOSS-TTSD v1.0 from OpenMOSS is a 7B dialogue text-to-speech model that continues conversations from a short audio prompt. Supports up to 5 simultaneous speakers via [S1]/[S2] tags, zero-shot voice cloning from 3-10s reference audio, and up to 60 minutes of coherent multi-turn dialogue across 20 languages. Distinct from MOSS-TTS — TTSD is specialized for podcast/audiobook/dubbing workflows.

MOSS-TTSD was developed by OpenMOSS and is released under the Apache 2.0 license, which permits commercial use of generated audio.

MOSS-TTSD supports 20 languages: English, Chinese, German, Spanish, French, Japanese, Italian, Korean and more.

MOSS-TTSD is in the Standard tier — 2 credits per 1,000 characters. You can preview any MOSS-TTSD voice for free before generating full audio.

MOSS-TTSD has moderate generation speed. Generation typically takes a few seconds depending on text length.

MOSS-TTSD is rated 5/5 for audio quality on TTS.ai. It delivers studio-grade, human-like speech.

Yes, MOSS-TTSD supports zero-shot voice cloning. Upload 5-30 seconds of reference audio to create a custom voice.

Yes, MOSS-TTSD is specifically recommended for podcasts, audiobooks, dubbed dialogue, conversational content with multiple voices. Its multi-speaker dialogue, up to 5 speakers, 60min coherent audio capabilities make it an excellent choice for this use case.

Yes, MOSS-TTSD is licensed under Apache 2.0, which allows commercial use. Audio generated with MOSS-TTSD voices can be used in videos, podcasts, apps, games, and any other commercial project.

Ewe, zonke iingoma kwi-TTS.ai zisebenzisa iimodyuli ze-open-source ezilayisensiweyo ngentengiso (MIT, Apache 2.0). Isandi esiveliswe yiyo yakho ukuyisebenzisa kwividiyo, iipodcasts, iiapps, imidlalo, nakweyiphi na enye inkqubo yentengiso.

Thumela isicelo se POST ku /api/v1/tts/ ngegama lemodeli ne ID yesandi. Bona iphepha lethu le-API Documentation ngemizekelo yekhowudi kwi-Python, JavaScript, Go, kunye ne-cURL.

Ewe, nqakraza iqhosha lokudlala kweli phepha ukuva isampuli. Ungabhala umbhalo oqhelekileyo kwiphepha lombhalo ukuya kukuthetha kwaye wenze ukujonga kuqala simahla ngelizwi elithile.

Zama Default Speaker Ngoku

Bhala nawuphi na umbhalo uze uyiva ithetha ngu Default Speaker. Ifumaneka simahla.