AI Audio Inpainting
Replace a section of audio with AI-synthesized speech that matches the surrounding voice. Fix a bad take without re-recording the whole thing.
Upload Audio to Inpaint
500 characters per second of audio replaced
Drag and drop a file here, or browse
Supports MP3, WAV, FLAC, OGG, M4A. Max 50 MB. Up to 10 minutes.
Source audio: scrub to find the bad take
Inpaint Settings
How Audio Inpainting Works
Inpainting is the audio equivalent of Photoshop's content-aware fill. We clone the voice from the audio surrounding your selection, synthesize the new line in that voice, and splice it back with a short crossfade.
Best results: leave at least 3 seconds of clean speech immediately before the edit point so the cloner has good reference material.
Tips for Best Results
- Keep the marked range as tight as possible — only the bad take
- Replacement text should be roughly the same length as what it replaces
- Set the language to match the source audio for best voice match
- An 80 ms crossfade is usually inaudible; increase it to 150 ms if you hear a click
- For long edits (>10s), consider re-recording the whole passage instead
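The tips above can be turned into a quick pre-flight check. The sketch below is illustrative only: the function names, the 25% length tolerance, and the use of character counts as a proxy for spoken length are assumptions, not part of the product.

```python
LONG_EDIT_SECONDS = 10.0  # from the tips: longer edits are better re-recorded


def crossfade_samples(crossfade_ms, sample_rate):
    """Number of audio samples covered by a crossfade of the given length."""
    return round(crossfade_ms / 1000 * sample_rate)


def edit_warnings(start_s, end_s, original_text, replacement_text, tolerance=0.25):
    """Return human-readable warnings for a planned edit (hypothetical helper).

    `tolerance` is an assumed threshold for "roughly the same length",
    measured in characters rather than in spoken duration.
    """
    warnings = []
    if end_s - start_s > LONG_EDIT_SECONDS:
        warnings.append("edit is longer than 10 s; consider re-recording the passage")
    if original_text:
        ratio = len(replacement_text) / len(original_text)
        if abs(ratio - 1.0) > tolerance:
            warnings.append("replacement text length differs noticeably from the original")
    return warnings
```

At a 44.1 kHz sample rate, `crossfade_samples(80, 44100)` gives 3528 samples and `crossfade_samples(150, 44100)` gives 6615, which shows how little audio either crossfade setting actually touches.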
How AI Audio Inpainting Works
Surgical edits, voice-matched, with no re-recording session.
Upload + Mark Range
Upload your audio and use the scrubber to mark the start/end of the section you want to replace. Type the replacement text.
Voice Clone + Synthesize
We extract up to 12 seconds of clean reference audio surrounding your selection, clone the speaker's voice, and synthesize the new line in that voice.
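One way to picture the reference extraction is as a time-window calculation around the marked range. The sketch below assumes the 12-second budget is spent on audio before the selection first and any remainder is taken from after it; that priority, the function name, and the parameters are assumptions, and detecting whether the audio is actually "clean" is out of scope here.

```python
def reference_windows(sel_start, sel_end, duration, budget=12.0):
    """Return ((before_start, before_end), (after_start, after_end)) in seconds.

    Hypothetical helper: spends up to `budget` seconds of reference audio,
    preferring speech immediately before the selection.
    """
    if not 0 <= sel_start < sel_end <= duration:
        raise ValueError("selection must lie inside the source audio")
    before_len = min(sel_start, budget)
    after_len = min(duration - sel_end, budget - before_len)
    before = (sel_start - before_len, sel_start)
    after = (sel_end, sel_end + after_len)
    return before, after
```

For a selection from 5 s to 9 s in a 60-second file, this yields 5 seconds of reference before the edit and 7 seconds after it, for 12 seconds in total.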
Crossfade Splice
The synthesized clip is spliced into the original recording with an equal-power crossfade at both edit points. The boundaries are inaudible.
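An equal-power crossfade uses cosine and sine gain curves so that the squared gains of the outgoing and incoming audio always sum to one. The following is a minimal pure-Python sketch of that idea, not the service's implementation: it treats audio as lists of float samples, and it assumes the synthesized clip carries `fade` extra samples on each side to overlap with the original.

```python
import math


def equal_power_gains(n):
    """Return n (fade_out, fade_in) gain pairs with fade_out**2 + fade_in**2 == 1."""
    gains = []
    for i in range(n):
        t = (i + 0.5) / n  # position within the fade, strictly between 0 and 1
        gains.append((math.cos(t * math.pi / 2), math.sin(t * math.pi / 2)))
    return gains


def crossfade(outgoing, incoming):
    """Blend two equal-length sample lists with equal-power gains."""
    if len(outgoing) != len(incoming):
        raise ValueError("crossfade inputs must have the same length")
    gains = equal_power_gains(len(outgoing))
    return [a * g_out + b * g_in for a, b, (g_out, g_in) in zip(outgoing, incoming, gains)]


def splice(original, clip, start, end, fade):
    """Replace original[start:end] with clip, crossfading `fade` samples at each edit point."""
    if fade < 0 or start > end or start - fade < 0 or end + fade > len(original):
        raise ValueError("edit range plus fades must fit inside the original")
    if len(clip) < 2 * fade:
        raise ValueError("clip is too short for the requested fades")
    head = original[:start - fade]
    fade_in = crossfade(original[start - fade:start], clip[:fade])
    body = clip[fade:len(clip) - fade]
    fade_out = crossfade(clip[len(clip) - fade:], original[end:end + fade])
    tail = original[end + fade:]
    return head + fade_in + body + fade_out + tail
```

If the clip is exactly `(end - start) + 2 * fade` samples long, the output has the same length as the original; a longer or shorter clip shifts everything after the edit accordingly.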
Audio Inpainting Plans
Start free, upgrade when you need more
- Up to 10-minute source files
- 500-character replacement text
- 4-second inpaint per request
- 80ms crossfade splice
- OpenVoice + CosyVoice 2 backends
- Up to 10-minute source files
- 5,000-character replacement text
- Tunable crossfade (0-250ms)
- Voice-model override
- Generation history + re-edit
- Up to 30-minute source files
- 100,000-character replacement text
- Priority GPU queue
- API access (/v1/audio-inpaint/)
- Batch inpainting (multiple ranges)
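The limits in the three feature lists above can be expressed as a small lookup table for client-side validation. This is a sketch under stated assumptions: the plan names are not given here, so `tier_1` to `tier_3` are placeholder keys, and the per-request inpaint cap is only listed for the first tier, so the other tiers are left without one.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class PlanLimits:
    max_source_minutes: int
    max_replacement_chars: int
    max_inpaint_seconds: Optional[float]  # None: no per-request cap is listed


# Placeholder tier names; the numbers come from the feature lists above.
PLAN_LIMITS = {
    "tier_1": PlanLimits(10, 500, 4.0),
    "tier_2": PlanLimits(10, 5_000, None),
    "tier_3": PlanLimits(30, 100_000, None),
}


def check_request(tier, source_seconds, inpaint_seconds, replacement_text):
    """Return a list of limit violations for a planned request (hypothetical helper)."""
    limits = PLAN_LIMITS[tier]
    problems = []
    if source_seconds > limits.max_source_minutes * 60:
        problems.append("source file is longer than the plan allows")
    if len(replacement_text) > limits.max_replacement_chars:
        problems.append("replacement text exceeds the plan's character limit")
    if limits.max_inpaint_seconds is not None and inpaint_seconds > limits.max_inpaint_seconds:
        problems.append("marked range is longer than the per-request inpaint limit")
    return problems
```

An empty list means the request fits within the listed limits for that tier; the 50 MB upload cap and supported file formats would need separate checks.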
Frequently Asked Questions
Fix Your Audio in Seconds
Replace any part of any recording with AI-synthesized speech that matches the original voice. Sign up free to start.