#
Artifacting
Last update: Feb 10, 2024
#
#
Introduction
In RVC, artifacting refers to an anomaly where the output voice sounds "robotic" & glitchy.
This occurs after the inference or model training process.
#
#
Causes
It usually occurs when the dataset/vocal sample meets any of these criteria:
• Audio is low-quality
• Voice model was overall poorly trained
• There are overlapping voices
• There is reverb
• There is noise
As you noticed, most of the issues boil down to the audio sample not being properly clean. RVC is built for purely working with voices, not other sounds.
Remember that the cleaner your input audio is, the better the results.
#
#
Solutions
#
1. Use a lossless format:
If possible, it's best if your audio is in a lossless format like WAV or FLAC, preserving its original quality.
Avoid using lossy ones like MP3 or OGG.
#
2. If doing inference:
Remove undesired noises with an vocal isolation software.
Lowering the search feature ratio can also minimize this issue.
If breathing sounds produce it, lower the Protection value.
#
3. If training models:
- Ensure to clean your dataset properly, this includes removing silences and distortions.