# Applio

Last update: Apr 01, 2024


image
#

# Introduction ‎

  • Applio is a VITS-based Voice Conversion Tool developed by the IA Hispano team.

  • It's liked for its great UI & lots of extra features, such as TTS (with RVC models too), plugins, automatic model upload, customizable theme & more.

  • Because of its user-friendly experience & active development, it's considered to be one of the best forks.

  • It also has a cloud version, in case you don't meet the requirements to run it locally.

# Pros & Cons

✔️ PROS
CONS
  • Very complete
  • Has an active development
  • Currently stable
  • Very fast
  • TTS features
  • Automatic model upload
  • Has Mangio-Crepe
  • User-friendly UI
  • TensorBoard included
  • Extra features: (plugins, model fusion, etc)
  • Issues when downloading to external drives

#

# Download

#

  1. The easiest way to download Applio is by going to Applio's Hugging Face repo, and clicking the [ download ] button on the right-hand side.

    image

  1. Unzip the folder. It may take a few minutes.

  1. Open Applio's folder & execute run-applio.bat.

    image

  • A console tab will appear, and after a moment your default browser, with Applio ready to use.

#
#

# Inference

#
#

# 1. Upload voice model.

  • Go to the Download tab.
    You have two ways of uploading it: through its link or manually inputting its files.

    1. Go to the Download tab & paste the link of the model in the Model Link bar. It must be from Hugging Face or Google Drive.

      image
    2. Press Download Model.
    1. Drag & drop the model's .PTH in the Drop files box below.

      image
    2. Then drag the .INDEX.

# 2. Select voice model.

  1. Return to the Inference tab & click the Refresh button on the right.

    image

  2. Select your model in the Voice Model dropdown.

    image

#

# 3. Input vocals.

  • With Applio you can convert audios individually or in batches:

    1. Drag & drop the audio or click the upload box to search it.

      image
    2. Then select it in the dropdown below.

      image
    1. Go to the Batch tab.
      image

    2. In the Input Folder bar, paste the path folder containing the audios.

      In Output Folder you can paste a path folder for the results.

      Ensure the paths don't contain spaces/special characters.

# 4. Modify settings. (optional)

  • Unfold Advanced Settings if you wish to modify the inference settings for better results, or to determine the output folder.

    image


#

# 5. Convert.

  1. Click Convert at the bottom. The audio will begin to process.
    The processing time will mainly depend on your specs, length of audio & the algorithm picked.

  2. Once it's done, you can hear the results in the Export Audio box below.

    By default the output files will be in the "audios" folder: \ApplioV3.0.7\assets\audios


#
#

# Training

#
#
#
# a. Model Name
#
  • Go to the Train tab. Input a name for your model in Model Name.
    Don't include spaces/special characters.

    image


#
# b. Dataset Path
#
  • Paste the path file of your dataset in the Dataset Path bar. Ensure the path doesn't contain spaces/special characters.

    image


#
# c. Sampling Rate
#
  • Select your dataset's sample rate. If you don't know the amount, click here.

    image


#
# d. Preprocess Dataset
#
  • Ensure RVC Version is set as V2 & click Preprocess Dataset.

    It'll finish when the output box says preprocessed successfully.

    image
#
# a. Pitch extraction algorithm
#
  • Select the algorithm you want. Use either Crepe or RMVPE, as the rest are outdated.

    image

#
# b. Hop Length (optional)
#
  • If you chose Crepe, you can modify its hop length.

    image

#
# c. Extract Features
#
  • Press Extract Features.
    It'll finish when it says extracted successfully.

    image
#
# a. Batch Size
#
  • If you are a newbie, use 8. But in case your dataset is short (around 2 minutes or less), use 4.

    image

#
# b. Save Every Epoch
#
  • Frequency of the saving checkpoints, based on the epochs.

  • If you are a newbie, simply leave it at 15.

    image

  • E.g: with a value of 10, they will be saved after the epoch 10, 20, 30, etc.


#
# c. Total Epoch
#
  • Input the total amount of epochs (training cycles) for the model.

  • But since we'll use TensorBoard, use an arbitrarily large value like 1000

    image


#
# d. GPU Settings
#
  • If you have multiple GPUs, tick GPU Settings to use a specific one for the training.

    image


#
# e. Generate Index
#
  • Click Generate Index. This will create the model's .INDEX file.

#
# f. Start Training
#
  • Press Start Training to begin the training process.

  • To open TB, execute run-tensorboard in Applio's folder. Remember to monitor it, as well as the console just in case.

  • The latter will show you errors if they happen, and information about the epochs & checkpoints.

    image
#
# a. Stop training
#
  • When you're very sure of overtraining, you can stop training by going to the Settings tab & press Restart Applio.

    image


#
# b. Get the INDEX
#
  1. Create a new folder anywhere named as the model.
  2. Open Applio's folder, go to logs, and open the folder named as the model.
  3. Select the .INDEX named added_ & move it to your newly made folder.

    image‎ ‎

#
# c. Get the PTH
#
  1. In said folder you'll also find all the checkpoints.

  2. Select the one closest to before the overtraining point, and move it to the new folder.

    The checkpoints will be organized with this format: ModelName_Epoch_Step.pth
    Example: arianagrande_e60_s120.pth


  1. And that's all, have fun with your model. To test it, do a normal inference as usual.
  • In case the training finished but the model still needed training, you don't have to start from scratch.
  • Simply enter the same settings & criteria that you've previously inserted. You don't have to do the preprocess or train the .INDEX again.
  • You can change the save frequency, or increase the Total Epoch amount in case you didn't input enough before.
  • Begin training again & remember to monitor TB & console like before.
#
#
#

# TTS

+ with any RVC model

#
  • Applio is also known for having one TTS tool by default, with plenty of voices to choose for.

  • You can also use it with RVC models & apply the inference settings if you wish.

  • Aditionally, you can download the Eleven Labs TTS plugin.


#

# Instructions:

  1. Go to the TTS tab.

    image


#
  1. If you want to use an RVC model, download it, go to TTS, click Refresh & select it in Voice Model & Index File.

    image

  • To modify the inference settings or the output folder for the TTS/RVC audio, unfold Advanced Settings.

#
  1. In TTS Voices select the voice of your desired language, accent & gender.

    In Text to Synthesize input your text. Then click Convert.

    image

  • If you are using an RVC model, select a voice that matches the model the most, to guarantee great results.

#
  1. Once it's done, you'll be able to hear the result in the Export Audio box. By default, the output audio will be in the "audios" folder. < \ApplioV3.0.7\assets\audios >

    image


#
#

# Extra

#
  • Applio has an Extra menu, containing an audio analyzer, originally made by Ilaria.

  • Making it convenient for determining the sample rate of datasets when training models.

  • It also contains the model fusion tool, ideal for advanced users.


#

# Audio Analyzer:

  1. Go to the Extra tab & press the upload box to input your audio. Or simply drag & drop.

    image


#
  1. Once it's done uploading, click Get information about the audio.

#
  1. In Sampling rate you'll see the audio's full sample rate. Use said value for training.

    image

#
#
  • # Example:

    image

Here it reached 20 kHz. Doubling it gives 40kHz. Therefore the ideal target sample rate would be 40k


#
#

# Plugins

  • Plugins are components that you can add to Applio, that add new features & enhance your experience.

  • These are made by the public, and are free & easy to install.

  • You can find them on their GitHub page. More will be added in the future.


#

# Installation:

  1. Access their GitHub page & click on the name of the plugin you want.

    image


#
  1. Click on the ZIP file.

    image

  • Click on the download button on the right. This will download the ZIP file of the plugin.

    image


#
  1. Open Applio & head over to the Plugins tab. Drag & drop the ZIP file to the upload box.

    image

  • You will be able to see its installation process in the console.

    image


#
  1. Go to the settings tab & click Restart Applio at the bottom. Then you'll be able to see the plugin in the Plugins tab.

    image


#
#

# Troubleshooting

#
#
  • If it's lower than 32k: select 32k.
  • If it's 44.1k: select 40k.
  • If i'ts higher than 48k: select 48k.
#
  • This a phenomenon called artifacting. To fix it, read here.
#
  • Report your issue here.

#

# You have reached the end.

Report Issues