GPTSoVITS

On A24

Clone repo

cd /rscratch/tk/Desktop/
git clone https://github.com/RVC-Boss/GPT-SoVITS.git
cd ./GPT-SoVITS/

Create Conda environment

conda create -p ./.conda-env/ python=3.9
conda activate ./.conda-env/

# conda install -c conda-forge gcc
# conda install -c conda-forge gxx
# conda install ffmpeg cmake
# conda install pytorch==2.1.1 torchvision==0.16.1 torchaudio==2.1.1 pytorch-cuda=11.8 -c pytorch -c nvidia
pip install -r requirements.txt

Install required models and tools

optional

Dataset labeling

0c-Chinese ASR tool

input folder path

output folder path

ASR model

Faster Whisper - large-v3

0d-Speech to text proofreading tool

.list annotation file path

Training

1A-Dataset formatting

Experiment/model name: borav2

*Text labelling file

*Audio dataset folder

Inference

sample

Model Architecture

Last updated