2DIPW / audio_dataset_vpr
A voiceprint recognition classifier for audio dataset
☆98Updated last year
Alternatives and similar repositories for audio_dataset_vpr:
Users that are interested in audio_dataset_vpr are comparing it to the libraries listed below
- An auxiliary tool for manual screening of audio dataset.☆125Updated last year
- vits2 backbone with bert☆338Updated 11 months ago
- 一个快速制作语音数据集的可视化工具☆193Updated last year
- SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.☆202Updated last year
- VITS for Mandarin. Support Windows and Linux, low-end and high-end hardwares☆110Updated last year
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆166Updated last year
- A cli tool for split vocal timbre.☆223Updated last month
- ☆279Updated 6 months ago
- BertVITS2前端界面☆300Updated last year
- Preprocess Audio for training☆321Updated 3 weeks ago
- Simple data labeling script with funasr inside. 使用 阿里fanasr进行VITS训练数据标注☆77Updated last year
- 语音合成项目☆164Updated 2 years ago
- ☆443Updated last month
- Split audio using the .srt file, clean up annotations, then merge and package into a format suitable for bert-vits2 in a standard manner.…☆46Updated 9 months ago
- GPT-SoVITS2☆214Updated 8 months ago
- ☆134Updated last month
- vits2 backbone with bert☆84Updated last year
- StarRail Datasets For SVC/SVS/TTS☆302Updated 3 weeks ago
- application of vits on mandarin tts☆122Updated last year
- Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)☆160Updated last year
- Genshin Datasets For SVC/SVS/TTS☆655Updated last month
- async http process VST plugin☆162Updated 2 years ago
- DiffSinger community vocoders release page☆281Updated last month
- 从视频中收集语音素材用于AI翻唱炼丹的小工具☆35Updated 5 months ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆441Updated 2 years ago
- Inference Specialization☆434Updated 9 months ago
- Voice dataset of Genshin Impact 原神语音数据集☆698Updated last year
- AI吟美零式☆67Updated 11 months ago
- MSST-GUI is a Qt5-based inference GUI, designed to provide a convenient and intuitive way to inference (mainly for my own use)☆96Updated 4 months ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆48Updated 2 years ago