2DIPW / audio_dataset_screenerLinks
An auxiliary tool for manual screening of audio dataset.
☆126Updated last year
Alternatives and similar repositories for audio_dataset_screener
Users that are interested in audio_dataset_screener are comparing it to the libraries listed below
Sorting:
- A voiceprint recognition classifier for audio dataset☆100Updated last year
- async http process VST plugin☆161Updated 2 years ago
- vits2 backbone with bert☆340Updated last year
- Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)☆161Updated 2 years ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆48Updated 2 years ago
- VITS for Mandarin. Support Windows and Linux, low-end and high-end hardwares☆110Updated 2 years ago
- vits2 backbone with bert☆85Updated last year
- VITS web UI☆43Updated 2 years ago
- 一个快速制作语音数据集的可视化工具☆193Updated last year
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆166Updated last year
- SoVits Gradio(Web UI)☆26Updated 2 years ago
- Simple data labeling script with funasr inside. 使用阿里fanasr进行VITS训练数据标注☆79Updated last year
- ☆284Updated 8 months ago
- 数据集自动化制作脚本☆71Updated 2 years ago
- BertVITS2前端界面☆302Updated last year
- 语音合成项目☆164Updated 2 years ago
- StarRail Datasets For SVC/SVS/TTS☆318Updated last week
- SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.☆202Updated last year
- ☆81Updated last year
- application of vits on mandarin tts☆122Updated 2 years ago
- OpenUTAU renderer for diffsinger / 适用于diffsinger的OpenUTAU渲染器,使用方法:https://github.com/xunmengshe/OpenUtau/wiki/%E4%BD%BF%E7%94%A8%E6%96%B9…☆24Updated 2 years ago
- ☆138Updated 4 months ago
- Split audio using the .srt file, clean up annotations, then merge and package into a format suitable for bert-vits2 in a standard manner.…☆48Updated 11 months ago
- VitsWebUi☆33Updated 2 years ago
- A cli tool for split vocal timbre.☆234Updated 3 months ago
- ACG Text-to-Speech☆175Updated 2 years ago
- MSST-GUI is a Qt5-based inference GUI, designed to provide a convenient and intuitive way to inference (mainly for my own use)☆122Updated 6 months ago
- Acoustic models for SVS/SVC/TTS☆31Updated 9 months ago
- A convenient tool for generating audio files☆135Updated 2 years ago
- model_repo☆123Updated 2 years ago