MrXnneHang / auto_labeling_for_BERT_VITS2Links
这个项目是数据预处理。第一步是对获取到的音频做处理,结合Funasr的时间戳去掉空背景音。也包含了喂给BERT前的label
☆16Updated this week
Alternatives and similar repositories for auto_labeling_for_BERT_VITS2
Users that are interested in auto_labeling_for_BERT_VITS2 are comparing it to the libraries listed below
Sorting:
- ☆138Updated 3 months ago
- GPT-SoVITS2☆216Updated 10 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆180Updated 10 months ago
- 数据集自动化制作脚本☆71Updated 2 years ago
- VC Without Retrain!☆124Updated last year
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆124Updated 2 months ago
- A cli tool for split vocal timbre.☆232Updated 3 months ago
- vits2 backbone with bert☆85Updated last year
- ☆282Updated last year
- Bert-VITS2 onnx推理版本☆42Updated last year
- text to speech using autoregressive transformer and VITS☆239Updated last year
- Bert-vits2-V2.3 训练和推理☆46Updated last year
- SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.☆202Updated last year
- 基于达摩院视频切割技术的视频转换为短音频的vits数据集生成工具 A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…☆54Updated last year
- Sovits5 with RMVPE☆14Updated last year
- Simple data labeling script with funasr inside. 使用阿里fanasr进行VITS训练数据标注☆79Updated last year
- Split audio using the .srt file, clean up annotations, then merge and package into a format suitable for bert-vits2 in a standard manner.…☆48Updated 11 months ago
- Pipelines and tools to build your own DiffSinger dataset.☆109Updated 2 months ago
- A voiceprint recognition classifier for audio dataset☆100Updated last year
- 一个快速制作语音数据集的可视化工具☆193Updated last year
- 基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。☆195Updated 2 years ago
- ☆210Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆17Updated 2 years ago
- vits2 backbone with bert☆340Updated last year
- Preprocess Audio for training☆340Updated 2 months ago
- VITS with phoneme-level prosody modeling based on MaskGIT☆81Updated 9 months ago
- 音频响度统一,音量归一化处理☆11Updated last year
- 基于vits fastspeech2 visinger的tts模型☆23Updated 2 years ago
- ☆108Updated 3 weeks ago
- SOFA: Singing-Oriented Forced Aligner☆168Updated 2 weeks ago