AliceNavigator / SpeakerClassifierLinks
A lightweight tool that efficiently isolates target speaker data from your datasets.
☆19Updated 11 months ago
Alternatives and similar repositories for SpeakerClassifier
Users that are interested in SpeakerClassifier are comparing it to the libraries listed below
Sorting:
- VC Without Retrain!☆128Updated last year
- GPT-SoVITS2☆227Updated last year
- 基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo☆106Updated last year
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆185Updated last year
- A voiceprint recognition classifier for audio dataset☆105Updated 2 years ago
- Acoustic models for SVS/SVC/TTS☆31Updated last year
- 数据集自动化制作脚本☆72Updated 2 years ago
- ☆150Updated 8 months ago
- Simple data labeling script with funasr inside. 使用阿里fanasr进行VITS训练数据标注☆80Updated 2 years ago
- ☆295Updated last year
- SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.☆209Updated last year
- Bert-VITS2 onnx推理版本☆43Updated last year
- 这个项目是数据预处理。第一步是对获取到的 音频做处理,结合Funasr的时间戳去掉空背景音。也包含了喂给BERT前的label☆16Updated 4 months ago
- 音频响度统一,音量归一化处理☆12Updated last year
- ☆49Updated last year
- 一个快速制作语音数据集的可视化工具☆195Updated last year
- Split audio using the .srt file, clean up annotations, then merge and package into a format suitable for bert-vits2 in a standard manner.…☆49Updated last year
- An auxiliary tool for manual screening of audio dataset.☆130Updated 2 years ago
- Documentation for Bert-VITS2☆22Updated last year
- vits2 backbone with bert☆339Updated last year
- GPT-SoVITS 参考音频推理效果批量试听☆52Updated last year
- Bert-vits2-V2.3 训练和推理☆49Updated last year
- ☆70Updated last year
- Preprocess Audio for training☆363Updated 2 weeks ago
- vits2 backbone with bert☆84Updated last year
- 基于达摩院视频切割技术的视频转换为短音频的vits数据集生成工具 A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…☆55Updated last year
- Step-Audio-TTS-3B demo☆12Updated 8 months ago
- Subtitle dubbing with multiple TTS Engines☆208Updated last week
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆47Updated 2 years ago
- ☆41Updated last year