koharubiyori / smartSlicer
从视频中收集语音素材用于AI翻唱炼丹的小工具
☆36Updated 6 months ago
Alternatives and similar repositories for smartSlicer:
Users that are interested in smartSlicer are comparing it to the libraries listed below
- GPT-SoVITS 参考音频推理效果批量试听☆50Updated last year
- An auxiliary tool for manual screening of audio dataset.☆125Updated last year
- 基于GPT-SoVITS的视频剪辑快捷配音工具☆154Updated last year
- vits2 backbone with bert☆338Updated last year
- vits2 backbone with bert☆85Updated last year
- Simple data labeling script with funasr inside. 使用阿里fanasr进行VITS训练数据标注☆78Updated last year
- A voiceprint recognition classifier for audio dataset☆100Updated last year
- 基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo☆96Updated last year
- SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.☆203Updated last year
- MSST-GUI is a Qt5-based inference GUI, designed to provide a convenient and intuitive way to inference (mainly for my own use)☆108Updated 5 months ago
- ☆41Updated last year
- 集成主流开源大模型,实现不同类型大模型以及同类型大模型之间的协调合作。☆62Updated 2 weeks ago
- A cli tool for split vocal timbre.☆229Updated 2 months ago
- BertVITS2前端界面☆301Updated last year
- Split audio using the .srt file, clean up annotations, then merge and package into a format suitable for bert-vits2 in a standard manner.…☆46Updated 10 months ago
- Subtitle dubbing with multiple AI projects☆134Updated this week
- 早期Lora一键包☆28Updated last year
- 以太流派的AI转绘工具包☆210Updated last year
- 基于 RWKV_Role_Playing 项目接入GPT-SoVITS语音对话项目☆30Updated last year
- AI吟美零式☆68Updated last year
- ☆31Updated 8 months ago
- 用于快速扬声器自适应TTS和任意语音转换 This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and any-to-any voice conversion☆38Updated last year
- 带有 WebUI 的 NovelAI 量产工具, 实现了批量文生图; 批量图生图; 视频转绘; 分块重绘; 批量 Vibe; 批量局部重绘; 批量超分降噪; 批量自动打码; 批量添加水印; 批量上传 Pixiv; 图片筛选; 批量抹除, 还原或导出生成信息; 法术解析; 多…☆267Updated this week
- 利用Stable-Diffution API去除图片ai感☆77Updated last year
- Acoustic models for SVS/SVC/TTS☆31Updated 8 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆176Updated 9 months ago
- 跨语种语音克隆,中文版Webui☆51Updated last year
- 本次开源为DL-B,是一个基于ChatGLM、Wav2Lip、So-VITS组建的数字形象方案。可以在此基础之上增加其他组件达到数字生命的效果。This open source is DL-B, which is a digital image scheme based o…☆106Updated last year
- Inference Specialization☆443Updated 10 months ago
- ☆75Updated last year