一种基于Emotion2Vec的批量音频情感自动标注脚本
☆504Mar 7, 2025Updated 11 months ago
Alternatives and similar repositories for RefAudioEmoTagger
Users that are interested in RefAudioEmoTagger are comparing it to the libraries listed below
Sorting:
- A cli tool for split vocal timbre.☆273Jan 17, 2026Updated last month
- A lightweight tool that efficiently isolates target speaker data from your datasets.☆19Nov 23, 2024Updated last year
- GAG is a GUI for GPT-SoVITS inference. Just add it to the official integration package and run for a smoother experience.☆227Jun 24, 2025Updated 8 months ago
- ☆156Feb 6, 2025Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆96Nov 16, 2025Updated 3 months ago
- GPT-SoVITS 参考音频推理效果批量试听☆53Mar 8, 2024Updated last year
- GPT-SoVITS2☆229Feb 9, 2026Updated 2 weeks ago
- 本项目意图在于让使用各类语音合成引擎的方式变得统一,支持多种语音合成引擎适配器,允许直接作为模组使用或启动后端服务☆767Apr 15, 2024Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆55,240Feb 9, 2026Updated 2 weeks ago
- Inference Specialization☆506Jun 25, 2024Updated last year
- waifu年龄检测器!☆15Feb 22, 2025Updated last year
- 基于PyQt5写的一个音频响度匹配小工具,目前支持4种匹配方式☆10Aug 14, 2025Updated 6 months ago
- 这是一个批量推理工具,对同一段文字进行多次推理,并且支持随机参数,直到筛选出最满意的结果。☆11Aug 19, 2024Updated last year
- ☆13Jun 8, 2024Updated last year
- A WebUI app for Music-Source-Separation-Training and we packed UVR together!☆980Feb 15, 2026Updated 2 weeks ago
- ☆15Mar 31, 2025Updated 11 months ago
- 各种引擎的工具☆72Oct 25, 2025Updated 4 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆184Jul 10, 2024Updated last year
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- 基于GPT-SoVITS的视频剪辑快捷配音工具☆173Mar 15, 2024Updated last year
- StarRail Datasets For SVC/SVS/TTS☆335Jul 27, 2025Updated 7 months ago
- Vocal Remover using Deep Neural Networks☆19Dec 31, 2024Updated last year
- ☆19Feb 2, 2023Updated 3 years ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 11 months ago
- Preprocess Audio for training☆375Feb 2, 2026Updated 3 weeks ago
- ☆128Feb 2, 2026Updated 3 weeks ago
- SOFA: Singing-Oriented Forced Aligner☆208May 16, 2025Updated 9 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆19,695Feb 11, 2026Updated 2 weeks ago
- 基于达摩院视频切割技术的视频转换为短音频的vits数据集生成工具 A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…☆55Jan 17, 2024Updated 2 years ago
- ☆474Jan 19, 2026Updated last month
- a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now☆249Aug 9, 2024Updated last year
- ☆21Dec 18, 2025Updated 2 months ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- 低成本的简单基于live2d TTS文字转语音和大模型聊天的直播解决方案☆275Jul 4, 2024Updated last year
- Train the next generation of TTS systems.☆171Sep 13, 2024Updated last year
- Convenient for developers to call inference models from version v1 to v3 through API, supporting streaming transmission and specified typ…☆44Mar 4, 2025Updated 11 months ago