这个项目是数据预处理。第一步是对获取到的音频做处理,结合Funasr的时间戳去掉空背景音。也包含了喂给BERT前的label
☆16May 27, 2025Updated 9 months ago
Alternatives and similar repositories for auto_labeling_for_BERT_VITS2
Users that are interested in auto_labeling_for_BERT_VITS2 are comparing it to the libraries listed below
Sorting:
- A lightweight tool that efficiently isolates target speaker data from your datasets.☆19Nov 23, 2024Updated last year
- NovelAi Image Studio☆16Feb 20, 2026Updated 2 weeks ago
- 数据集自动化制作脚本☆72Mar 26, 2023Updated 2 years ago
- Cochlear implant signal processing☆10Jun 24, 2021Updated 4 years ago
- 不会聊天的字幕提取器不是一个好 B 站下载器~☆92Updated this week
- Mod for 0 A.D. that plays out a "grand-strategy" style campaign.☆14Jun 18, 2021Updated 4 years ago
- colorizing images☆10Sep 16, 2022Updated 3 years ago
- ComfyUI sampler for HyperSDXL UNet☆11Jun 20, 2024Updated last year
- 한국 라노베 분석기☆10Jul 6, 2022Updated 3 years ago
- ☆11Feb 22, 2024Updated 2 years ago
- Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"☆10Jan 25, 2021Updated 5 years ago
- node.js based CMS , built on top of grapejs framework☆12Sep 8, 2016Updated 9 years ago
- Werewolf Party Game with AI Bots☆11Mar 3, 2026Updated last week
- ☆12Sep 27, 2024Updated last year
- Script for Ren'Py projects to be able to interact with Discord Rich Presence.☆21Oct 9, 2022Updated 3 years ago
- I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …☆12Feb 5, 2024Updated 2 years ago
- This is a repository for a deep learning model that can detect text bubbles in manga and feed them into a translator.☆15Mar 19, 2023Updated 2 years ago
- This a is a simple fortune teller like app which tells what 2 people are, it does this based on the letters in both names. The given answ…☆12Mar 15, 2021Updated 4 years ago
- audio/speech feature extraction using parselmouth, librosa, disvoice☆10Jan 28, 2022Updated 4 years ago
- remove bg☆13Feb 7, 2025Updated last year
- 챗봇 프론트엔드 예제☆19Feb 8, 2026Updated last month
- ☆13Nov 17, 2025Updated 3 months ago
- Viewer and editor for files of NDS games☆11Mar 13, 2023Updated 2 years ago
- a compact audio-to-phoneme aligner for singing voice☆12Jan 17, 2024Updated 2 years ago
- 音频响度统一,音量归一化处理☆12May 3, 2024Updated last year
- ☆16Aug 31, 2024Updated last year
- Shader collection for Ren'Py engine☆13Mar 15, 2022Updated 3 years ago
- An original package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆14Oct 27, 2024Updated last year
- 🍭Quickly ask questions using chatgpt in the terminal.☆16Aug 1, 2024Updated last year
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆15Nov 11, 2022Updated 3 years ago
- ☆18Mar 4, 2022Updated 4 years ago
- 遮挡人脸识别,在insightface基础上增加了一个识别遮挡概率的分类模型,包含在项目中。☆15May 30, 2023Updated 2 years ago
- Dynamic DNS updater for AWS-Route53 (Made with Python)☆11Jan 8, 2026Updated 2 months ago
- ☆12Apr 1, 2024Updated last year
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"☆14Nov 29, 2024Updated last year
- Classification of audio signals using PyTorch☆13May 19, 2020Updated 5 years ago
- Praat-based tools for EGG analysis☆18Sep 21, 2023Updated 2 years ago
- Uchinoko Studio is a web application designed to facilitate real-time voice conversations with AI.☆16Dec 20, 2025Updated 2 months ago
- Just to provide some simple necessary capabilities for the workflow I made, written using GPT☆16May 22, 2024Updated last year