A lightweight tool that efficiently isolates target speaker data from your datasets.
☆19Nov 23, 2024Updated last year
Alternatives and similar repositories for SpeakerClassifier
Users who are interested in SpeakerClassifier are comparing it to the libraries listed below
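- This project handles data preprocessing. The first step processes the collected audio, using Funasr timestamps to remove empty background audio. It also includes the label prepared before feeding BERT☆16May 27, 2025Updated 9 months ago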
- ☆24Jan 17, 2026Updated last month
- real time face swap and one-click video deepfake with only a single image☆14Sep 10, 2024Updated last year
- Details about the wide minima density hypothesis and code to compute the width of a minimum☆10Nov 30, 2024Updated last year
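- GAN Step By Step (GSBS): as the name suggests, I hope to learn GANs one step at a time. A GAN, or generative adversarial network, is an unsupervised algorithm that has been very popular in recent years and can generate highly realistic photos, images, and even videos. GANs are a brand-new area of imaging; from their development since 2014 to the present, in computer vision…☆11Jan 11, 2023Updated 3 years ago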
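- Zenless Zone Zero all-in-one automation | fully automatic | auto dodge | auto dailies | auto Hollow | controller support (please wait patiently for adaptation to the 1.4 game update)☆16Updated this week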
- Cochlear implant signal processing☆10Jun 24, 2021Updated 4 years ago
- ☆156Feb 6, 2025Updated last year
- Picking up and completing the yds_charger project left unfinished by its earlier maintainers☆10Jun 17, 2025Updated 8 months ago
- Novel downloader for 欢乐书客 that cracks chapter content encryption☆12Aug 12, 2019Updated 6 years ago
- A project based on machine learning and deep learning methods for playing Gobang (five-in-a-row) by controlling a mechanical arm☆12Apr 16, 2023Updated 2 years ago
- A Python implementation of Delaunay triangulation☆11Aug 5, 2021Updated 4 years ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆26Feb 12, 2026Updated 2 weeks ago
- Converts WeChat Mini Program asynchronous APIs into Promises for easier asynchronous programming☆10Aug 14, 2018Updated 7 years ago
- ☆19Jul 21, 2025Updated 7 months ago
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …☆10Jul 18, 2023Updated 2 years ago
- Simple GUI for Amphion Vevo☆14May 4, 2025Updated 9 months ago
- machine translation data process tools☆10Apr 29, 2024Updated last year
- Extend bert-nmt to context-aware translation.☆11May 24, 2021Updated 4 years ago
- Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"☆10Jan 25, 2021Updated 5 years ago
- Right click images in file explorer to search on trace.moe☆11Aug 23, 2025Updated 6 months ago
- GAG is a GUI for GPT-SoVITS inference. Just add it to the official integration package and run for a smoother experience.☆227Jun 24, 2025Updated 8 months ago
- We propose MMAD, a novel automated pipeline for precise AD generation. MMAD introduces ambient music alongside visual and linguistic, enh…☆16Dec 31, 2024Updated last year
- Easy-to-use and Fast NLP library with awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications.☆12Mar 13, 2024Updated last year
- I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …☆12Feb 5, 2024Updated 2 years ago
- Vocoder based on PC-DDSP and nsf-HiFiGAN☆18Jul 17, 2023Updated 2 years ago
- ☆13Jun 7, 2021Updated 4 years ago
- PaddleSeq☆10Mar 28, 2023Updated 2 years ago
- Honor of Kings hero voice dataset | Honor of Kings dataset | Honor of Kings voice dataset | voice dataset | GPT-sovits dataset☆11Sep 9, 2024Updated last year
- audio/speech feature extraction using parselmouth, librosa, disvoice☆10Jan 28, 2022Updated 4 years ago
- Open-source repository for 《Menhera酱降临我身边》☆14Oct 20, 2024Updated last year
- Create a UIView hierarchy from XML☆12Apr 1, 2016Updated 9 years ago
- Just a template for quickly creating a python library.☆10Feb 8, 2026Updated 3 weeks ago
- Learning materials for large models☆39Oct 11, 2025Updated 4 months ago
- ☆12Aug 3, 2024Updated last year
- Head Orientation Node for ComfyUI: Analyze and sort images based on facial orientation using MediaPipe. This custom node detects facial l…☆12Oct 30, 2025Updated 4 months ago
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆15Oct 27, 2024Updated last year
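- Low Orbit Ion Cannon: an open-source network stress-testing tool written in C#. Based on Praetox's LOIC project. You bear any potential risk arising from the use of this tool.☆11Jun 1, 2022Updated 3 years ago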
- Speaker embedding for anime speech domain based on ECAPA_TDNN☆17Jun 22, 2025Updated 8 months ago