A lightweight tool that efficiently isolates target speaker data from your datasets.
☆20Nov 23, 2024Updated last year
Alternatives and similar repositories for SpeakerClassifier
Users that are interested in SpeakerClassifier are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Speaker embedding for anime speech domain based on ECAPA_TDNN☆20Jun 22, 2025Updated 11 months ago
- 这个项目是数据预处理。第一步是对获取到的音频做处理,结合Funasr的时间戳去掉空背景音。也包含了喂给BERT前的label☆15May 27, 2025Updated last year
- Just a template for quickly creating a python library.☆10Jun 5, 2026Updated last week
- ☆28May 1, 2026Updated last month
- The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization"☆10Dec 24, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Simple GUI for Amphion Vevo☆14May 4, 2025Updated last year
- ☆157Feb 6, 2025Updated last year
- 将视频分割,提取分镜☆92Dec 4, 2025Updated 6 months ago
- GAG is a GUI for GPT-SoVITS inference. Just add it to the official integration package and run for a smoother experience.☆237Jun 24, 2025Updated 11 months ago
- chinese voice converted from Warcraft III: Reforged☆12Jun 28, 2023Updated 2 years ago
- 基于达摩院视频切割技术的视频转换为短音频的vits数据集生成工具 A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…☆55Jan 17, 2024Updated 2 years ago
- Bridge between Comfyui and Houdini☆31Jul 4, 2025Updated 11 months ago
- Extract SD (Automatic1111, ComfyUI) metadata from generated files in bulk, search and browse your prompts☆19Updated this week
- Head Orientation Node for ComfyUI: Analyze and sort images based on facial orientation using MediaPipe. This custom node detects facial l…☆11May 17, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ECAI 2025☆20May 4, 2026Updated last month
- 用纯白誓约,守护彼此一生(V3版本之后会用ts生态重构,请移步BaiShou-Next)☆118Jun 4, 2026Updated last week
- 数据集自动化制作脚本☆71Mar 26, 2023Updated 3 years ago
- The whole image inference based on the onnx model of AnimeGANv3☆27Jan 7, 2026Updated 5 months ago
- 绝区零 一条龙 | 全自动 | 自动闪避 | 自动每日 | 自动空洞 | 支持手柄(1.4游戏更新请耐心等待适配)☆16Updated this week
- Docker Images with Desktop Environment and Support Remote Desktop Connecting☆28May 21, 2025Updated last year
- MSST-GUI is a Qt5-based inference GUI, designed to provide a convenient and intuitive way to inference (mainly for my own use)☆398Sep 23, 2025Updated 8 months ago
- 希望用代码为 waifus 绘心。☆96May 28, 2026Updated 2 weeks ago
- 此仓库是我在学习MySQL中写下的笔记,我更倾向于初学者,所以我用通俗易懂的语句描述了MySQL的使用☆28Dec 28, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- RVCで音声学習をするための便利スクリプト集☆26Apr 8, 2023Updated 3 years ago
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …☆10Jul 18, 2023Updated 2 years ago
- ☆12Jan 6, 2025Updated last year
- Details about the wide minima density hypothesis and code to compute width of a minima☆10Nov 30, 2024Updated last year
- Building a quick conversation-based search demo with langchain.☆10Apr 2, 2024Updated 2 years ago
- Nyakku 的个人博客~☆27Updated this week
- Extend bert-nmt to context-aware translation.☆11May 24, 2021Updated 5 years ago
- ComfyUI custom nodes to create a speech dataset☆23Jun 17, 2025Updated 11 months ago
- machine translation data process tools☆10Apr 29, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- GSV-TTS-Lite A high-performance inference engine specifically designed for the GPT-SoVITS text-to-speech model.(few shot voice cloning)☆109May 29, 2026Updated last week
- audio/speech feature extraction using parselmouth, librosa, disvoice☆10Jan 28, 2022Updated 4 years ago
- wav2svp: Waveform & pitchs to Synthesizer V Project☆17Jan 9, 2025Updated last year
- A cli tool for split vocal timbre.☆290Jan 17, 2026Updated 4 months ago
- 使用pyqt做的一些自定义组件库,包括类似微信的气泡消息、导航栏等☆30Nov 12, 2024Updated last year
- A WebUI app for Music-Source-Separation-Training and we packed UVR together!☆1,161May 27, 2026Updated 2 weeks ago
- audiolm-pytorch training code☆15Jul 31, 2023Updated 2 years ago