将视频中不同说话人的声音提取后区分保存,得到音频训练数据
☆31May 23, 2024Updated last year
Alternatives and similar repositories for speaker-diarization
Users that are interested in speaker-diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆36Dec 30, 2025Updated 4 months ago
- EaseVoice Trainer is a simple and user-friendly voice cloning and speech model trainer.☆14Apr 27, 2025Updated last year
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- This tools permits to decompress the packed data inside of the game executable☆12Apr 14, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆12Aug 15, 2022Updated 3 years ago
- 这是一个批量推理工具,对同一段文字进行多次推理,并且支持随机参数,直到筛选出最满意的结果。☆11Aug 19, 2024Updated last year
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- ☆19Aug 23, 2024Updated last year
- Create Chatbot using Gemini and RAG that could read from SQL databases☆16Dec 5, 2024Updated last year
- 基于LLM的多轮问答系统。结合了意图识别和词槽填充技术☆22Jul 30, 2025Updated 9 months ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- A lightweight tool that efficiently isolates target speaker data from your datasets.☆20Nov 23, 2024Updated last year
- ☆21Nov 22, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆31Apr 29, 2022Updated 4 years ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆192Apr 28, 2026Updated 3 weeks ago
- This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Fea…☆11Sep 27, 2022Updated 3 years ago
- An unofficial non-causal Tensorflow implementation of "Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Spee…☆14Dec 27, 2022Updated 3 years ago
- ☆15Sep 16, 2024Updated last year
- A code repository for the accepted paper entitled "Fast Generation of Sound Zones Using Variable Span Trade-Off Filters in the DFT-Domain…☆18Feb 17, 2025Updated last year
- c++ port of truetype-tracer (font to G-code/DXF converter)☆26Nov 14, 2022Updated 3 years ago
- Whale 是专为 DeepSeek 模型优化的终端 AI 编码助手,支持 MCP, Skills, 缓存优化。☆116May 13, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Cross-Layer Similarity Knowledge Distillation for Speech Enhancement☆11Jun 22, 2023Updated 2 years ago
- ☆12May 22, 2023Updated 2 years ago
- ☆17Mar 30, 2023Updated 3 years ago
- Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems☆110Jan 25, 2026Updated 3 months ago
- 使用命令行界面(CLI)或 Python 包进行简单易用的人声分离,采用各种出色的模型(主要由 @Anjok07 作为 UVR 项目的一部分训练)☆29Mar 1, 2026Updated 2 months ago
- SOFiA - Sound Field Analysis Toolbox for Matlab