将视频中不同说话人的声音提取后区分保存,得到音频训练数据
☆31May 23, 2024Updated last year
Alternatives and similar repositories for speaker-diarization
Users that are interested in speaker-diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- This tools permits to decompress the packed data inside of the game executable☆12Apr 14, 2022Updated 4 years ago
- ☆12Aug 15, 2022Updated 3 years ago
- 这是一个批量推理工具,对同一段文字进行多次推理,并且支持随机参数,直到筛选出最满意的结果。☆11Aug 19, 2024Updated last year
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Using vanna framework and custom api. Vanna框架和自定义API的完整调用☆20Jul 17, 2024Updated last year
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- A lightweight tool that efficiently isolates target speaker data from your datasets.☆20Nov 23, 2024Updated last year
- ☆20Nov 22, 2025Updated 5 months ago
- Viewer and editor for files of NDS games☆11Mar 13, 2023Updated 3 years ago
- Anime4k v0.9 effect shader implementation for obs to improve the quality of the preview and stream☆13Mar 19, 2024Updated 2 years ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- Image Converter Ultra☆16Jan 29, 2026Updated 3 months ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆190Sep 1, 2025Updated 8 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss☆14Sep 4, 2023Updated 2 years ago
- This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Fea…☆11Sep 27, 2022Updated 3 years ago
- Optimizing Source and Sensor Placement for Sound Field Control☆16Mar 27, 2023Updated 3 years ago
- An unofficial non-causal Tensorflow implementation of "Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Spee…☆14Dec 27, 2022Updated 3 years ago
- ☆15Sep 16, 2024Updated last year
- Monster Hunter Frontier ZZ Custom Launcher without IE☆14Jun 27, 2024Updated last year
- A code repository for the accepted paper entitled "Fast Generation of Sound Zones Using Variable Span Trade-Off Filters in the DFT-Domain…☆18Feb 17, 2025Updated last year
- Tools for converting Nintendo DS binaries to an ELF file for Ghidra/IDA☆26Jan 28, 2022Updated 4 years ago
- 使用命令行界面(CLI)或 Python 包进行简单易用的人声分离,采用各种出色的模型(主要由 @Anjok07 作为 UVR 项目的一部分训练)☆25Mar 1, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- c++ port of truetype-tracer (font to G-code/DXF converter)☆26Nov 14, 2022Updated 3 years ago
- Custom quest editor for the game Monster Hunter Frontier Z☆12Oct 22, 2024Updated last year
- ☆12May 22, 2023Updated 2 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆70Jul 8, 2024Updated last year
- Multizone Soundfield Reproduction☆15Mar 23, 2018Updated 8 years ago
- Streaming Audiotransformers for online Audio tagging☆54Jun 14, 2024Updated last year
- Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"☆36Jan 28, 2026Updated 3 months ago
- 基于GptSoVits项目的参考音频筛选工具☆25Aug 17, 2025Updated 8 months ago
- FEETECH BUS Servo Python library☆32Jul 3, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀☆15Jul 12, 2021Updated 4 years ago
- ☆14Oct 12, 2023Updated 2 years ago
- Game Boy Advance tools for image and video conversion☆27Feb 8, 2026Updated 2 months ago
- ☆17Sep 12, 2023Updated 2 years ago
- ☆20Updated this week
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- Non-Uniform FFT on the CPU and GPU (1D, 2D and 3D)☆14Jan 13, 2021Updated 5 years ago