Anwarvic / VAD_Benchmark
Benchmarking different VAD models on the AVA-Speech dataset
☆14 Updated last year
Alternatives and similar repositories for VAD_Benchmark:
Users who are interested in VAD_Benchmark compare it to the repositories listed below.
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L… ☆54 Updated last year
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations" ☆13 Updated 2 years ago
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models ☆20 Updated last year
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation ☆25 Updated 4 years ago
- Multipurpose Multi Speaker Mixture Signal Generator ☆44 Updated last month
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge. ☆46 Updated 2 months ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021 ☆39 Updated 3 years ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022 ☆40 Updated 2 years ago
- implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain ☆45 Updated 4 years ago
- A temporal module for PyTorch-ComplexTensor ☆44 Updated 8 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection ☆28 Updated 2 years ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement ☆49 Updated 7 months ago
- Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" in PyTorch ☆28 Updated 3 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w… ☆48 Updated 5 months ago
- An STFT/iSTFT written in PyTorch using 1D convolutions ☆27 Updated 8 months ago
- Paderbox: A collection of utilities for audio / speech processing ☆38 Updated 3 weeks ago
- Constrained Permutation Invariant Training, Speech Separation ☆47 Updated 4 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings ☆25 Updated 5 months ago
- The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement" ☆77 Updated 2 years ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement" ☆52 Updated 2 years ago
- PyTorch implementation of Continuous Speech Separation ☆13 Updated 2 years ago
- Fully Quantized Neural Networks For Speech Enhancement ☆61 Updated last year
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023 ☆111 Updated last year
- Conformer-based Metric GAN for speech enhancement ☆26 Updated 10 months ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis ☆23 Updated 3 years ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh… ☆35 Updated 2 weeks ago