soham97 / awesome-sound_event_detection
Reading list for research topics in Sound AI
☆165Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for awesome-sound_event_detection
- Repo associated to the DESED dataset, download and creation of data☆123Updated 3 months ago
- Domestic environment sound event detection task☆127Updated 4 months ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆203Updated last year
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆97Updated 3 weeks ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆229Updated 6 months ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆105Updated 2 months ago
- Toolkit for downloading and processing Google's AudioSet dataset.☆160Updated last year
- ☆78Updated last year
- Source code for Consistent ensemble distillation for audio tagging☆14Updated 3 months ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆128Updated 9 months ago
- transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.☆258Updated 3 years ago
- Easy to use Beamformers for multi-channel speech separation/enhancement☆186Updated 3 years ago
- The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation☆223Updated this week
- Baseline method for sound event localization task of DCASE 2023 challenge☆41Updated last year
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆139Updated last year
- target speaker extraction and verification for multi-talker speech☆163Updated 3 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆122Updated last year
- ☆125Updated 2 weeks ago
- Baseline method for sound event localization task of DCASE 2020 challenge☆53Updated 3 years ago
- An open source dataset for source separation☆378Updated 9 months ago
- PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio☆149Updated last year
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆344Updated 3 months ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp…☆125Updated 4 years ago
- A two-stage polyphonic sound event detection and localization method for both SED and DOA.☆105Updated last year
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆116Updated 2 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆143Updated 2 years ago
- Libri-CSS: dataset and evaluation pipeline☆132Updated last year
- This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.☆89Updated 2 years ago
- Visualization toolbox for Sound Event Detection☆115Updated 8 months ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆141Updated 2 months ago