jim-schwoebel / sound_event_detection
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
☆41Updated 3 years ago
Alternatives and similar repositories for sound_event_detection
Users that are interested in sound_event_detection are comparing it to the libraries listed below
Sorting:
- Sound event detection with depthwise separable and dilated convolutions.☆53Updated 5 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆44Updated 11 months ago
- ☆13Updated last year
- ☆50Updated last year
- a python library for speech enhancement☆79Updated 10 months ago
- ☆63Updated 8 months ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆41Updated 3 years ago
- Easy to use Audio Tagging in PyTorch☆22Updated 3 years ago
- Utility tools for the "Divide and Remaster" dataset, introduced as part of the Cocktail Fork problem paper: https://arxiv.org/abs/2110.09…☆72Updated 2 years ago
- ☆16Updated 4 years ago
- The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms.☆23Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆139Updated 2 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆53Updated 2 years ago
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆25Updated 4 years ago
- Paderborn Sound Event Detection☆74Updated last year
- ☆88Updated last year
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated 8 months ago
- ☆29Updated 4 years ago
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆25Updated 2 years ago
- A fast implementation of bss_eval metrics for blind source separation☆136Updated 2 years ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆34Updated 2 years ago
- This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.☆99Updated 2 years ago
- Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.☆29Updated last year
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆41Updated 3 years ago
- MultiSV: scripts for data preparation☆27Updated 3 months ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆42Updated 2 years ago
- ☆31Updated this week
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago