jim-schwoebel / sound_event_detection
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
☆41Updated 3 years ago
Alternatives and similar repositories for sound_event_detection:
Users that are interested in sound_event_detection are comparing it to the libraries listed below
- Sound event detection with depthwise separable and dilated convolutions.☆53Updated 4 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆129Updated 2 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging☆43Updated 8 months ago
- A fast implementation of bss_eval metrics for blind source separation☆134Updated 2 years ago
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆115Updated 2 years ago
- Constrained Permutation Invariant Training, Speech Separation☆46Updated 4 years ago
- ☆13Updated last year
- A library built for easier audio self-supervised training, downstream tasks evaluation☆113Updated 5 months ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆39Updated 3 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆146Updated 2 years ago
- ☆63Updated 5 months ago
- ☆29Updated 7 months ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆54Updated last year
- SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition☆116Updated 8 months ago
- ☆81Updated last year
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆40Updated 2 years ago
- Machine learning speaker characteristics☆33Updated last week
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆155Updated 2 years ago
- The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"☆77Updated 2 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆38Updated 3 years ago
- Python toolkit for speech processing☆68Updated last month
- Python library for audio augmentation☆83Updated last year
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement (ICASSP 2023)☆78Updated last year
- Easy to use Audio Tagging in PyTorch☆20Updated 3 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- Phase-aware speech enchancement with Deep Complex U-Net☆104Updated last year