jim-schwoebel / sound_event_detection
π΅ A repository for manually annotating files to create labeled acoustic datasets for machine learning.
β41Updated 3 years ago
Alternatives and similar repositories for sound_event_detection:
Users that are interested in sound_event_detection are comparing it to the libraries listed below
- Sound event detection with depthwise separable and dilated convolutions.β53Updated 4 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).β132Updated 2 years ago
- Easy to use Audio Tagging in PyTorchβ20Updated 3 years ago
- β13Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixITβ53Updated last year
- β62Updated 6 months ago
- β30Updated last year
- The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"β77Updated 2 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterancesβ49Updated 2 years ago
- Download and create a tfreader for the audioset datasetβ16Updated 4 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, aβ¦β39Updated 3 years ago
- β50Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdfβ65Updated 3 years ago
- Python library for audio augmentationβ83Updated last year
- Clustering-based methods for overlapping diarizationβ78Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presentβ¦β25Updated 2 years ago
- β25Updated 3 years ago
- Paderborn Sound Event Detectionβ73Updated last year
- Learning differentiable temporal resolution on time-series data.β36Updated 2 years ago
- SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognitionβ116Updated 9 months ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Lβ¦β54Updated last year
- Constrained Permutation Invariant Training, Speech Separationβ47Updated 4 years ago
- Streaming Audiotransformers for online Audio taggingβ43Updated 9 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detectionβ28Updated 2 years ago
- β85Updated last year
- This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.β97Updated 2 years ago
- Python toolkit for speech processingβ68Updated last week
- Single channel speech source separation by diffusion process (ICASSP 2023)β100Updated last year
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformerβ34Updated 2 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.β40Updated 2 years ago