YashNita / sound-event-detection-winning-method
☆23Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for sound-event-detection-winning-method
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- Sound event detection with depthwise separable and dilated convolutions.☆53Updated 4 years ago
- Spectra extraction tutorials based on torch and torchaudio.☆40Updated last year
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆29Updated last year
- Language modelling for sound event detection☆21Updated 4 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆41Updated last year
- Weakly Supervised CRNN System for Sound Event Detection With Large-scale Unlabeled In-domain Data☆9Updated 6 years ago
- A repository holding my personal implementations of audio feature extraction for environmental and musical auditory analysis and classifi…☆10Updated 4 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆53Updated last year
- ☆41Updated 2 months ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆39Updated last year
- Python library for audio augmentation☆83Updated last year
- Audio data augmentation examples☆35Updated 6 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆30Updated 6 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆29Updated 5 months ago
- Baseline systems for the FSD50K dataset☆67Updated 3 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated last year
- A database of clean and noisy speech for audio research☆10Updated 6 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆28Updated 3 years ago
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging☆30Updated 3 years ago
- MobileNetV2-based baseline system for DCASE2021 Challenge Task 2.☆21Updated 3 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 2 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 5 years ago
- Constrained Permutation Invariant Training, Speech Separation☆43Updated 3 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Updated last year
- Source code of the DCASE 2020 SELD submission "Audio Event Detection and Localization with Multitask Regression Network"☆16Updated 4 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- Code for https://arxiv.org/abs/1712.00254☆16Updated 6 years ago