gorinars / dcase16-cnn
Sound event detection in real life audio with CNN submitted to DCASE16
☆22Updated 2 years ago
Alternatives and similar repositories for dcase16-cnn:
Users that are interested in dcase16-cnn are comparing it to the libraries listed below
- DCASE 2016 Baseline system, python implementation☆51Updated 7 years ago
- ☆27Updated 6 years ago
- DCASE2016 TASK1 Scene Classification☆12Updated 7 years ago
- Bag-of-Features Acoustic Event Detection☆14Updated 8 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- Convolutional neural networks for sound classification☆20Updated 7 years ago
- Task 4 Large-scale weakly supervised sound event detection for smart cars☆65Updated 3 years ago
- DCASE 2017 Baseline system☆82Updated 4 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 5 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆123Updated 7 years ago
- ☆13Updated 8 years ago
- Multiple Instance Learning for Sound Event Detection☆34Updated 6 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Updated 6 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆38Updated 7 years ago
- ☆71Updated 7 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- DCASE 2018 Baseline systems☆129Updated 5 years ago
- ☆7Updated 7 years ago
- Here, an algorithm to classify environmental sounds with the aim of providing contextual information to devices such as hearing aids for …☆21Updated 10 years ago
- FFTNet vocoder implementation☆81Updated 6 years ago
- audio processing module for pytorch:stft, istft☆36Updated 5 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 6 years ago
- ☆20Updated 5 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in matlab.☆43Updated 8 years ago
- ☆58Updated 6 years ago
- Surrey CVSSP DCASE 2018 Task 2 system☆19Updated 2 years ago
- A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆92Updated 6 years ago
- Baseline of dcase 2019 task 4☆58Updated 2 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Updated 6 years ago