mjpyeon / wavenet-classifier
Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
☆64Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for wavenet-classifier
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆73Updated 3 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆95Updated 5 years ago
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging☆30Updated 3 years ago
- Utils and data sets for audio and PyTorch☆83Updated 2 years ago
- Audio data augmentation examples☆35Updated 6 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆72Updated 3 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆37Updated 6 years ago
- DCASE 2018 Baseline systems☆129Updated 5 years ago
- Multiple Instance Learning for Sound Event Detection☆34Updated 6 years ago
- ☆58Updated 6 years ago
- pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf☆43Updated 6 years ago
- Convolutional neural networks for sound classification☆20Updated 6 years ago
- This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders…☆52Updated 6 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Pytorch implementation of time-domain filterbanks☆110Updated 3 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 6 years ago
- Implementation of WaveNet with Gluon☆16Updated 5 years ago
- DCASE2019 Challenge Task 1 baseline system☆20Updated 5 years ago
- PyTorch implementation of a Time Delay Neural Network (TDNN)☆40Updated 5 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification☆29Updated 5 years ago
- Unsupervised segmentation and clustering of Buckeye English and NCHLT Xitsonga corpora.☆9Updated 7 years ago
- Pytorch implementation of "Sample-level Deep Convolutional Neural Networks for Music Auto-tagging Using Raw Waveforms"☆57Updated last year
- Framewise phoneme classification on the TIMIT dataset using neural networks☆19Updated 8 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- ☆53Updated 6 years ago
- ☆34Updated 5 years ago
- 6th place solution to Freesound Audio Tagging 2019 kaggle competition☆25Updated 4 years ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- ☆27Updated 5 years ago
- This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and…☆52Updated 6 years ago