falloutdurham / specaugmentLinks
PyTorch Implementation of Time/Frequency Masks
☆12Updated 6 years ago
Alternatives and similar repositories for specaugment
Users that are interested in specaugment are comparing it to the libraries listed below
Sorting:
- Kaggle | 1st place solution for Freesound Audio Tagging 2019☆316Updated 3 years ago
- Freesound Audio Tagging 2019☆95Updated 6 years ago
- 8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)☆115Updated 5 years ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆392Updated 4 years ago
- Team NUDT code for DCASE2018Task2.☆79Updated 7 years ago
- A fully convolution-network for speech-to-text, built on pytorch.☆126Updated 5 years ago
- ☆230Updated 5 years ago
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆499Updated 4 years ago
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆169Updated 3 years ago
- An implementation of vggish in keras with tf backend☆122Updated 4 years ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆129Updated 4 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆99Updated 6 years ago
- Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection w…☆197Updated 3 years ago
- ☆15Updated 6 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.☆87Updated 7 years ago
- DCASE 2018 Baseline systems☆130Updated 6 years ago
- Share some recent speaker recognition papers and their implementations.☆90Updated 6 years ago
- Speaker recognition ,Voiceprint recognition☆53Updated 5 years ago
- Pytorch and TensorFlow data loaders for several audio datasets☆113Updated 5 years ago
- ☆99Updated 8 years ago
- 6th place solution to Freesound Audio Tagging 2019 kaggle competition☆25Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- Problem Agnostic Speech Encoder☆445Updated 2 years ago
- Time delay neural network (TDNN) implementation in Pytorch using unfold method☆203Updated 6 years ago
- ☆59Updated 7 years ago
- Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks☆105Updated 7 years ago
- transformer for ASR-systerm (via tensorflow2.0)☆114Updated 6 years ago
- Deep multi-metric learning for text-independent speaker verification☆24Updated 6 years ago
- Speech commands recognition with PyTorch | Kaggle 10th place solution in TensorFlow Speech Recognition Challenge☆200Updated last year
- this is a treasure-house of speech☆166Updated 7 years ago