falloutdurham / specaugmentLinks
PyTorch Implementation of Time/Frequency Masks
☆12Updated 6 years ago
Alternatives and similar repositories for specaugment
Users that are interested in specaugment are comparing it to the libraries listed below
Sorting:
- A fully convolution-network for speech-to-text, built on pytorch.☆126Updated 5 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆98Updated 6 years ago
- Freesound Audio Tagging 2019☆95Updated 6 years ago
- Team NUDT code for DCASE2018Task2.☆79Updated 6 years ago
- Kaggle | 1st place solution for Freesound Audio Tagging 2019☆314Updated 3 years ago
- ☆227Updated 5 years ago
- 8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)☆115Updated 4 years ago
- Speaker recognition ,Voiceprint recognition☆53Updated 5 years ago
- transformer for ASR-systerm (via tensorflow2.0)☆114Updated 6 years ago
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆169Updated 3 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 6 years ago
- DCASE 2018 Baseline systems☆130Updated 6 years ago
- 6th place solution to Freesound Audio Tagging 2019 kaggle competition☆25Updated 5 years ago
- Deep multi-metric learning for text-independent speaker verification☆24Updated 5 years ago
- ASR with PyTorch☆140Updated 6 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.☆85Updated 6 years ago
- Speech commands recognition with PyTorch | Kaggle 10th place solution in TensorFlow Speech Recognition Challenge☆201Updated last year
- A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition☆237Updated 5 years ago
- A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.☆203Updated 6 years ago
- Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks☆105Updated 7 years ago
- Include some core functions and model to handle speech separation☆155Updated 4 years ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆208Updated 2 years ago
- Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection w…☆194Updated 3 years ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆129Updated 4 years ago
- ☆99Updated 7 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆123Updated 5 years ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆387Updated 4 years ago
- Time delay neural network (TDNN) implementation in Pytorch using unfold method☆203Updated 5 years ago
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆499Updated 4 years ago
- ☆38Updated 5 years ago