falloutdurham / specaugment
PyTorch Implementation of Time/Frequency Masks
☆12Updated 6 years ago
Alternatives and similar repositories for specaugment
Users who are interested in specaugment are comparing it to the repositories listed below.
- A fully convolutional network for speech-to-text, built on PyTorch.☆126Updated 5 years ago
- Kaggle | 1st place solution for Freesound Audio Tagging 2019☆314Updated 3 years ago
- Team NUDT code for DCASE2018Task2.☆80Updated 6 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆98Updated 6 years ago
- Freesound Audio Tagging 2019☆95Updated 6 years ago
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆169Updated 3 years ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆388Updated 4 years ago
- Transformer for an ASR system (via TensorFlow 2.0)☆114Updated 6 years ago
- Deep multi-metric learning for text-independent speaker verification☆24Updated 5 years ago
- ☆226Updated 5 years ago
- 8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)☆115Updated 4 years ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆208Updated 2 years ago
- Speaker recognition, voiceprint recognition☆53Updated 5 years ago
- ☆99Updated 7 years ago
- DCASE 2018 Baseline systems☆129Updated 6 years ago
- Speech commands recognition with PyTorch | Kaggle 10th place solution in TensorFlow Speech Recognition Challenge☆199Updated last year
- Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection w…☆192Updated 3 years ago
- PyTorch and TensorFlow data loaders for several audio datasets☆113Updated 5 years ago
- PyTorch code for "Rethinking CNN Models for Audio Classification"☆128Updated 4 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 6 years ago
- Speech recognition model based on a FAIR research paper, built using PyTorch.☆84Updated 6 years ago
- Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks☆105Updated 7 years ago
- ASR with PyTorch☆139Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec…☆45Updated last year
- this is a treasure-house of speech☆165Updated 7 years ago
- Time delay neural network (TDNN) implementation in PyTorch using the unfold method☆203Updated 5 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 2 years ago
- Benchmark for sound event localization task of DCASE 2019 challenge☆77Updated 4 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks☆48Updated 4 years ago
- ☆15Updated 6 years ago