falloutdurham / specaugment
PyTorch Implementation of Time/Frequency Masks
☆12Updated 5 years ago
Alternatives and similar repositories for specaugment:
Users that are interested in specaugment are comparing it to the libraries listed below
- ☆38Updated 4 years ago
- ☆43Updated 7 months ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆97Updated 5 years ago
- transformer for ASR-systerm (via tensorflow2.0)☆114Updated 5 years ago
- Speaker recognition ,Voiceprint recognition☆52Updated 5 years ago
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆165Updated 2 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆68Updated 2 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- An implementation of vggish in keras with tf backend☆119Updated 3 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 5 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- Auto Segmentation Criterion (ASG) implemented in pytorch☆51Updated 3 years ago
- A fully convolution-network for speech-to-text, built on pytorch.☆126Updated 4 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- ☆99Updated 7 years ago
- Deep multi-metric learning for text-independent speaker verification☆24Updated 5 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆126Updated 5 years ago
- LSTM CTC End2End Speech Recognition.☆38Updated 5 years ago
- Audio-Visual Speech Recognition using Deep Learning☆60Updated 6 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- PyTorch end-to-end speech recognition☆49Updated 4 years ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆128Updated 4 years ago
- DCASE 2018 Baseline systems☆129Updated 5 years ago
- Team NUDT code for DCASE2018Task2.☆79Updated 6 years ago
- Repository for the paper "Towards duration robust weakly supervised sound event detection"☆23Updated last year
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆101Updated 6 years ago
- Freesound Audio Tagging 2019☆95Updated 5 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.☆83Updated 6 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆107Updated last year