falloutdurham / specaugment
PyTorch Implementation of Time/Frequency Masks
☆12Updated 5 years ago
Alternatives and similar repositories for specaugment:
Users who are interested in specaugment are comparing it to the libraries listed below:
- Speaker recognition, voiceprint recognition☆52Updated 5 years ago
- Deep multi-metric learning for text-independent speaker verification☆24Updated 5 years ago
- SpeechYOLO Interspeech 2019☆42Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec…☆43Updated last year
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆97Updated 5 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.☆36Updated 5 years ago
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆165Updated 2 years ago
- ☆98Updated 7 years ago
- PyTorch and TensorFlow data loaders for several audio datasets☆112Updated 5 years ago
- ☆58Updated 6 years ago
- A fully convolutional network for speech-to-text, built on PyTorch.☆126Updated 4 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆126Updated 5 years ago
- Stochastic Adaptive Neural Architecture Search☆65Updated 6 years ago
- Transformer for an ASR system (via TensorFlow 2.0)☆114Updated 5 years ago
- Code and instructions for replicating the experiments in the paper "Unified Hypersphere Embedding for Speaker Recognition"☆31Updated 5 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 5 years ago
- Baseline method for sound event localization task of DCASE 2020 challenge☆53Updated 4 years ago
- Time-delay neural network (TDNN) implemented in PyTorch☆80Updated 7 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset☆12Updated 5 years ago
- LSTM CTC End2End Speech Recognition.☆38Updated 5 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- The Additive Margin MobileNet1D is a new lightweight deep learning model for speaker recognition which is based on the MobileNetV2 archi…☆29Updated last year
- Benchmark for sound event localization task of DCASE 2019 challenge☆76Updated 4 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆71Updated 5 years ago
- Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection w…☆187Updated 2 years ago
- Audio-Visual Speech Recognition using Deep Learning☆60Updated 6 years ago
- Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- Paper: https://arxiv.org/abs/1702.02285☆63Updated 6 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago