jarfo / gcommands
Speech Commands Recognition using end-to-end deep learning models in PyTorch
☆27 Updated 4 years ago
Related projects
Alternatives and complementary repositories for gcommands
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset. ☆26 Updated 5 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation ☆23 Updated 2 years ago
- ☆16 Updated 5 years ago
- PyTorch implementation of PLDA as described in https://ravisoji.com/assets/papers/ioffe2006probabilistic.pdf ☆15 Updated 4 years ago
- Neural network based similarity scoring for diarization (PyTorch implementation of "LSTM based Similarity Measurement with Spectral Clust… ☆43 Updated 4 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e… ☆64 Updated 5 years ago
- Develop speaker recognition model based on i-vector using TIMIT database ☆16 Updated 5 years ago
- University of Edinburgh-Johns Hopkins University's system for ASVspoof 2017 Version 2.0 dataset. ☆49 Updated 5 years ago
- This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification". ☆43 Updated last year
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP… ☆59 Updated 4 years ago
- Download and create a tfreader for the audioset dataset ☆16 Updated 4 years ago
- Mining effective negative training samples for keyword spotting (PyTorch) ☆57 Updated 4 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020 ☆42 Updated 4 years ago
- Sound event detection with depthwise separable and dilated convolutions. ☆53 Updated 4 years ago
- ICASSP2019 Tutorial: Detection and Classification of Acoustic Scenes and Events / Code examples ☆41 Updated 5 years ago
- Implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING" ☆35 Updated 4 years ago
- Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augment… ☆38 Updated last year
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification) ☆98 Updated 4 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020 ☆22 Updated 4 years ago
- Fast SpecAugmentation code with NumPy and SciPy ☆30 Updated 5 years ago
- A PyTorch implementation of the paper: Acoustic Scene Classification with Multiple Decision Schemes. ☆20 Updated 3 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and dec… ☆37 Updated 6 years ago
- Various algorithms for voice activity detection ☆22 Updated 7 years ago
- Code for DCASE 2020 task 1a and task 1b. ☆85 Updated 2 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi ☆63 Updated 5 years ago
- ☆19 Updated 5 years ago
- Filtering and Noise Adding Tool ☆29 Updated 2 years ago
- ☆98 Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec… ☆43 Updated last year
- 💬 A list of End-to-End speech recognition, including papers, codes and other materials ☆53 Updated 5 years ago