voletiv / GRIDcorpus-experiments
My experiments with lip reading using GRIDcorpus dataset
☆9Updated 7 years ago
Alternatives and similar repositories for GRIDcorpus-experiments
Users who are interested in GRIDcorpus-experiments are comparing it to the repositories listed below
- Audio-Visual Speech Recognition using Deep Learning☆60Updated 6 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.☆36Updated 5 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆82Updated 4 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 2 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Updated 5 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- [ICASSP19] An Interaction-aware Attention Network for Speech Emotion Recognition in Spoken Dialogs☆35Updated 5 years ago
- MTGAN: Speaker Verification through Multitasking Triplet Generative Adversarial Networks☆19Updated 5 years ago
- This Repository includes four different implementations of the Speaker Verification task including the GMM_UBM, Ivector, Deep-Speaker, an…☆32Updated 6 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- University of Edinburgh-Johns Hopkins University's system for the ASVspoof 2017 Version 2.0 dataset.☆49Updated 6 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- This is an implementation of kaldi-plda.☆15Updated 6 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆26Updated 9 months ago
- ☆38Updated 3 years ago
- Processing and extraction of face and mouth image files from the TCDTIMIT database☆45Updated 4 years ago
- Build an attention-based model for speech recognition. Use the Word2vec model to help train the attention model.☆29Updated 5 years ago
- Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 6 years ago
- Language identification using Siamese network based on i-vector☆7Updated 7 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆107Updated last year
- VoxCeleb plugin for pyannote.database☆29Updated 3 years ago
- Speaker diarization with GMM-UBM and MAP Adaptation☆30Updated 6 years ago
- ☆35Updated 6 years ago
- Work inspired by the project on SER using ELM at Microsoft Research☆19Updated 6 years ago
- Classify the emotions from variable-length speech segments☆11Updated 7 years ago
- PyTorch implementation of a self-attentive speaker embedding☆17Updated 5 years ago
- ☆10Updated 5 years ago
- System for identifying the speaker from a given speech signal using MFCC and LPC features and Gaussian Mixture Models☆21Updated 7 years ago
- ☆60Updated 4 years ago