aldld / lip-reading
Models for performing visual speech recognition, i.e. lip reading from video.
☆8Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for lip-reading
- CNN for visual speech recognition☆23Updated 7 years ago
- ☆65Updated 6 years ago
- demo code for lip reading☆21Updated 7 years ago
- Audio-Visual Speech Recognition using Deep Learning☆59Updated 5 years ago
- Adversarial Auto-encoders for Speech Based Emotion Recogntion☆14Updated 6 years ago
- End to End Multiview Lip Reading☆10Updated 6 years ago
- Audio Visual Speech Recognition☆22Updated 7 years ago
- Lip Reading in the Wild using ResNet and LSTMs in PyTorch☆58Updated 6 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆81Updated 4 years ago
- Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab☆45Updated 7 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆125Updated 3 years ago
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"☆101Updated 5 years ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 7 years ago
- Python toolkit for Visual Speech Recognition☆37Updated 4 years ago
- Torch code for using Residual Networks with LSTMs for Lipreading☆98Updated 6 years ago
- Pytorch code for End-to-End Audiovisual Speech Recognition☆174Updated last year
- Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).☆130Updated 3 years ago
- Speech Recognition without audio input☆136Updated 5 years ago
- Automated Lip Reading using Deep Reinforcement Learning☆29Updated 6 years ago
- ☆10Updated 4 years ago
- A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schiz…☆52Updated 6 years ago
- Audio Classifier in Keras using Convolutional Neural Network☆160Updated 5 years ago
- An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three ne…☆28Updated 5 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆66Updated last year
- CTC for emotion recognition☆60Updated 7 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆44Updated 4 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆31Updated 5 years ago
- A machine learning application for emotion recognition from speech☆132Updated 6 years ago
- It uses GMM to train a speaker identification model. The training and testing has been done on subset (34 speakers) from VoxForge data co…☆55Updated 5 years ago