Automated Lip Reading using Deep Reinforcement Learning
☆32Jun 24, 2018Updated 7 years ago
Alternatives and similar repositories for lips-reading
Users that are interested in lips-reading are comparing it to the libraries listed below.
- My experiments in lip reading using deep learning with the LRW dataset☆53Mar 14, 2021Updated 5 years ago
- End-to-end pipeline for lip reading at the word level using a TensorFlow CNN implementation.☆35Feb 15, 2020Updated 6 years ago
- ☆65Oct 8, 2018Updated 7 years ago
- Automated lip reading from real-time videos using TensorFlow in Python☆164Mar 20, 2018Updated 8 years ago
- Audio-Visual Speech Recognition using Deep Learning☆61Nov 14, 2018Updated 7 years ago
- Speech Recognition without audio input☆144Jan 14, 2019Updated 7 years ago
- Lip Reading in the Wild using ResNet and LSTMs in PyTorch☆58Apr 23, 2018Updated 7 years ago
- Demo code for lip reading☆21Dec 9, 2016Updated 9 years ago
- Code and models for evaluating a state-of-the-art lip reading network☆196Mar 24, 2023Updated 3 years ago
- Keras implementation of 'LipNet: End-to-End Sentence-level Lipreading'☆688Nov 22, 2022Updated 3 years ago
- "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆69Sep 9, 2019Updated 6 years ago
- PyTorch code for End-to-End Audiovisual Speech Recognition☆184Nov 18, 2022Updated 3 years ago
- CNN for visual speech recognition☆23Dec 5, 2016Updated 9 years ago
- A pipeline to read lips and generate speech for the read content, i.e., Lip to Speech Synthesis.☆94Jul 23, 2025Updated 8 months ago
- The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild☆26Nov 23, 2018Updated 7 years ago
- The implementation of 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Mar 23, 2018Updated 8 years ago
- SQL Tutorials using Jupyter Notebook☆17Apr 9, 2023Updated 2 years ago
- ☆12May 11, 2024Updated last year
- Get Tunisian translation, audio and sample sentence for the 20,000 most common English words☆13Jan 20, 2024Updated 2 years ago
- Facial-Expression Recognition with Deep Neural Networks☆10Mar 6, 2016Updated 10 years ago
- An augmented reality menu experience☆10Oct 8, 2017Updated 8 years ago
- A collection of papers I am interested in.☆29Apr 3, 2023Updated 2 years ago
- The state-of-the-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi…☆235Sep 21, 2022Updated 3 years ago
- Sakhi, a mobile-first app tailored for women, encompasses daily journals, safety features, community, and holistic health tools. Elevate …☆11Mar 7, 2024Updated 2 years ago
- There are many studies done to detect anomalies based on logs. Current approaches are mainly divided into three categories: supervised le…☆10Jan 10, 2022Updated 4 years ago
- Script to simulate room impulse responses☆15Sep 29, 2016Updated 9 years ago
- ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…☆433May 18, 2023Updated 2 years ago
- #DNN #CNN #LSTM #Classification #Sequential_data #Lip_reading☆28Jun 3, 2018Updated 7 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Deep Visual Speech Recognition in Arabic words☆16Oct 18, 2023Updated 2 years ago
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.☆243Feb 15, 2024Updated 2 years ago
- Lip Reading - Cross Audio-Visual Recognition using 3D Architectures☆1,902Nov 7, 2022Updated 3 years ago
- This is an introduction to Retrieval-Augmented Generation (RAG) for beginners. It uses Llama 2 LLM, FAISS vector store, and LangChain as…☆17Jul 8, 2025Updated 8 months ago
- Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in everyday scenarios. We train a cla…☆18May 3, 2015Updated 10 years ago
- The code of '3D-Aware Semantic-Guided Generative Model for Human Synthesis' (ECCV 2022)☆36Jul 18, 2022Updated 3 years ago
- Visual Speech Recognition for Multiple Languages☆460Aug 17, 2023Updated 2 years ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆22Aug 4, 2024Updated last year
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Gait recognition system based on YOLOv8☆15Jan 26, 2024Updated 2 years ago