Automated Lip Reading using Deep Reinforcement Learning
☆32 · Jun 24, 2018 · Updated 7 years ago
Alternatives and similar repositories for lips-reading
Users that are interested in lips-reading are comparing it to the libraries listed below.
- My experiments in lip reading using deep learning with the LRW dataset ☆53 · Mar 14, 2021 · Updated 5 years ago
- Using an LSTM and 4d convolutional network for lip reading ☆12 · May 11, 2018 · Updated 7 years ago
- ☆64 · Oct 8, 2018 · Updated 7 years ago
- LipNet with gluon ☆23 · Nov 22, 2022 · Updated 3 years ago
- Automated Lip reading from real-time videos in tensorflow in python ☆163 · Mar 20, 2018 · Updated 8 years ago
- Code and models for evaluating a state-of-the-art lip reading network ☆196 · Mar 24, 2023 · Updated 3 years ago
- Pytorch code for End-to-End Audiovisual Speech Recognition ☆184 · Nov 18, 2022 · Updated 3 years ago
- CNN for visual speech recognition ☆23 · Dec 5, 2016 · Updated 9 years ago
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis. ☆94 · Jul 23, 2025 · Updated 8 months ago
- Local File Inclusion (LFI) in FHEM 6.0 allows an attacker to include a file, it can lead to sensitive information disclosure. ☆12 · Jan 20, 2021 · Updated 5 years ago
- The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild ☆26 · Nov 23, 2018 · Updated 7 years ago
- ☆11 · Feb 27, 2025 · Updated last year
- ☆12 · May 11, 2024 · Updated last year
- ☆12 · Aug 14, 2018 · Updated 7 years ago
- Facial-Expression Recognition with Deep Neural Networks ☆10 · Mar 6, 2016 · Updated 10 years ago
- An augmented reality menu experience ☆10 · Oct 8, 2017 · Updated 8 years ago
- A collection of papers I am interested in. ☆29 · Apr 3, 2023 · Updated 3 years ago
- The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi… ☆235 · Sep 21, 2022 · Updated 3 years ago
- Deep Steganalysis training script ☆13 · May 1, 2025 · Updated 11 months ago
- Octave port of the Fast Image Source Model by Eric A. Lehmann. Used for room acoustic modeling and impulse response simulation. ☆12 · Aug 2, 2017 · Updated 8 years ago
- Sakhi, a mobile-first app tailored for women, encompasses daily journals, safety features, community, and holistic health tools. Elevate … ☆12 · Mar 7, 2024 · Updated 2 years ago
- There are many studies done to detect anomalies based on logs. Current approaches are mainly divided into three categories: supervised le… ☆11 · Jan 10, 2022 · Updated 4 years ago
- Script to simulate room impulse responses ☆15 · Sep 29, 2016 · Updated 9 years ago
- A Question Generation Application leveraging RAG and Weaviate vector store to be able to retrieve relative contexts and generate a more u… ☆17 · Feb 3, 2025 · Updated last year
- ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS… ☆433 · May 18, 2023 · Updated 2 years ago
- Chinese words classification using lipnet with pytorch ☆40 · Nov 18, 2019 · Updated 6 years ago
- ☆10 · Sep 19, 2022 · Updated 3 years ago
- Deep Visual Speech Recognition in arabic words ☆16 · Oct 18, 2023 · Updated 2 years ago
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper. ☆243 · Feb 15, 2024 · Updated 2 years ago
- Lip Reading - Cross Audio-Visual Recognition using 3D Architectures ☆1,905 · Nov 7, 2022 · Updated 3 years ago
- A Unity Tutorial for 2D games 🎮! Acts as a template for a 2D platformer ☆13 · May 19, 2023 · Updated 2 years ago
- Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a cla… ☆18 · May 3, 2015 · Updated 10 years ago
- Visual Speech Recognition for Multiple Languages ☆465 · Aug 17, 2023 · Updated 2 years ago
- Approaches and code shared for the preliminary and final rounds of the 2019 "创青春·交子杯" XW Bank University FinTech Challenge ☆28 · Dec 11, 2019 · Updated 6 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation” ☆15 · Jan 3, 2025 · Updated last year
- Gait recognition system based on YOLOv8 ☆16 · Jan 26, 2024 · Updated 2 years ago
- A tensorflow implementation of the R3DCNN network by Molchanov (2016). ☆10 · Sep 13, 2017 · Updated 8 years ago
- create timer videos at any speed. ☆15 · Sep 25, 2023 · Updated 2 years ago
- ☆12 · Sep 19, 2021 · Updated 4 years ago