The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild
☆26Nov 23, 2018Updated 7 years ago
Alternatives and similar repositories for D3D
Users that are interested in D3D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>☆19Sep 4, 2020Updated 5 years ago
- Pytorch code for End-to-End Audiovisual Speech Recognition☆184Nov 18, 2022Updated 3 years ago
- The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…☆166Sep 12, 2025Updated 7 months ago
- ☆15Dec 11, 2021Updated 4 years ago
- ☆11May 31, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- My experiments in lip reading using deep learning with the LRW dataset☆54Mar 14, 2021Updated 5 years ago
- Torch code for using Residual Networks with LSTMs for Lipreading☆99Oct 8, 2018Updated 7 years ago
- This dataset is presented in the paper Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video…☆12Sep 21, 2022Updated 3 years ago
- Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Select…☆18Apr 12, 2021Updated 5 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆47Sep 22, 2020Updated 5 years ago
- Code release for paper "How good is my GAN?"☆12Mar 9, 2019Updated 7 years ago
- 2019年“创青春·交子杯”新网银行高校金融科技挑战赛初赛、决赛思路代码分享☆28Dec 11, 2019Updated 6 years ago
- Automated Lip Reading using Deep Reinforcement Learning☆32Jun 24, 2018Updated 7 years ago
- ☆11Sep 16, 2014Updated 11 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An ugly tool for labeling segmentations given images and the corresponding superpixels.☆14May 18, 2016Updated 9 years ago
- Using an LSTM and 4d convolutional network for lip reading☆12May 11, 2018Updated 7 years ago
- ☆16Apr 20, 2020Updated 6 years ago
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 5 years ago
- Portal of Johannes and Felix's RNN implementation and further modifications for ASR☆21Nov 27, 2014Updated 11 years ago
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆19Nov 13, 2020Updated 5 years ago
- Attention-based multimodal fusion for sentiment analysis☆13Aug 14, 2018Updated 7 years ago
- Guide for installing Hackintosh on Dell 7577☆10Aug 17, 2019Updated 6 years ago
- Structured Receptive Fields in Convolutional Neural Networks☆47Feb 20, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification (BMVC 2019)☆143Jun 13, 2021Updated 4 years ago
- 一个测试各种功能的demo☆12Apr 16, 2020Updated 6 years ago
- modified version of src☆17Jan 13, 2018Updated 8 years ago
- Code for our submision on ICCV2017. A fork from https://github.com/rbgirshick/py-faster-rcnn☆21Sep 18, 2017Updated 8 years ago
- tensorflow serving and deep model online https://dataxujing.github.io/tensorflow-serving-Wechat/?transition=convex#/☆19Nov 23, 2018Updated 7 years ago
- Keras implementation of 'LipNet: End-to-End Sentence-level Lipreading'☆688Nov 22, 2022Updated 3 years ago
- Use human pose information to help action recognition, explored with attention-pooling method, C3D method and two-stream architecture, im…☆18Jun 7, 2018Updated 7 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Apr 12, 2026Updated 3 weeks ago
- ☆13May 10, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Detects lip movement and check if a person is speaking☆19May 4, 2018Updated 8 years ago
- Chinese words classification using lipnet with pytorch☆40Nov 18, 2019Updated 6 years ago
- Theano bindings for Baidu's CTC library.☆20Aug 25, 2016Updated 9 years ago
- ☆108Sep 20, 2017Updated 8 years ago
- HEtero-Assists Distillation for Heterogeneous Object Detectors☆10Jul 3, 2023Updated 2 years ago
- Automated Lip reading from real-time videos in tensorflow in python☆163Mar 20, 2018Updated 8 years ago
- Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17☆115Feb 15, 2017Updated 9 years ago