The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild
☆26Nov 23, 2018Updated 7 years ago
Alternatives and similar repositories for D3D
Users that are interested in D3D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>☆19Sep 4, 2020Updated 5 years ago
- Pytorch code for End-to-End Audiovisual Speech Recognition☆184Nov 18, 2022Updated 3 years ago
- "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆70Sep 9, 2019Updated 6 years ago
- The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…☆168Sep 12, 2025Updated 8 months ago
- ☆11May 31, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi…☆237Sep 21, 2022Updated 3 years ago
- My experiments in lip reading using deep learning with the LRW dataset☆54Mar 14, 2021Updated 5 years ago
- Audio-Visual Speech Recognition using Deep Learning☆61Nov 14, 2018Updated 7 years ago
- Torch code for using Residual Networks with LSTMs for Lipreading☆99Oct 8, 2018Updated 7 years ago
- This dataset is presented in the paper Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video…☆12Sep 21, 2022Updated 3 years ago
- Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Select…☆18Apr 12, 2021Updated 5 years ago
- Code release for paper "How good is my GAN?"☆12Mar 9, 2019Updated 7 years ago
- 2019年“创青春·交子杯”新网银行高校金融科技挑战赛初赛、决赛思路代码分享☆28Dec 11, 2019Updated 6 years ago
- Python scripts for general purposes, data analysis, and plotting.☆14Sep 25, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Automated Lip Reading using Deep Reinforcement Learning☆32Jun 24, 2018Updated 7 years ago
- ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…☆435May 18, 2023Updated 3 years ago
- Lip Reading in the Wild using ResNet and LSTMs in PyTorch☆57Apr 23, 2018Updated 8 years ago
- An ugly tool for labeling segmentations given images and the corresponding superpixels.☆14May 18, 2016Updated 10 years ago
- Using an LSTM and 4d convolutional network for lip reading☆12May 11, 2018Updated 8 years ago
- ☆19Jul 14, 2019Updated 6 years ago
- pytorch implementation of SOSELETO☆15Sep 5, 2019Updated 6 years ago
- sk-cnn is proposed in Skeleton based action recognition with convolutional neural network(PR 2016). Here implemented in Keras☆19Apr 10, 2018Updated 8 years ago
- OmniSyn: Synthesizing 360 Videos with Wide-baseline Panoramas (VRW 2022)☆14Mar 5, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- EfficientDet_anchor_free☆11Feb 19, 2020Updated 6 years ago
- Portal of Johannes and Felix's RNN implementation and further modifications for ASR☆21Nov 27, 2014Updated 11 years ago
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆19Nov 13, 2020Updated 5 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset☆12Dec 11, 2019Updated 6 years ago
- A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)☆19May 27, 2020Updated 6 years ago
- [ACL 2026] Paper list of Video LLM hallucination. Welcome to Star and Contribute!☆34May 2, 2026Updated 3 weeks ago
- Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification (BMVC 2019)☆143Jun 13, 2021Updated 4 years ago
- Code for our submision on ICCV2017. A fork from https://github.com/rbgirshick/py-faster-rcnn☆21Sep 18, 2017Updated 8 years ago
- PyTorch implementation of Human Action Recognition Based on Spatial-Temporal Attention at ICLR 2019☆14Dec 12, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- tensorflow serving and deep model online https://dataxujing.github.io/tensorflow-serving-Wechat/?transition=convex#/☆19Nov 23, 2018Updated 7 years ago
- ☆23May 29, 2024Updated 2 years ago
- Use human pose information to help action recognition, explored with attention-pooling method, C3D method and two-stream architecture, im…☆18Jun 7, 2018Updated 7 years ago
- ☆13Nov 6, 2021Updated 4 years ago
- Detects lip movement and check if a person is speaking☆19May 4, 2018Updated 8 years ago
- Chinese words classification using lipnet with pytorch☆40Nov 18, 2019Updated 6 years ago
- Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"☆17May 22, 2021Updated 5 years ago