The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild
☆26Nov 23, 2018Updated 7 years ago
Alternatives and similar repositories for D3D
Users who are interested in D3D are comparing it to the libraries listed below.
- DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.069…☆119Mar 13, 2026Updated 2 weeks ago
- "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆69Sep 9, 2019Updated 6 years ago
- The state-of-the-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi…☆235Sep 21, 2022Updated 3 years ago
- Finalist entry for the M2CAI Workflow Challenge 2016☆10Nov 25, 2016Updated 9 years ago
- Audio-Visual Speech Recognition using Deep Learning☆61Nov 14, 2018Updated 7 years ago
- Torch code for using Residual Networks with LSTMs for Lipreading☆99Oct 8, 2018Updated 7 years ago
- This dataset is presented in the paper Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video…☆12Sep 21, 2022Updated 3 years ago
- Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Select…☆18Apr 12, 2021Updated 4 years ago
- Approaches and code shared for the preliminary and final rounds of the 2019 “创青春·交子杯” 新网银行 (XW Bank) University FinTech Challenge☆28Dec 11, 2019Updated 6 years ago
- Automated Lip Reading using Deep Reinforcement Learning☆32Jun 24, 2018Updated 7 years ago
- ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…☆433May 18, 2023Updated 2 years ago
- An ugly tool for labeling segmentations given images and the corresponding superpixels.☆14May 18, 2016Updated 9 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasets☆15Nov 9, 2021Updated 4 years ago
- ☆16Apr 20, 2020Updated 5 years ago
- sk-cnn is proposed in Skeleton based action recognition with convolutional neural network(PR 2016). Here implemented in Keras☆19Apr 10, 2018Updated 7 years ago
- pytorch implementation of SOSELETO☆15Sep 5, 2019Updated 6 years ago
- The 1st place solution for AutoSpeech 2019.☆17Jun 9, 2020Updated 5 years ago
- Paper list of Video LLM hallucination. Welcome to Star and Contribute!☆23Mar 6, 2026Updated 3 weeks ago
- Multi-Head-Attention RNN PyTorch implementation for keyword spotting☆19Nov 13, 2020Updated 5 years ago
- [IROS 2024 Oral Pitch] PyTorch Implementation of "Dual-Branch Graph Transformer Network for 3D Human Mesh Reconstruction from Video"☆15Jul 19, 2024Updated last year
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset☆12Dec 11, 2019Updated 6 years ago
- Attention-based multimodal fusion for sentiment analysis☆13Aug 14, 2018Updated 7 years ago
- Translating Torch model to other framework such as Caffe, MxNet ...☆22Dec 16, 2016Updated 9 years ago
- A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)☆19May 27, 2020Updated 5 years ago
- ☆65Oct 8, 2018Updated 7 years ago
- A demo for testing various features☆12Apr 16, 2020Updated 5 years ago
- Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification (BMVC 2019)☆143Jun 13, 2021Updated 4 years ago
- modified version of src☆17Jan 13, 2018Updated 8 years ago
- Keras implementation of 'LipNet: End-to-End Sentence-level Lipreading'☆688Nov 22, 2022Updated 3 years ago
- ☆20May 29, 2024Updated last year
- Use human pose information to help action recognition, explored with attention-pooling method, C3D method and two-stream architecture, im…☆18Jun 7, 2018Updated 7 years ago
- ☆13Nov 6, 2021Updated 4 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 6 months ago
- ☆13May 10, 2022Updated 3 years ago
- Pytorch plugin to generate saliency maps for neural networks☆12Nov 1, 2018Updated 7 years ago
- Chinese words classification using lipnet with pytorch☆40Nov 18, 2019Updated 6 years ago
- ☆12Sep 19, 2021Updated 4 years ago
- Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"☆17May 22, 2021Updated 4 years ago
- ☆108Sep 20, 2017Updated 8 years ago