The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild
☆26Nov 23, 2018Updated 7 years ago
Alternatives and similar repositories for D3D
Users that are interested in D3D are comparing it to the libraries listed below.
- Pytorch code for End-to-End Audiovisual Speech Recognition☆184Nov 18, 2022Updated 3 years ago
- My experiments in lip reading using deep learning with the LRW dataset☆53Mar 14, 2021Updated 5 years ago
- Finalist entry for the M2CAI Workflow Challenge 2016☆10Nov 25, 2016Updated 9 years ago
- Audio-Visual Speech Recognition using Deep Learning☆61Nov 14, 2018Updated 7 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆46Sep 22, 2020Updated 5 years ago
- Active appearance model toolbox☆14Nov 2, 2015Updated 10 years ago
- Automated Lip Reading using Deep Reinforcement Learning☆32Jun 24, 2018Updated 7 years ago
- ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…☆433May 18, 2023Updated 2 years ago
- Lip Reading in the Wild using ResNet and LSTMs in PyTorch☆57Apr 23, 2018Updated 7 years ago
- An ugly tool for labeling segmentations given images and the corresponding superpixels.☆14May 18, 2016Updated 9 years ago
- Using an LSTM and 4d convolutional network for lip reading☆12May 11, 2018Updated 7 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasets☆15Nov 9, 2021Updated 4 years ago
- ☆16Apr 20, 2020Updated 5 years ago
- pytorch implementation of SOSELETO☆15Sep 5, 2019Updated 6 years ago
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 5 years ago
- [ACM MM 2024] PyTorch Implementation of "ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recov…☆15Feb 27, 2025Updated last year
- The 1st place solution for AutoSpeech 2019.☆17Jun 9, 2020Updated 5 years ago
- Paper list of Video LLM hallucination. Welcome to Star and Contribute!☆23Apr 1, 2026Updated 2 weeks ago
- Multi-Head-Attention RNN pytorch implementation for keyword spotting☆19Nov 13, 2020Updated 5 years ago
- [IROS 2024 Oral Pitch] PyTorch Implementation of "Dual-Branch Graph Transformer Network for 3D Human Mesh Reconstruction from Video"☆15Jul 19, 2024Updated last year
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset☆12Dec 11, 2019Updated 6 years ago
- Translating Torch model to other framework such as Caffe, MxNet ...☆22Dec 16, 2016Updated 9 years ago
- A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)☆19May 27, 2020Updated 5 years ago
- Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification (BMVC 2019)☆143Jun 13, 2021Updated 4 years ago
- modified version of src☆17Jan 13, 2018Updated 8 years ago
- Code for our submission to ICCV2017. A fork from https://github.com/rbgirshick/py-faster-rcnn☆21Sep 18, 2017Updated 8 years ago
- tensorflow serving and deep model online https://dataxujing.github.io/tensorflow-serving-Wechat/?transition=convex#/☆19Nov 23, 2018Updated 7 years ago
- PyTorch implementation of Human Action Recognition Based on Spatial-Temporal Attention at ICLR 2019☆14Dec 12, 2018Updated 7 years ago
- Keras implementation of 'LipNet: End-to-End Sentence-level Lipreading'☆688Nov 22, 2022Updated 3 years ago
- ☆22May 29, 2024Updated last year
- Use human pose information to help action recognition, explored with attention-pooling method, C3D method and two-stream architecture, im…☆18Jun 7, 2018Updated 7 years ago
- ☆13May 10, 2022Updated 3 years ago
- Chinese words classification using lipnet with pytorch☆40Nov 18, 2019Updated 6 years ago
- Pytorch plugin to generate saliency maps for neural networks☆12Nov 1, 2018Updated 7 years ago
- ☆12Sep 19, 2021Updated 4 years ago
- Theano bindings for Baidu's CTC library.☆20Aug 25, 2016Updated 9 years ago
- Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"☆17May 22, 2021Updated 4 years ago
- Fully Convolutional Geometric Features (FCGF, ICCV19) based on spconv library☆16Jul 14, 2022Updated 3 years ago
- This is a toolbox repository to help evaluate various methods that perform image matching from a pair of images.☆12Jul 5, 2023Updated 2 years ago