The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild
☆26Nov 23, 2018Updated 7 years ago
Alternatives and similar repositories for D3D
Users that are interested in D3D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.069…☆122Mar 13, 2026Updated 3 months ago
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>☆19Sep 4, 2020Updated 5 years ago
- The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…☆168Sep 12, 2025Updated 9 months ago
- ☆15Dec 11, 2021Updated 4 years ago
- ☆11May 31, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi…☆238Sep 21, 2022Updated 3 years ago
- My experiments in lip reading using deep learning with the LRW dataset☆54Mar 14, 2021Updated 5 years ago
- Finalist entry for the M2CAI Workflow Challenge 2016☆10Nov 25, 2016Updated 9 years ago
- Torch code for using Residual Networks with LSTMs for Lipreading☆99Oct 8, 2018Updated 7 years ago
- This dataset is presented in the paper Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video…☆12Sep 21, 2022Updated 3 years ago
- Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Select…☆18Apr 12, 2021Updated 5 years ago
- Code release for paper "How good is my GAN?"☆12Mar 9, 2019Updated 7 years ago
- 2019年“创青春·交子杯”新网银行高校金融科技挑战赛初赛、决赛思路代码分享☆28Dec 11, 2019Updated 6 years ago
- Active appearance model toolbox☆14Nov 2, 2015Updated 10 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…☆438May 18, 2023Updated 3 years ago
- ☆11Sep 16, 2014Updated 11 years ago
- Lip Reading in the Wild using ResNet and LSTMs in PyTorch☆57Apr 23, 2018Updated 8 years ago
- An ugly tool for labeling segmentations given images and the corresponding superpixels.☆14May 18, 2016Updated 10 years ago
- ☆19Jul 14, 2019Updated 6 years ago
- ☆16Apr 20, 2020Updated 6 years ago
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 6 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasets☆16Nov 9, 2021Updated 4 years ago
- The 1st place solution for AutoSpeech 2019.☆17Jun 9, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Portal of Johannes and Felix's RNN implementation and further modifications for ASR☆21Nov 27, 2014Updated 11 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset☆12Dec 11, 2019Updated 6 years ago
- ☆12Oct 5, 2022Updated 3 years ago
- Translating Torch model to other framework such as Caffe, MxNet ...☆22Dec 16, 2016Updated 9 years ago
- Guide for installing Hackintosh on Dell 7577☆10Aug 17, 2019Updated 6 years ago
- ☆64Oct 8, 2018Updated 7 years ago
- A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)☆19May 27, 2020Updated 6 years ago
- Structured Receptive Fields in Convolutional Neural Networks☆47Feb 20, 2018Updated 8 years ago
- Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification (BMVC 2019)☆144Jun 13, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 一个测试各种功能的demo☆12Apr 16, 2020Updated 6 years ago
- modified version of src☆17Jan 13, 2018Updated 8 years ago
- Code for our submision on ICCV2017. A fork from https://github.com/rbgirshick/py-faster-rcnn☆21Sep 18, 2017Updated 8 years ago
- PyTorch implementation of Human Action Recognition Based on Spatial-Temporal Attention at ICLR 2019☆14Dec 12, 2018Updated 7 years ago
- Use human pose information to help action recognition, explored with attention-pooling method, C3D method and two-stream architecture, im…☆18Jun 7, 2018Updated 8 years ago
- Detects lip movement and check if a person is speaking☆19May 4, 2018Updated 8 years ago
- ☆12Sep 19, 2021Updated 4 years ago