prajwalkr / vtpLinks
Official Implementation of Visual Transformer Pooling for Lip reading
☆40Updated 3 years ago
Alternatives and similar repositories for vtp
Users that are interested in vtp are comparing it to the libraries listed below
Sorting:
- The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…☆165Updated 3 months ago
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)☆77Updated 9 months ago
- a PyTorch implementation of Lip2Wav☆51Updated 3 years ago
- Code for the Active Speakers in Context Paper (CVPR2020)☆56Updated 4 years ago
- Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Select…☆18Updated 4 years ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆68Updated last year
- ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…☆427Updated 2 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Updated 2 years ago
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>