Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition"
☆18Apr 12, 2021Updated 4 years ago
Alternatives and similar repositories for deep-face-speechreading
Users that are interested in deep-face-speechreading are comparing it to the libraries listed below
Sorting:
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>☆19Sep 4, 2020Updated 5 years ago
- ☆15Dec 11, 2021Updated 4 years ago
- The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…☆165Sep 12, 2025Updated 5 months ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Apr 11, 2022Updated 3 years ago
- Official Implementation of Visual Transformer Pooling for Lip reading☆40Aug 8, 2022Updated 3 years ago
- "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆69Sep 9, 2019Updated 6 years ago
- PyTorch implementation of "Distinguishing Homophenes using Multi-Head Visual-Audio Memory" (AAAI2022)☆27Mar 9, 2024Updated last year
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆19Nov 13, 2020Updated 5 years ago
- Visual Speech Recognition for Multiple Languages☆459Aug 17, 2023Updated 2 years ago
- [CVPR 2025 Highlight] Ev3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras☆26Aug 2, 2025Updated 7 months ago
- ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…☆433May 18, 2023Updated 2 years ago
- The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi…☆234Sep 21, 2022Updated 3 years ago
- Python toolkit for Visual Speech Recognition☆38Jun 10, 2020Updated 5 years ago
- PyTorch Implementation of the paper "Defining and Quantifying the Emergence of Sparse Concepts in DNNs" (CVPR 2023)☆12Dec 24, 2023Updated 2 years ago
- Python资源大全中文版,内容包括:Web框架、网络爬虫、网络内容提取、模板引擎、数据库、数据可视化、图片处理、文本处理、自然语言处理、机器学习、日志、代码分析等☆11May 24, 2016Updated 9 years ago
- Operating tools for texture bank files.☆10Nov 2, 2016Updated 9 years ago
- ☆11Aug 20, 2025Updated 6 months ago
- DiG-IN: Diffusion Guidance for Investigating Networks - Uncovering Classifier Differences, Neuron Visualisations, and Visual Counterfactu…☆10Oct 9, 2024Updated last year
- Unofficial PyTorch implementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting", Berg et al. 2021.☆40Oct 11, 2022Updated 3 years ago
- profile tools for pytorch nn models☆42Jan 11, 2021Updated 5 years ago
- Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets☆12May 25, 2023Updated 2 years ago
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models☆10Feb 20, 2025Updated last year
- Fairness-Aware Representation Learning by Suppressing Attribute-Class Associations☆12Dec 10, 2024Updated last year
- ☆11Mar 24, 2025Updated 11 months ago
- [CVPR'25] AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data☆17Mar 27, 2025Updated 11 months ago
- evaluation of shot detection results using the RAI dataset☆10Jun 7, 2018Updated 7 years ago
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition☆10Jul 1, 2022Updated 3 years ago
- [NeurIPS 2025] TopoPoint: Enhance Topology Reasoning via Endpoint Detection in Autonomous Driving☆29Dec 13, 2025Updated 2 months ago
- The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization"☆10Dec 24, 2024Updated last year
- Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective☆11Nov 16, 2022Updated 3 years ago
- EPWING dictionary viewer☆11Nov 13, 2018Updated 7 years ago
- ☆44Nov 14, 2019Updated 6 years ago
- One-Pixel Shortcut: on the Learning Preference of Deep Neural Networks (ICLR 2023 Spotlight)☆14Sep 28, 2025Updated 5 months ago
- Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024)☆16Oct 29, 2024Updated last year
- Visual Speech Recognition For Low-Resource Languages with Automatic Labels (ICASSP 2024)☆16Mar 17, 2025Updated 11 months ago
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆11May 15, 2024Updated last year
- Repository for "A Convolutional Neural Network Cascade for Face Detection", implemented with Python interface.☆13Nov 16, 2017Updated 8 years ago
- Code for fitting masks to face images in the wild☆10Jul 29, 2021Updated 4 years ago
- ☆11May 31, 2020Updated 5 years ago