Official implementation of EgoHOD at ICLR 2025; 14 EgoVis Challenge Winners in CVPR 2024
☆32Nov 25, 2025Updated 3 months ago
Alternatives and similar repositories for EgoHOD
Users that are interested in EgoHOD are comparing it to the libraries listed below
Sorting:
- ☆12Jul 22, 2025Updated 8 months ago
- InternRobotics' open-source toolbox for vision-based embodied spatial intelligence.☆48Sep 18, 2025Updated 6 months ago
- ☆29Nov 14, 2025Updated 4 months ago
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆24Jun 13, 2024Updated last year
- We introduce DiffH2O, a diffusion-based framework to synthesize dexterous hand-object interactions. DiffH2O generates realistic hand-obje…☆34Nov 21, 2025Updated 4 months ago
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆86Nov 27, 2025Updated 3 months ago
- Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"☆29Aug 28, 2023Updated 2 years ago
- An official pytorch implementation of the paper: [MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval].☆14Jul 27, 2024Updated last year
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆32Apr 8, 2025Updated 11 months ago
- ☆23Aug 19, 2024Updated last year
- ☆24Jun 12, 2025Updated 9 months ago
- Code for the paper "SMACE: A New Method for the Interpretability of Composite Decision Systems", ECML 2022☆15Apr 17, 2023Updated 2 years ago
- ☆44Jan 13, 2026Updated 2 months ago
- [CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Chanllenges in CVPR 2024☆133May 11, 2025Updated 10 months ago
- RoCoG-v2 (Robot Control Gestures) is a dataset intended to support the study of synthetic-to-real and ground-to-air video domain adaptati…☆17Mar 28, 2024Updated last year
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆12Apr 11, 2025Updated 11 months ago
- [AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling☆91Jan 11, 2026Updated 2 months ago
- ☆10Jul 14, 2023Updated 2 years ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆43Dec 7, 2024Updated last year
- MaskPlanner is a deep learning model for the quick generation of multiple, long-horizon paths from free-form 3D objects represented as po…☆21Jun 20, 2025Updated 9 months ago
- Code for the paper "Attention Meets Post-hoc Interpretability: A Mathematical Perspective", ICML 2024☆21Nov 10, 2025Updated 4 months ago
- ☆17Jan 26, 2025Updated last year
- Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and Challenges☆42Feb 10, 2026Updated last month
- Video Summarization With Spatiotemporal Vision Transformer☆24Jul 5, 2023Updated 2 years ago
- Official Repository for "Communication Efficient Federated Learning with Generalized Heavy-Ball Momentum", accepted at TMLR 2025☆27Jul 14, 2025Updated 8 months ago
- Code for the paper "Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric V…☆21Jan 9, 2025Updated last year
- HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos☆185Apr 5, 2025Updated 11 months ago
- ☆16Oct 28, 2025Updated 4 months ago
- This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)☆31Jun 28, 2024Updated last year
- Example external repository for interacting with armory.☆11May 2, 2022Updated 3 years ago
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection☆137Jul 28, 2025Updated 7 months ago
- ☆18Jan 8, 2026Updated 2 months ago
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align…☆19Apr 5, 2024Updated last year
- ☆15May 13, 2024Updated last year
- Research Paper Review Notes☆13Oct 26, 2018Updated 7 years ago
- ☆30Aug 14, 2023Updated 2 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- ☆26Apr 26, 2025Updated 10 months ago
- ☆22May 2, 2025Updated 10 months ago