InternRobotics / EgoHODLinks
Official implementation of EgoHOD at ICLR 2025; 14 EgoVis Challenge Winners in CVPR 2024
☆22Updated 2 weeks ago
Alternatives and similar repositories for EgoHOD
Users that are interested in EgoHOD are comparing it to the libraries listed below
Sorting:
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆30Updated 6 months ago
- Official code releasse for "The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation"☆27Updated last month
- A curated list of Egocentric Action Understanding resources☆24Updated last month
- Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video'☆14Updated 6 months ago
- ☆50Updated 5 months ago
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆38Updated 7 months ago
- ☆25Updated 4 months ago
- [Nips 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆118Updated 2 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆33Updated last year
- Accepted by CVPR 2024☆38Updated last year
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆29Updated last year
- ☆21Updated last year
- Official code for MotionBench (CVPR 2025)☆59Updated 7 months ago
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)☆44Updated last year
- CVPR 2025☆33Updated 5 months ago
- Bidirectional Mapping between Action Physical-Semantic Space☆31Updated last month
- Official implementation of CVPR24 highlight paper "Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Sce…☆161Updated last year
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆74Updated 8 months ago
- [ICML 2024] A Touch, Vision, and Language Dataset for Multimodal Alignment☆83Updated 4 months ago
- ☆88Updated 4 months ago
- ☆37Updated last year
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆35Updated last year
- M3GPT: An advanced multimodal, multitask framework for motion comprehension and generation.☆17Updated 9 months ago
- ☆90Updated last week
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Updated 8 months ago
- ☆25Updated 10 months ago
- OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding☆18Updated 2 months ago
- HORT: Monocular Hand-held Objects Reconstruction with Transformers, ICCV 2025☆42Updated 6 months ago
- ☆16Updated last year
- [NeurIPS 2024] Official code repository for MSR3D paper☆64Updated 2 months ago