yunongLiu1 / IKEA-Manuals-at-Work
IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos
☆41Updated last month
Alternatives and similar repositories for IKEA-Manuals-at-Work:
Users that are interested in IKEA-Manuals-at-Work are comparing it to the libraries listed below
- ☆96Updated 2 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆37Updated 5 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆55Updated last month
- [CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs☆38Updated 10 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆103Updated 5 months ago
- ☆49Updated 7 months ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆44Updated 10 months ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆43Updated last year
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆19Updated last month
- ☆50Updated 2 weeks ago
- ☆64Updated 3 months ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆75Updated 9 months ago
- [NeurIPS 2024] Official code repository for MSR3D paper☆52Updated 2 weeks ago
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"☆21Updated last month
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation☆44Updated 4 months ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆30Updated 4 months ago
- Unifying 2D and 3D Vision-Language Understanding☆79Updated 3 weeks ago
- SceneFun3D ToolKit☆133Updated 3 weeks ago
- [ECCV'24] Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer☆67Updated 9 months ago
- Code repository for the Habitat Synthetic Scenes Dataset (HSSD) paper.☆88Updated 11 months ago
- [CVPR 2024] Physical Property Understanding from Language-Embedded Feature Fields☆71Updated last year
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆37Updated 2 years ago
- ☆39Updated 11 months ago
- Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"☆16Updated 6 months ago
- Code implementation of CVPR 2024 highlight paper "PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI"☆147Updated 6 months ago
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆77Updated 6 months ago
- ☆31Updated 9 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆59Updated last month
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding☆27Updated 2 months ago
- Official PyTorch implementation of Doduo: Dense Visual Correspondence from Unsupervised Semantic-Aware Flow☆44Updated last year