JasonQSY / 3DOI
[ICCV 2023] Understanding 3D Object Interaction from a Single Image
ā43Updated last year
Alternatives and similar repositories for 3DOI:
Users that are interested in 3DOI are comparing it to the libraries listed below
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)ā44Updated 10 months ago
- IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videosā41Updated last month
- š±šš Perform conditional procedural generation to generate houses like your own!ā35Updated last year
- [CVPR 2022] Understanding 3D Object Articulation in Internet Videosā31Updated last year
- Official PyTorch implementation of Doduo: Dense Visual Correspondence from Unsupervised Semantic-Aware Flowā44Updated last year
- ā36Updated last year
- HInt dataset from HaMeR: Reconstructing Hands in 3D with Transformersā43Updated last year
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimationā44Updated 4 months ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"ā75Updated 9 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generationā103Updated 5 months ago
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervisionā11Updated 3 months ago
- ā16Updated last year
- ā25Updated 2 years ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)ā37Updated 2 years ago
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videosā63Updated last year
- (ECCV 2022 Oral) TO-Scene: A Large-scale Dataset for Understanding 3D Tabletop Scenesā53Updated 3 months ago
- Agent-to-Sim Learning Interactive Behavior from Casual Videos.ā43Updated 6 months ago
- Bidirectional Mapping between Action Physical-Semantic Spaceā31Updated 8 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoningā37Updated 4 months ago
- ā48Updated 2 weeks ago
- [ECCV 2022, Oral] OPD: Single-view 3D Openable Part Detectionā33Updated last year
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"ā19Updated last month
- ā17Updated last month
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasksā58Updated 7 months ago
- Official PyTorch implementation of NeuralDiff: Segmenting 3D objects that move in egocentric videos.ā31Updated 2 years ago
- Official Reimplementation of Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips (DiffHOI, ICCV23) https://judyye.gā¦ā36Updated last year
- Codes for "Affordance Diffusion: Synthesizing Hand-Object Interactions"ā122Updated 5 months ago
- [ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.ā93Updated 11 months ago
- ā42Updated last year
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Explorationā48Updated this week