chunfeng3364 / LARC
☆17Updated 7 months ago
Alternatives and similar repositories for LARC:
Users that are interested in LARC are comparing it to the libraries listed below
- [NeurIPS 2024] Official code repository for MSR3D paper☆37Updated 2 weeks ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆43Updated 7 months ago
- ☆48Updated 4 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆35Updated 2 months ago
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆41Updated last month
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆64Updated 4 months ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆65Updated 6 months ago
- Code for "Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes"☆50Updated 10 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆46Updated 2 months ago
- G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆34Updated 2 weeks ago
- ☆43Updated 2 months ago
- Official implementation of Language Conditioned Spatial Relation Reasoning for 3D Object Grounding (NeurIPS'22).☆58Updated 2 years ago
- SceneFun3D ToolKit☆92Updated this week
- ☆42Updated 9 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆90Updated 3 months ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆47Updated 6 months ago
- OVExp: Open Vocabulary Exploration for Object-Oriented Navigation☆33Updated 7 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆34Updated last year
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆41Updated 11 months ago
- [ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects☆79Updated last year
- ☆37Updated this week
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆24Updated last month
- One-Shot Open Affordance Learning with Foundation Models (CVPR 2024)☆25Updated 6 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆55Updated last month
- Code for Stable Control Representations☆23Updated last month
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆35Updated 7 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 4 months ago
- Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"☆15Updated 4 months ago
- ☆40Updated last year