yl3800 / LASOLinks
☆39Updated last year
Alternatives and similar repositories for LASO
Users that are interested in LASO are comparing it to the libraries listed below
Sorting:
- CVPR 2025☆35Updated 7 months ago
- One-Shot Open Affordance Learning with Foundation Models (CVPR 2024)☆45Updated last year
- [CVPR-2025] GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding☆28Updated 3 months ago
- ☆44Updated last year
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆45Updated 2 years ago
- [NeurIPS 2024] Official code repository for MSR3D paper☆68Updated 3 months ago
- [NeurIPS 2025 Spotlight] MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning☆64Updated last month
- Official implementation of ICCV 2025 paper "TACO: Taming Diffusion for in-the-wild Video Amodal Completion"☆26Updated 4 months ago
- IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos☆53Updated 7 months ago
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model☆133Updated last week
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022☆69Updated last year
- HORT: Monocular Hand-held Objects Reconstruction with Transformers, ICCV 2025☆45Updated 7 months ago
- Code implementation of CVPR 2024 highlight paper "PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI"☆178Updated 5 months ago
- [ICCV2025] AnyBimanual: Transfering Unimanual Policy for General Bimanual Manipulation☆91Updated 4 months ago
- 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians (ACM MM 25)☆60Updated 4 months ago
- (Incomplete version) This is an implementation of affordancellm.☆14Updated last year
- OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding☆19Updated last week
- [IROS 2023] Open-Vocabulary Affordance Detection in 3d Point Clouds☆80Updated last year
- ☆40Updated 4 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆56Updated 7 months ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆41Updated 2 months ago
- Implementation of Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins. [RSS 2025]☆45Updated last month
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆195Updated last month
- ☆64Updated 4 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆80Updated last year
- CVPR2025 | TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation☆30Updated 2 months ago
- ☆52Updated last year
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆248Updated 7 months ago
- Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"☆19Updated last year
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆82Updated last year