HCPLab-SYSU / 3DAffordSplatLinks
3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians (ACM MM 25)
☆67Updated 5 months ago
Alternatives and similar repositories for 3DAffordSplat
Users that are interested in 3DAffordSplat are comparing it to the libraries listed below
Sorting:
- LaSSM: Efficient Semantic-Spatial Query Decoding via Local Aggregation and State Space Models for 3D Instance Segmentation☆15Updated 7 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆171Updated 7 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆60Updated 9 months ago
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding☆32Updated 11 months ago
- Unifying 2D and 3D Vision-Language Understanding☆119Updated 5 months ago
- ☆44Updated last year
- AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model☆21Updated 9 months ago
- [CVPR 2025] GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency☆40Updated 2 months ago
- code for affordance-r1☆50Updated 3 weeks ago
- [CVPR-2025] GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding☆34Updated 5 months ago
- CVPR 2025☆39Updated last month
- ☆54Updated last year
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆202Updated 8 months ago
- [NeurIPS 2024] Official code repository for MSR3D paper☆69Updated last month
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆42Updated last year
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆32Updated last month
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”☆73Updated last month
- ☆67Updated 6 months ago
- OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding☆19Updated last month
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆61Updated last year
- SceneFun3D ToolKit☆165Updated 9 months ago
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆67Updated 6 months ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆83Updated last year
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆45Updated 7 months ago
- [ICLR 2025] Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention☆28Updated 10 months ago
- ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024)☆88Updated 3 months ago
- [ICLR 2025] Official code of "Segment any 3D Object with Language"☆53Updated 3 months ago
- IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos☆55Updated 9 months ago
- Open-source implementations on real robots☆34Updated last year
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆28Updated last month