RayYoh / LaSSM
LaSSM: Efficient Semantic-Spatial Query Decoding via Local Aggregation and State Space Models for 3D Instance Segmentation
☆16 Updated 8 months ago
Alternatives and similar repositories for LaSSM
Users who are interested in LaSSM are comparing it to the repositories listed below.
- 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians (ACM MM 25) ☆71 Updated 6 months ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream… ☆29 Updated 2 months ago
- ☆71 Updated 6 months ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding. ☆53 Updated 2 months ago
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding ☆32 Updated 11 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping ☆33 Updated 2 months ago
- Unifying 2D and 3D Vision-Language Understanding ☆121 Updated 6 months ago
- AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model ☆22 Updated 9 months ago
- Open-source implementations on real robots ☆34 Updated last year
- [CVPR 2025] GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency ☆43 Updated 3 months ago
- [CVPR 2025] MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning ☆16 Updated 4 months ago
- ☆56 Updated last year
- ☆21 Updated 8 months ago
- [RA-L] Lost & Found dynamically tracks object poses from egocentric videos while updating a scene graph, enabling richer semantic 3D unde… ☆54 Updated 4 months ago
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24). ☆45 Updated 7 months ago
- [ICLR 2026] Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation" ☆72 Updated last month
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and… ☆60 Updated 10 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆172 Updated 7 months ago
- ☆31 Updated 4 months ago
- Implementation of Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins. [RSS 2025] ☆48 Updated 3 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation ☆93 Updated 8 months ago
- [ICLR 2025] Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention ☆29 Updated 11 months ago
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs. ☆69 Updated 7 months ago
- ☆143 Updated 9 months ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer ☆28 Updated 3 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning ☆43 Updated last year
- Zero-Shot Multi-Object Shape Completion (ECCV 2024) ☆27 Updated 10 months ago
- ☆20 Updated last year
- [ICLR 2025] Official code of "Segment any 3D Object with Language" ☆67 Updated 3 months ago
- 🔥GrabS in PyTorch (ICLR 2025 Spotlight) ☆19 Updated 5 months ago