oppo-us-research / USST
☆18 Updated last year
Alternatives and similar repositories for USST
Users who are interested in USST are comparing it to the repositories listed below
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation ☆114 Updated last month
- Visualization of the PCA as shown in Figure 1. ☆39 Updated last year
- Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video' ☆14 Updated 5 months ago
- The official repository for paper "MLLMs Need 3D-Aware Representation Supervision for Scene Understanding" ☆90 Updated 2 weeks ago
- Unifying 2D and 3D Vision-Language Understanding ☆100 Updated last month
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023) ☆40 Updated 2 years ago
- 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians (ACM MM 25) ☆50 Updated last month
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries" ☆80 Updated last year
- ☆93 Updated last month
- [ICCV2025] Where, What, Why: Towards Explainable Driver Attention Prediction ☆33 Updated 2 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation ☆121 Updated last year
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image ☆46 Updated last year
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding ☆70 Updated 9 months ago
- code for affordance-r1 ☆26 Updated last week
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding ☆55 Updated last year
- OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding ☆18 Updated 3 weeks ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation ☆88 Updated last year
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding ☆95 Updated 7 months ago
- OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models ☆61 Updated 3 weeks ago
- For Ego4D VQ3D Task ☆21 Updated last year
- ☆77 Updated 3 months ago
- CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos" ☆19 Updated last year
- The official implementation of The paper "Exploring the Potential of Encoder-free Architectures in 3D LMMs" ☆55 Updated 3 months ago
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence" ☆132 Updated last month
- ☆166 Updated 6 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning ☆39 Updated 8 months ago
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision ☆16 Updated 7 months ago
- Code&Data for Grounded 3D-LLM with Referent Tokens ☆125 Updated 7 months ago
- In CVPR'2024. Meta-Point Learning and Refining for Category-Agnostic Pose Estimation ☆19 Updated last year
- Unsupervised Semantic Correspondence Using Stable Diffusion ☆58 Updated last year