oppo-us-research / USSTLinks
☆18Updated last year
Alternatives and similar repositories for USST
Users that are interested in USST are comparing it to the libraries listed below
Sorting:
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆110Updated last week
- OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding☆18Updated this week
- Visualization of the PCA as shown in Figure 1.☆33Updated last year
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆120Updated last year
- ☆93Updated 2 weeks ago
- The official repository for paper "MLLMs Need 3D-Aware Representation Supervision for Scene Understanding"☆79Updated 2 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆40Updated 2 years ago
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision☆15Updated 6 months ago
- Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video'☆14Updated 4 months ago
- For Ego4D VQ3D Task☆21Updated last year
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆69Updated 8 months ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆46Updated last year
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆89Updated last year
- Official code for NeurIPS 2023 SpotLight: VoxDet: Voxel Learning for Novel Instance Detection☆31Updated last year
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆95Updated 6 months ago
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding☆28Updated 5 months ago
- ☆163Updated 5 months ago
- Unifying 2D and 3D Vision-Language Understanding☆98Updated 2 weeks ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆78Updated last year
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆131Updated last week
- 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians (ACM MM 25)☆45Updated 3 weeks ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆55Updated last year
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆39Updated 8 months ago
- Unsupervised Semantic Correspondence Using Stable Diffusion☆57Updated last year
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆31Updated last year
- CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos"☆19Updated last year
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024.☆52Updated 9 months ago
- [ICCV 2025] Detect Anything 3D in the Wild☆163Updated last month
- [CVPR 2025 Highlight] Towards Autonomous Micromobility through Scalable Urban Simulation☆100Updated 2 weeks ago
- ☆75Updated 2 months ago