oppo-us-research / USSTLinks
☆19Updated last year
Alternatives and similar repositories for USST
Users that are interested in USST are comparing it to the libraries listed below
Sorting:
- [Nips 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆122Updated 4 months ago
- ☆95Updated 4 months ago
- Official code for NeurIPS 2023 SpotLight: VoxDet: Voxel Learning for Novel Instance Detection☆30Updated last year
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆42Updated 11 months ago
- Unifying 2D and 3D Vision-Language Understanding☆116Updated 4 months ago
- Unsupervised Semantic Correspondence Using Stable Diffusion☆59Updated last year
- Visualization of the PCA as shown in Figure 1.☆40Updated last year
- HD-EPIC Python script to download the entire datasets or parts of it☆14Updated last month
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆45Updated 2 years ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆122Updated last year
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Updated 2 years ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆47Updated last year
- (Incomplete version) This is an implementation of affordancellm.☆16Updated last year
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆58Updated last year
- Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video'☆16Updated 8 months ago
- ☆170Updated 9 months ago
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024.☆54Updated last year
- [NeurIPS 2022] Segmenting Moving Objects via an Object-Centric Representation. Junyu Xie, Weidi Xie, Andrew Zisserman.☆32Updated last year
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆82Updated last year
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision☆16Updated 9 months ago
- Bidirectional Mapping between Action Physical-Semantic Space☆32Updated 2 months ago
- Code implementation of the paper "World-in-World: World Models in a Closed-Loop World"☆98Updated this week
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆71Updated last year
- For Ego4D VQ3D Task☆22Updated last year
- 🔍 Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-p…☆121Updated last year
- Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".☆40Updated 5 months ago
- [NeurIPS 2025] MLLMs Need 3D-Aware Representation Supervision for Scene Understanding☆122Updated 3 weeks ago
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆137Updated 4 months ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)☆116Updated 8 months ago
- OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding☆19Updated 2 weeks ago