oppo-us-research / USST
☆17Updated 7 months ago
Alternatives and similar repositories for USST:
Users that are interested in USST are comparing it to the libraries listed below
- Official code for MotionBench☆25Updated this week
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆92Updated 3 months ago
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆36Updated last week
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆60Updated last month
- ☆18Updated 2 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 5 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆82Updated last month
- For Ego4D VQ3D Task☆19Updated last year
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆35Updated last year
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆32Updated 10 months ago
- ☆80Updated 9 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆65Updated 4 months ago
- [NeurIPS 2024] Official code for paper "EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection"☆27Updated 2 months ago
- ☆58Updated last year
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆69Updated 7 months ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)☆101Updated 9 months ago
- (ICCV 2023) Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction☆25Updated last year
- IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos☆35Updated 2 months ago
- ☆88Updated last month
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆35Updated 2 months ago
- ☆65Updated 8 months ago
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆21Updated 5 months ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆48Updated 7 months ago
- [NeurIPS 2024] Official code repository for MSR3D paper☆37Updated this week
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆42Updated 2 months ago
- SceneFun3D ToolKit☆119Updated this week
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆27Updated 6 months ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆33Updated 2 months ago
- [CVPR 2023] Detecting Human-Object Contact in Images☆51Updated last year
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆77Updated 6 months ago