gabrielegoletto / AMEGO
Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024
☆36Updated 2 months ago
Alternatives and similar repositories for AMEGO:
Users that are interested in AMEGO are comparing it to the libraries listed below
- Official repository for the paper: "SAMWISE: Infusing wisdom in SAM2 for Text-Driven Video Segmentation"☆39Updated 3 months ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆25Updated 10 months ago
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆17Updated 8 months ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆32Updated 5 months ago
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)☆34Updated this week
- IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos☆35Updated 2 months ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆58Updated 8 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆35Updated last year
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆69Updated 7 months ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆48Updated 7 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆82Updated last month
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Updated last year
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆60Updated last month
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …☆67Updated last year
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆21Updated 2 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆65Updated 4 months ago
- ☆42Updated 10 months ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆42Updated last year
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆36Updated last week
- Official Implementation of DINO-Foresight: Looking into the Future with DINO☆47Updated this week
- ☆48Updated 5 months ago
- Constraint Satisfaction Visual Grounding☆9Updated 3 months ago
- Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"☆15Updated 4 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 5 months ago
- ☆11Updated 5 months ago
- ☆24Updated last year
- Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentati…☆16Updated last month
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion"☆66Updated last year
- [WACV 2024] Learning the What and How of Annotation in Video Object Segmentation☆24Updated 8 months ago
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆46Updated this week