gabrielegoletto / AMEGO
Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024
☆35Updated last month
Alternatives and similar repositories for AMEGO:
Users that are interested in AMEGO are comparing it to the libraries listed below
- Official repository for the paper: "SAMWISE: Infusing wisdom in SAM2 for Text-Driven Video Segmentation"☆34Updated 2 months ago
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆16Updated 7 months ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …☆67Updated last year
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆24Updated 9 months ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆32Updated 4 months ago
- ☆48Updated 3 months ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆54Updated last week
- ☆11Updated 4 months ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆100Updated last month
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆69Updated last month
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆19Updated last year
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)☆34Updated 3 months ago
- [WACV 2024] Learning the What and How of Annotation in Video Object Segmentation☆24Updated 7 months ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆57Updated 7 months ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆74Updated 8 months ago
- ☆86Updated 2 weeks ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)☆98Updated 8 months ago
- ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024)☆63Updated last month
- SceneFun3D ToolKit☆89Updated 3 months ago
- [NeurIPS 2024] Official code repository for MSR3D paper☆33Updated last week
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆47Updated 5 months ago
- Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection (ICCV23)☆40Updated last year
- IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos☆36Updated last month
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 4 months ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆60Updated 5 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆64Updated 3 months ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆33Updated last month
- For Ego4D VQ3D Task☆19Updated last year
- ☆70Updated last month
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆35Updated 2 months ago