gabrielegoletto / AMEGOLinks
Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024
☆38Updated 6 months ago
Alternatives and similar repositories for AMEGO
Users that are interested in AMEGO are comparing it to the libraries listed below
Sorting:
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆21Updated 11 months ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆27Updated last year
- Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentati…☆33Updated 4 months ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆33Updated 8 months ago
- ☆20Updated 2 months ago
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)☆39Updated last month
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆88Updated last year
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …☆68Updated last year
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Updated 2 years ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆66Updated 11 months ago
- ☆88Updated 2 weeks ago
- Unifying 2D and 3D Vision-Language Understanding☆82Updated last month
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆20Updated 5 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆38Updated 2 years ago
- Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"☆65Updated this week
- N-EPIC-Kitchens: The event-based camera extension of the large-scale EPIC-Kitchens dataset.☆22Updated 3 years ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆75Updated 10 months ago
- Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection? [ECCV, 2024]☆12Updated 4 months ago
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision☆11Updated 4 months ago
- Official repository for the paper "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."☆50Updated last week
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Updated 8 months ago
- ☆12Updated last month
- ☆11Updated 6 months ago
- This is the project for 'USG'.☆16Updated 2 months ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆34Updated 3 months ago
- Official repostory of the paper: Masked Scene Modeling (CVPR 2025)☆14Updated last month
- Improving Semantic Correspondences with Viewpoint-Guided Spherical Maps (CVPR 2024)☆20Updated 6 months ago
- 🔍 Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-p…☆110Updated 6 months ago
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆41Updated this week
- [NeurIPS 2022] Segmenting Moving Objects via an Object-Centric Representation. Junyu Xie, Weidi Xie, Andrew Zisserman.☆32Updated last year