Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2024.
☆24Jun 13, 2024Updated last year
Alternatives and similar repositories for EgoPack
Users that are interested in EgoPack are comparing it to the libraries listed below
Sorting:
- ☆21Apr 4, 2025Updated 11 months ago
- ☆13Jul 22, 2025Updated 7 months ago
- Official Repository for "Communication Efficient Federated Learning with Generalized Heavy-Ball Momentum", accepted at TMLR 2025☆27Jul 14, 2025Updated 7 months ago
- N-EPIC-Kitchens: The event-based camera extension of the large-scale EPIC-Kitchens dataset.☆23May 10, 2022Updated 3 years ago
- MaskPlanner is a deep learning model for the quick generation of multiple, long-horizon paths from free-form 3D objects represented as po…☆21Jun 20, 2025Updated 8 months ago
- Visual Relationship Reasoning for Grasp Planning☆18May 22, 2025Updated 9 months ago
- DROPO: Sim-to-Real Transfer with Offline Domain Randomization☆25Jul 8, 2025Updated 7 months ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆43Dec 7, 2024Updated last year
- Interface to stable-baselines3 APIs for training RL policies on gym-registered environments☆12Jan 24, 2024Updated 2 years ago
- Official implementation of https://arxiv.org/abs/2106.03496☆15Jul 27, 2022Updated 3 years ago
- Domain Randomization via Entropy Maximization☆23Apr 18, 2024Updated last year
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆19Jul 10, 2025Updated 7 months ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆12Apr 11, 2025Updated 10 months ago
- [CHI24] AI-Assisted In-Context Writing on OHMD During Travels☆11Dec 19, 2024Updated last year
- List of papers wrote by Focoos AI research team!☆12Jun 3, 2025Updated 9 months ago
- Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection".☆11Nov 13, 2024Updated last year
- Collection of gym environments with support for domain randomization☆10Dec 11, 2024Updated last year
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos☆33May 27, 2025Updated 9 months ago
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d.First, we start from a population model which ha…☆15Jan 16, 2025Updated last year
- Code of the Grounded MUIE model, REAMO☆11Dec 3, 2024Updated last year
- ☆23Jun 12, 2025Updated 8 months ago
- ☆26Jun 20, 2024Updated last year
- Official code releasse for "The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation"☆31Aug 19, 2025Updated 6 months ago
- Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos☆28Oct 27, 2025Updated 4 months ago
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation☆130Mar 10, 2025Updated 11 months ago
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆26Feb 1, 2024Updated 2 years ago
- Code for the paper "A Sea of Words: An In-Depth Analysis of Anchors for Text Data", AISTATS 2023☆15Oct 26, 2024Updated last year
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆34Jun 12, 2023Updated 2 years ago
- 🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.…☆347Dec 11, 2025Updated 2 months ago
- ☆11Sep 24, 2021Updated 4 years ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- Official implementation of paper "Semantic Novelty Detection via Relational Reasoning"☆15Jul 10, 2023Updated 2 years ago
- Egocentric Video Understanding Dataset (EVUD)☆33Jul 4, 2024Updated last year
- ☆31Oct 27, 2022Updated 3 years ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆104Jul 2, 2024Updated last year
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆256May 9, 2024Updated last year
- Official PyTorch code of GroundVQA (CVPR'24)☆64Sep 13, 2024Updated last year
- [ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban…☆26Jul 15, 2025Updated 7 months ago
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆43Mar 11, 2025Updated 11 months ago