For Ego4D VQ3D Task
☆22Jan 9, 2024Updated 2 years ago
Alternatives and similar repositories for EgoLoc
Users that are interested in EgoLoc are comparing it to the libraries listed below
Sorting:
- CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos"☆19Jun 27, 2024Updated last year
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated last month
- [CHI24] AI-Assisted In-Context Writing on OHMD During Travels☆11Dec 19, 2024Updated last year
- ☆16Sep 25, 2025Updated 5 months ago
- Official PyTorch code of GroundVQA (CVPR'24)☆64Sep 13, 2024Updated last year
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆21Feb 27, 2025Updated last year
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆22Dec 9, 2024Updated last year
- ☆20Apr 14, 2023Updated 2 years ago
- Official repo for EscapeCraft (an 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025.☆36Jul 7, 2025Updated 7 months ago
- [ICCV 2023] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation☆24Aug 26, 2023Updated 2 years ago
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆24Jun 13, 2024Updated last year
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆104Jul 2, 2024Updated last year
- EgoTV Egocentric Task Verification from Natural Language Task Descriptions☆27Jan 9, 2024Updated 2 years ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆35Sep 9, 2024Updated last year
- Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos☆28Oct 27, 2025Updated 4 months ago
- 📚 A collection of resources and papers on Large Language Models in autonomous driving☆27Oct 30, 2023Updated 2 years ago
- A Massive Multi-Discipline Lecture Understanding Benchmark☆33Nov 1, 2025Updated 4 months ago
- A curated list of egocentric (first-person) vision and related area resources☆310Oct 14, 2024Updated last year
- ☆152Aug 23, 2023Updated 2 years ago
- A curated list of resources about long-context in large-language models and video understanding.☆32Aug 8, 2023Updated 2 years ago
- Official PyTorch Implementation of BB Generator & pRoI Generator [WACV2020]☆30Mar 24, 2021Updated 4 years ago
- [IJCV] EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning☆81Dec 6, 2024Updated last year
- Egocentric Video Understanding Dataset (EVUD)☆33Jul 4, 2024Updated last year
- ☆41Sep 9, 2025Updated 5 months ago
- Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models☆37Sep 19, 2023Updated 2 years ago
- Official implementation of `Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning`, CVPR 2025☆13Aug 1, 2025Updated 7 months ago
- Tool to convert '.com' Gaussian files into files supported by 3D rendering programs, such as Blender, Maya, and others.☆13Jan 15, 2026Updated last month
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆256May 9, 2024Updated last year
- Official implementation of Recurrent Action Transformer with Memory, an offline RL agent with memory mechanisms. https://sites.google.com…☆18Nov 23, 2025Updated 3 months ago
- ☆18Aug 7, 2025Updated 6 months ago
- ICCV'23 | Adverse Weather Removal with Codebook Priors☆10Aug 28, 2023Updated 2 years ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆12Apr 11, 2025Updated 10 months ago
- Self-Supervised Learning with Multi-View Rendering for 3D Point Cloud Analysis (ACCV 2022)☆10Jul 22, 2024Updated last year
- ☆13Jul 22, 2022Updated 3 years ago
- [CVPR 2022] Sequential Voting with Relational Box Fields for Active Object Detection☆10Jun 19, 2022Updated 3 years ago
- Official code for "Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model"☆12Oct 29, 2022Updated 3 years ago
- 基于langchain和chatglm6b构建的智能问答系统,支持自定义语料☆10Jun 25, 2023Updated 2 years ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆18Oct 19, 2025Updated 4 months ago
- A fast CUDA accelerated implementation for MVS evaluation.☆12Dec 1, 2022Updated 3 years ago