[CVPR'24 Highlight] The official code and data for paper "EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models"
☆64Mar 25, 2025Updated last year
Alternatives and similar repositories for EgoThink
Users who are interested in EgoThink are comparing it to the repositories listed below.
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision☆17Feb 3, 2025Updated last year
- PyTorch implementation for Egoinstructor at CVPR 2024☆28Dec 1, 2024Updated last year
- Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos☆30Oct 27, 2025Updated 7 months ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆41Apr 11, 2025Updated last year
- ☆37Sep 16, 2024Updated last year
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆44Mar 11, 2025Updated last year
- Egocentric Video Understanding Dataset (EVUD)☆34Jul 4, 2024Updated last year
- [CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Challenges in CVPR 2024☆136May 11, 2025Updated last year
- [ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation".☆15Feb 24, 2025Updated last year
- ☆13Apr 13, 2026Updated 2 months ago
- ☆21May 29, 2023Updated 3 years ago
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)☆58Apr 15, 2024Updated 2 years ago
- [IJCV] EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning☆85Dec 6, 2024Updated last year
- CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos"☆19Jun 27, 2024Updated last year
- (ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"☆52Jul 1, 2025Updated 11 months ago
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆24Jun 13, 2024Updated 2 years ago
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆260May 9, 2024Updated 2 years ago
- The official code and data for paper "VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI"☆18Mar 25, 2025Updated last year
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆13Feb 5, 2024Updated 2 years ago
- [ICCV 2025 ✨ / IJCV 2023] Official code repository to download the VISTA and TREK-150 benchmark datasets and run experiments on them.☆12Aug 12, 2025Updated 10 months ago
- Official repo for "Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge" ICLR2025☆111Mar 14, 2025Updated last year
- ☆10Jul 5, 2024Updated last year
- Understanding Self-Supervised Learning in a non-IID Setting☆21Oct 21, 2022Updated 3 years ago
- [NeurIPS 2025] Continual Multimodal Contrastive Learning☆28Dec 18, 2025Updated 5 months ago
- ☆13Jul 20, 2024Updated last year
- ☆28Mar 20, 2023Updated 3 years ago
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information