AdaCheng / VidEgoThinkLinks

The official code and data for paper "VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI"

☆15

Alternatives and similar repositories for VidEgoThink

Users that are interested in VidEgoThink are comparing it to the libraries listed below

Sorting:

linkangheng / Video-UTR
[ICLR2025] Official code implementation of Video-UTR: Unhackable Temporal Rewarding for Scalable Video MLLMs
☆60Updated 8 months ago
The-Martyr / Awesome-Multimodal-Reasoning
Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal Large Language Models
☆39Updated last week
www-Ye / Time-R1
R1-like Video-LLM for Temporal Grounding
☆121Updated 4 months ago
RupertLuo / VoCoT
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
☆75Updated last year
Wang-Xiaodong1899 / Open-R1-Video
✨First Open-Source R1-like Video-LLM [2025/02/18]
☆369Updated 8 months ago
LightChen233 / M3CoT
☆84Updated last year
Hui-design / R1-Video-fixbug
[Blog 1] Recording a bug of grpo_trainer in some R1 projects
☆20Updated 8 months ago
HKUST-LongGroup / Awesome-MLLM-Benchmarks
☆143Updated 8 months ago
yaolinli / TimeChat-Online
[ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
☆86Updated last month
tanhuajie / Reason-RFT
[NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.
☆229Updated 3 weeks ago
appletea233 / Temporal-R1
Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency
☆55Updated 4 months ago
mat-agent / MAT-Agent
MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)
☆66Updated 4 months ago
Open-Reasoner-Zero / Open-Vision-Reasoner
The official repository for our paper, "Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning".
☆142Updated last month
RifleZhang / LLaVA-Hound-DPO
☆155Updated 11 months ago
Wild-Cooperation-Hub / Awesome-MLLM-Reasoning-Benchmarks
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
☆69Updated 7 months ago
njucckevin / MM-Self-Improve
A Self-Training Framework for Vision-Language Reasoning
☆84Updated 9 months ago
ChenYi99 / EgoPlan
[IJCV] EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning
☆74Updated 10 months ago
JoseponLee / IntentQA
Official repository for "IntentQA: Context-aware Video Intent Reasoning" from ICCV 2023.
☆21Updated 10 months ago
GuangyanS / Sys2-LLaVA
☆28Updated 8 months ago
Video-R1 / Awesome-Multimodal-Reasoning
Collections of Papers and Projects for Multimodal Reasoning.
☆105Updated 6 months ago
JoeLeelyf / OVO-Bench
[CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
☆93Updated 3 months ago
Hui-design / Open-LLaVA-Video-R1
[LLaVA-Video-R1]✨First Adaptation of R1 to LLaVA-Video (2025-03-18)
☆33Updated 5 months ago
PKU-YuanGroup / Look-Back
This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".
☆64Updated 3 months ago
FanqingM / MM-Eureka-V0
MM-Eureka V0 also called R1-Multimodal-Journey, Latest version is in MM-Eureka
☆320Updated 4 months ago
PlusLabNLP / VISCO
[CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
☆14Updated 4 months ago
DoubtedSteam / MM-GCoT
The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"
☆17Updated 3 months ago
z-x-yang / DoraemonGPT
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
☆86Updated last year
MME-Benchmarks / MME-CoT
MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency
☆131Updated 2 months ago
mu-cai / TemporalBench
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
☆37Updated 11 months ago
ADaM-BJTU / Mind_with_eyes_Awesome_MLLMs_Reasoning
This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!
☆54Updated 7 months ago