InternRobotics/EgoThinker

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/InternRobotics/EgoThinker)

InternRobotics / EgoThinker

Official implementation of EgoThinker at NIPS 2025

☆29

Alternatives and similar repositories for EgoThinker

Users that are interested in EgoThinker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ayiyayi / EgoExoBench
View on GitHub
☆15Nov 13, 2025Updated 8 months ago
SooLab / EyeWO
View on GitHub
[NeurIPS2025] The official PyTorch implementation of the "Eyes Wide Open: Ego Proactive Video-LLM for Streaming Video".
☆34Dec 25, 2025Updated 6 months ago
AV-Reasoner / AV-Reasoner
View on GitHub
☆19Jul 22, 2025Updated 11 months ago
IVUL-KAUST / VideoAuto-R1
View on GitHub
[CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice
☆88Feb 27, 2026Updated 4 months ago
OpenGVLab / EgoExoLearn
View on GitHub
[CVPR 2024] Data and benchmark code for the EgoExoLearn dataset
☆85Aug 26, 2025Updated 10 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
iSEE-Laboratory / EgoExo-Fitness
View on GitHub
(ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"
☆38Apr 8, 2025Updated last year
facebookresearch / ego4d-goalstep
View on GitHub
Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)
☆61Apr 15, 2024Updated 2 years ago
lntzm / HICom
View on GitHub
[CVPR2025] Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models
☆21Apr 30, 2025Updated last year
Buzz-Beater / EgoTaskQA
View on GitHub
Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.
☆44Apr 17, 2023Updated 3 years ago
doc-doc / EgoBlind
View on GitHub
EgoBlind: Towards Egocentric Visual Assistance for the Blind (NeurIPS'25, D&B Track)
☆23Apr 20, 2026Updated 3 months ago
zihuixue / AlignEgoExo
View on GitHub
Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align…
☆19Apr 5, 2024Updated 2 years ago
InternRobotics / InternSR
View on GitHub
InternRobotics' open-source toolbox for vision-based embodied spatial intelligence.
☆49Sep 18, 2025Updated 10 months ago
Beckschen / spatialcode
View on GitHub
Open studio for "Thinking with Spatial Code" (https://arxiv.org/pdf/2603.05591)
☆20Mar 18, 2026Updated 4 months ago
HaroldChen19 / VistaDPO
View on GitHub
[ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
☆41Jun 14, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
md-mohaiminul / BIMBA
View on GitHub
☆29Jul 25, 2025Updated 11 months ago
hmxiong / StreamChat
View on GitHub
Official repo for "Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge" ICLR2025
☆111Mar 14, 2025Updated last year
facebookresearch / egoman
View on GitHub
The repository provides code for EgoMAN model and dataset creation scripts.
☆32Dec 31, 2025Updated 6 months ago
egolife-ai / Ego-R1
View on GitHub
[TPAMI 2026] Ego-R1: Agentic Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning
☆165Jun 10, 2026Updated last month
wufeim / SpatialReasonerDataGen
View on GitHub
Synthetic VQA data generation code for SpatialReasoner.
☆20Nov 25, 2025Updated 7 months ago
multimodal-art-projection / IV-Bench
View on GitHub
☆14Apr 23, 2025Updated last year
EvolvingLMMs-Lab / EgoLife
View on GitHub
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
☆447Mar 19, 2025Updated last year
Richard-Zhang-AI / KVPO
View on GitHub
☆24Jun 6, 2026Updated last month
longmalongma / TW-GRPO
View on GitHub
The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"
☆36Jun 12, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Lzq5 / Video-Text-Alignment
View on GitHub
☆28Jul 18, 2025Updated last year
aleflabo / PREGO
View on GitHub
The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detect…
☆34Jun 9, 2025Updated last year
Becomebright / ReKV
View on GitHub
[ICLR'25] Streaming Video Question-Answering with In-context Video KV-Cache Retrieval
☆121Nov 4, 2025Updated 8 months ago
aiming-lab / ReAgent-V
View on GitHub
[NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding
☆51Sep 21, 2025Updated 9 months ago
XuWuLingYu / WristWorld
View on GitHub
The official code of paper WristWorld.
☆29Nov 8, 2025Updated 8 months ago
telepathylabsai / dialog_breakdown_detection
View on GitHub
☆10Nov 8, 2022Updated 3 years ago
air-embodied-brain / Em-Garde
View on GitHub
Implementation of Em_Garde: a proposal-retrieval framework for streaming video understanding
☆26Jun 24, 2026Updated 3 weeks ago
martian422 / MaskGRPO
View on GitHub
The official implementation of MaskGRPO: Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models. (ICLR 2026, arxiv…
☆19Jan 27, 2026Updated 5 months ago
PotassiumWings / BUAA-CO-2019
View on GitHub
☆11Jan 15, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mbzuai-oryx / Video-R2
View on GitHub
Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models
☆19Jan 21, 2026Updated 6 months ago
ayiyayi / Awesome-Egocentric-and-Exocentric-Vision
View on GitHub
☆40Nov 14, 2025Updated 8 months ago
mlvlab / DeepVideoR1
View on GitHub
[NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"
☆35Feb 22, 2026Updated 4 months ago
zjuchenlong / WSAG
View on GitHub
[EMNLP'22] Weakly-Supervised Temporal Article Grounding
☆14Nov 25, 2023Updated 2 years ago
CeeZh / SILVR
View on GitHub
Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"
☆19Jan 18, 2026Updated 6 months ago
AtsuMiyai / rethinking_rotation
View on GitHub
[WACV2023] This is the official PyTorch impelementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…
☆12Feb 24, 2023Updated 3 years ago
f-ilic / SelectivePrivacyPreservation
View on GitHub
[CVPR 2024] Selective, Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition
☆12Mar 20, 2024Updated 2 years ago