LaVi-Lab/Rethink_CoT_Video

LaVi-Lab / Rethink_CoT_Video

Official code for "Rethinking Chain-of-Thought Reasoning for Videos"

☆21

Alternatives and similar repositories for Rethink_CoT_Video

Users who are interested in Rethink_CoT_Video are comparing it to the repositories listed below.

PKU-VaLuE-Lab / m3eval
Official code for M3Eval: Multi-Modal Memory Evaluation through Cognitively-Grounded Video Tasks
☆21 · Jun 4, 2026 · Updated last month
jhuang81 / weak-sup-visual-grounding
The official implementation of the CVPR 2021 paper "Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation"
☆12 · Oct 15, 2021 · Updated 4 years ago
IVUL-KAUST / VideoAuto-R1
[CVPR 2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice
☆88 · Feb 27, 2026 · Updated 5 months ago
mbzuai-oryx / Video-CoM
Video-CoM: Interactive Video Reasoning via Chain of Manipulations
☆22 · Jun 17, 2026 · Updated last month
abrarmajeedi / rica2_aqa
Code release for RICA^2: Rubric-Informed, Calibrated Assessment of Actions (ECCV 2024)
☆15 · Nov 9, 2025 · Updated 8 months ago
ShaoqLin / DiscoSG
[EMNLP 2025 Outstanding Paper Award] Official repo for DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph …
☆22 · Nov 16, 2025 · Updated 8 months ago
OpenGVLab / VKnowU
[ECCV 2026] VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs
☆16 · Feb 3, 2026 · Updated 5 months ago
LJungang / Awesome-Video-Reasoning-Landscape
🔥 An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.
☆190 · Jun 14, 2026 · Updated last month
Yui010206 / Adaptive-Visual-Imagination-Control
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning
☆18 · Jun 2, 2026 · Updated last month
hy0Y / ST-GT
[CVPR 2024] Official repository of ST_GT
☆10 · Sep 15, 2024 · Updated last year
LaVi-Lab / Visual-Table
[EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"
☆20 · Oct 17, 2024 · Updated last year
bethgelab / supersanity
A critical analysis of the Cambrian-S model and VSI-Super benchmarks
☆16 · Nov 20, 2025 · Updated 8 months ago
mbzuai-oryx / Video-R2
Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models
☆19 · Jan 21, 2026 · Updated 6 months ago
dragonlzm / PAVE
Implementation of PAVE: Patching and Adapting Video Large Language Models (CVPR 2025)
☆27 · Sep 6, 2025 · Updated 10 months ago
daniel-cores / tvbench
TVBench: Redesigning Video-Language Evaluation
☆15 · Jun 9, 2025 · Updated last year
hwanyu112 / Latent-Sketchpad
☆73 · Feb 1, 2026 · Updated 5 months ago
LaVi-Lab / EgoMask
[ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"
☆27 · Jul 3, 2026 · Updated 3 weeks ago
byminji / map-the-flow
[ICLR 2026] Official implementation of the paper "Map the Flow: Revealing Hidden Pathways of Information in VideoLLMs"
☆25 · Mar 3, 2026 · Updated 4 months ago
Accio-Lab / SwimBird
☆18 · Apr 9, 2026 · Updated 3 months ago
longmalongma / TW-GRPO
The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"
☆36 · Jun 12, 2025 · Updated last year
FrederikWarburg / bayesian-metric-learning
☆20 · Dec 13, 2023 · Updated 2 years ago
sail-sg / Video-Next-Event-Prediction
☆28 · Aug 9, 2025 · Updated 11 months ago
HumanMLLM / IRG-MotionLLM
[ECCV 2026] Official repository of paper "IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Gene…
☆30 · Jul 1, 2026 · Updated 3 weeks ago
lcqysl / VideoSSR
[CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"
☆41 · Nov 11, 2025 · Updated 8 months ago
KYRIE-LI11 / VideoMark
☆23 · Aug 23, 2025 · Updated 11 months ago
LuckyyySTA / GOLF
☆18 · Mar 16, 2026 · Updated 4 months ago
CVC2233 / AndroTMem
AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents
☆25 · Jul 5, 2026 · Updated 3 weeks ago
fansunqi / VideoTool
Official repository for the NeurIPS'25 paper "Tool-Augmented Spatiotemporal Reasoning for Streamlining Video Question Answering Task"
☆23 · May 18, 2026 · Updated 2 months ago
facebookresearch / ProcedureVRL
[CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"
☆56 · Aug 8, 2023 · Updated 2 years ago
dingyue772 / OmniSIFT
[ICML 2026] OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models
☆26 · May 21, 2026 · Updated 2 months ago
CYWang735 / AdaTooler-V
☆72 · Feb 27, 2026 · Updated 5 months ago
JaaackHongggg / WorldSense
WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs
☆50 · Jul 12, 2026 · Updated 2 weeks ago
NoSyu / VHUCM
Implementation of Variational Hierarchical User-based Conversation Model
☆10 · Jul 2, 2021 · Updated 5 years ago
superxjm / HybridTransparentRecon
☆13 · May 26, 2023 · Updated 3 years ago
Video-Reason / Awesome-Video-Reasoning
A collection of recent papers on reasoning in video generation models.
☆165 · Jul 21, 2026 · Updated last week
SaraGhazanfari / CoF
Chain-of-Frames [CVPR 2026]
☆40 · Jul 2, 2025 · Updated last year
worldbench / VideoLucy
[NeurIPS 2025] Deep Memory Backtracking for Long Video Understanding
☆68 · Feb 10, 2026 · Updated 5 months ago
StarsThu2016 / ApproxDet
☆12 · Nov 16, 2020 · Updated 5 years ago
jylins / videoseek
[CVPR 2026] VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking
☆64 · Mar 23, 2026 · Updated 4 months ago