aiming-lab/ReAgent-V

[NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding

☆46

Alternatives and similar repositories for ReAgent-V

Users who are interested in ReAgent-V compare it to the repositories listed below.

64327069 / LVAgent
Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
☆28 · Nov 24, 2025 · Updated 3 months ago
fansunqi / AKeyS
Agentic Keyframe Search for Video Question Answering
☆16 · Apr 7, 2025 · Updated 10 months ago
aiming-lab / MIRA
When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought
☆27 · Feb 14, 2026 · Updated 2 weeks ago
showlab / PANDA
[NeurIPS 2025] PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer
☆28 · Oct 2, 2025 · Updated 5 months ago
SaraGhazanfari / EMMA
EMMA [TMLR 2025]
☆12 · Sep 25, 2025 · Updated 5 months ago
aiming-lab / EduVisAgent
[ICLR'26] EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization
☆28 · Aug 5, 2025 · Updated 6 months ago
tingyu215 / TS-LLaVA
TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models
☆19 · Jan 2, 2025 · Updated last year
Tencent / SelfEvolvingAgent
Research works from Tencent AI Lab regarding self-evolving agents
☆82 · Jan 30, 2026 · Updated last month
mlvlab / DeepVideoR1
[NeurIPS25] Official Implementation (PyTorch) of "DeepVideo-R1"
☆31 · Feb 22, 2026 · Updated last week
Hungryyan1 / UniCorn
☆48 · Jan 13, 2026 · Updated last month
zhangce01 / DeGF
[ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models
☆24 · Apr 14, 2025 · Updated 10 months ago
UCSC-VLAA / Sight-Beyond-Text
[TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
☆20 · Sep 15, 2023 · Updated 2 years ago
chenxi52 / CMPF
Open-Vocabulary Panoptic Segmentation
☆27 · Jun 15, 2025 · Updated 8 months ago
ZhaoJingjing713 / HPR
[CVPR 2024] Hybrid Proposal Refiner: Revisiting DETR Series from the Faster R-CNN Perspective
☆21 · Aug 18, 2024 · Updated last year
YiyangZhou / POVID
[arXiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning
☆91 · Apr 30, 2024 · Updated last year
AnilOsmanTur / conditioned_video_anomaly_diffusion
☆23 · Sep 5, 2023 · Updated 2 years ago
mlvlab / VidChain
Official Implementation (PyTorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…
☆23 · Jan 26, 2025 · Updated last year
BillChan226 / HALC
[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"
☆109 · Dec 4, 2024 · Updated last year
zhishuifeiqian / VCR-Bench
VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning
☆35 · Jul 15, 2025 · Updated 7 months ago
Lzq5 / UniTime
Universal Video Temporal Grounding with Generative Multi-modal Large Language Models
☆46 · Nov 25, 2025 · Updated 3 months ago
marinero4972 / CyberV
☆18 · Jun 10, 2025 · Updated 8 months ago
Zhao-Jianing-SUDA / Hawkeye
The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…
☆12 · Oct 14, 2024 · Updated last year
MoonshotAI / WorldVQA
☆105 · Feb 4, 2026 · Updated 3 weeks ago
xuyang-liu16 / GlobalCom2
[AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models
☆38 · Jan 27, 2026 · Updated last month
AIGeeksGroup / StereoAdapter
[ICRA 2026] StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes
☆20 · Feb 17, 2026 · Updated last week
TencentARC / DSR_Suite
☆65 · Feb 23, 2026 · Updated last week
gabfstr / DiffusionTrack
Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking
☆13 · Apr 12, 2023 · Updated 2 years ago
MiniMax-AI / MiniMax-Provider-Verifier
MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the MiniMax M2 model are co…
☆28 · Feb 18, 2026 · Updated last week
VoyageWang / VG-Refiner
The repository of VG-Refiner paper
☆17 · Dec 9, 2025 · Updated 2 months ago
HumanMLLM / ViSpeak
(ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"
☆45 · Jul 1, 2025 · Updated 8 months ago
yaolinli / TimeChat-Online
[ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
☆115 · Dec 12, 2025 · Updated 2 months ago
schowdhury671 / meerkat
☆36 · Jul 9, 2025 · Updated 7 months ago
mlvlab / MELTR
MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)
☆35 · Apr 23, 2024 · Updated last year
vllm-project / vllm-neuron
Community maintained hardware plugin for vLLM on AWS Neuron
☆23 · Updated this week
senorfy / Kinect
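Uses Kinect 2.0 to read depth and other image information and segment out the hand image. HOG is used to extract features from the hand image, which are then used to train an SVM. The goal is gesture recognition.
☆10 · Jan 8, 2020 · Updated 6 years ago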
mbzuai-oryx / TrackingMeetsLMM
☆10 · Apr 7, 2025 · Updated 10 months ago
josephzpng / DisTime
DisTime: Distribution-based Time Representation for Video Large Language Models.
☆18 · Jul 10, 2025 · Updated 7 months ago
snumprlab / hima
Official Implementation of HIMA (COLM'25)
☆19 · Nov 25, 2025 · Updated 3 months ago
aabdelfattah / alhaitham-hardware
Gesture Recognition Based on ALTERA DE2-115 FPGA
☆10 · Mar 18, 2014 · Updated 11 years ago