zhishuifeiqian / VCR-Bench
VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning
☆35Jul 15, 2025Updated 7 months ago
Alternatives and similar repositories for VCR-Bench
Users who are interested in VCR-Bench are comparing it to the repositories listed below.
- [AAAI 2024] Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-Supervised 3D Object Detection☆10Jan 24, 2025Updated last year
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆21Feb 27, 2025Updated 11 months ago
- [ICLR 2025] PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection☆22Sep 16, 2025Updated 5 months ago
- ☆132Mar 22, 2025Updated 10 months ago
- [ICCV2023] DETRDistill: A Universal Knowledge Distillation Framework for DETR-families☆65Nov 3, 2023Updated 2 years ago
- A Light, Concise and Powerful Hexo's theme☆11Jul 15, 2022Updated 3 years ago
- ☆47Apr 9, 2025Updated 10 months ago
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆54Mar 9, 2025Updated 11 months ago
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261☆13Aug 22, 2021Updated 4 years ago
- ☆13Jul 10, 2024Updated last year
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated 3 weeks ago
- ☆12Jan 10, 2025Updated last year
- [NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"☆203Sep 26, 2024Updated last year
- ☆155Oct 31, 2024Updated last year
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- ☆32Jul 29, 2024Updated last year
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding☆66Jun 10, 2025Updated 8 months ago
- CS194-196 Course Project☆14Feb 20, 2025Updated 11 months ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆17Feb 13, 2025Updated last year
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"☆34Jul 12, 2024Updated last year
- Data and Code for CVPR 2025 paper "MMVU: Measuring Expert-Level Multi-Discipline Video Understanding"☆77Feb 28, 2025Updated 11 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Apr 14, 2025Updated 10 months ago
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Jul 1, 2024Updated last year
- The official pytorch implementation of Exploring the User Guidance for More Accurate Building Segmentation from High-Resolution Remote Se…☆18May 27, 2024Updated last year
- [ICLR 2024] ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation☆71Apr 25, 2024Updated last year
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆34Jun 12, 2025Updated 8 months ago
- J-BHI 2024: Exploiting Hierarchical Interactions for Protein Surface Learning☆17Jan 21, 2024Updated 2 years ago
- [ICCV2023] NoiseDet: Learning from Noisy Data for Semi-Supervised 3D Object Detection☆21Feb 5, 2023Updated 3 years ago
- ☆16Jul 23, 2024Updated last year
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…☆25Jun 4, 2025Updated 8 months ago
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆31Dec 13, 2025Updated 2 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆56Nov 5, 2025Updated 3 months ago
- ☆48Jan 13, 2026Updated last month
- Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers" [ICCV 2025]☆100Jul 28, 2025Updated 6 months ago
- [LLaVA-Video-R1]✨First Adaptation of R1 to LLaVA-Video (2025-03-18)☆68May 9, 2025Updated 9 months ago
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Nov 15, 2025Updated 3 months ago
- [ACM MM2022, TIP2024] Graph-DETR Series for Multi-View 3D Object Detection☆43Oct 11, 2023Updated 2 years ago
- ☆97Jun 23, 2025Updated 7 months ago
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated last year