DripNowhy / SherlockLinks
Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"
☆22Updated 3 months ago
Alternatives and similar repositories for Sherlock
Users that are interested in Sherlock are comparing it to the libraries listed below
Sorting:
- ☆19Updated 3 months ago
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆18Updated 3 months ago
- Fast-Slow Thinking for Large Vision-Language Model Reasoning☆17Updated 4 months ago
- Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs☆40Updated 2 months ago
- ☆214Updated 2 weeks ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 7 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆79Updated last year
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆40Updated 2 weeks ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆36Updated 2 months ago
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆85Updated 2 weeks ago
- [Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Mul…☆29Updated last month
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)☆72Updated 2 months ago
- ☆18Updated 8 months ago
- MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision☆25Updated 3 months ago
- ☆45Updated 8 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆47Updated last month
- Official Repository of LatentSeek☆60Updated 2 months ago
- ☆43Updated 9 months ago
- A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.☆45Updated last week
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆25Updated 3 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆74Updated 3 weeks ago
- [ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models☆33Updated last month
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"☆142Updated 2 months ago
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆33Updated 3 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆34Updated 2 months ago
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆85Updated 11 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆70Updated last year
- ☆29Updated 2 months ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"☆81Updated last week
- Official implementation of "Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology"☆60Updated last month