Episoode / Double-BenchLinks
☆18Updated 3 weeks ago
Alternatives and similar repositories for Double-Bench
Users that are interested in Double-Bench are comparing it to the libraries listed below
Sorting:
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆80Updated 6 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆136Updated last month
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆80Updated 2 weeks ago
- Doodling our way to AGI ✏️ 🖼️ 🧠☆94Updated 3 months ago
- EMPO, A Fully Unsupervised RLVR Method☆65Updated last week
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆99Updated 8 months ago
- MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision☆25Updated 3 months ago
- Survey on Data-centric Large Language Models☆84Updated last year
- More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆48Updated 3 months ago
- ☆29Updated 2 months ago
- ☆20Updated 3 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆48Updated 5 months ago
- (ArXiv25) Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning☆56Updated last month
- 关于LLM和Multimodal LLM的paper list☆43Updated last week
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆76Updated 8 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆68Updated 9 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆145Updated last month
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆84Updated 6 months ago
- 🔥 【Meta Awesome List】: AI/ML Research Hub - Solving the "Chasing Hot Topics" Problem for AI Researchers. 🤖 Agent-driven intelligence au…☆42Updated this week
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆121Updated last month
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)☆72Updated 3 months ago
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training☆12Updated 6 months ago
- ☆163Updated 3 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆38Updated last month
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆85Updated 3 weeks ago
- Co-Reward: Self-supervised RL for LLM Reasoning via Contrastive Agreement☆27Updated 2 weeks ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆79Updated 2 months ago
- ☆50Updated last month
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆314Updated last month
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models☆31Updated 7 months ago