Episoode / Double-Bench
Official Code Repository for Double-Bench
☆22 · Updated 2 weeks ago
Alternatives and similar repositories for Double-Bench
Users who are interested in Double-Bench are comparing it to the repositories listed below.
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆36 · Updated 8 months ago
- ☆50 · Updated 2 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆47 · Updated 2 weeks ago
- [Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Mul… ☆30 · Updated 2 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? ☆36 · Updated 3 months ago
- [NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method ☆68 · Updated this week
- SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis ☆67 · Updated 2 months ago
- Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning… ☆40 · Updated 2 months ago
- Geometric-Mean Policy Optimization ☆84 · Updated last week
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains ☆49 · Updated 5 months ago
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025) ☆76 · Updated 3 weeks ago
- JudgeLRM: Large Reasoning Models as a Judge ☆39 · Updated last month
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i… ☆42 · Updated 3 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM" ☆16 · Updated 2 months ago
- A Self-Training Framework for Vision-Language Reasoning ☆86 · Updated 8 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models ☆140 · Updated 3 months ago
- ☆17 · Updated 9 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆58 · Updated 7 months ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene… ☆40 · Updated 5 months ago
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost ☆92 · Updated 3 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models" ☆37 · Updated 3 months ago
- ☆38 · Updated 2 months ago
- ☆31 · Updated 3 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?" ☆59 · Updated last year
- ☆36 · Updated last week
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25] ☆157 · Updated 4 months ago
- ☆26 · Updated 3 weeks ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning ☆65 · Updated 4 months ago
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin… ☆43 · Updated last month
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository ☆71 · Updated last week