General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆228 · Nov 27, 2025 · Updated 7 months ago
Alternatives and similar repositories for General-Reasoner
Users that are interested in General-Reasoner are comparing it to the libraries listed below.
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL). ☆51 · Mar 31, 2026 · Updated 3 months ago
- A series of technical report on Slow Thinking with LLM ☆766 · Aug 13, 2025 · Updated 10 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains ☆49 · Feb 4, 2026 · Updated 4 months ago
- ☆336 · May 31, 2025 · Updated last year
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆32 · Mar 1, 2025 · Updated last year
- VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26] ☆17 · Jun 1, 2026 · Updated last month
- Official Repo for Open-Reasoner-Zero ☆2,095 · Jun 2, 2025 · Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks". ☆230 · Jun 20, 2026 · Updated last week
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations ☆148 · Nov 13, 2025 · Updated 7 months ago
- Scaling RL on advanced reasoning models ☆688 · Oct 20, 2025 · Updated 8 months ago
- [TMLR] Process Reward Models That Think ☆89 · Nov 29, 2025 · Updated 7 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25] ☆101 · Apr 9, 2025 · Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment" ☆75 · May 20, 2025 · Updated last year
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning ☆293 · Sep 25, 2025 · Updated 9 months ago
- Async pipelined version of Verl ☆124 · Apr 8, 2025 · Updated last year
- Technical report of Kimina-Prover Preview. ☆371 · Jul 10, 2025 · Updated 11 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models ☆74 · Feb 25, 2025 · Updated last year
- A version of verl to support diverse tool use [TMLR 2026] ☆1,008 · Jun 8, 2026 · Updated 3 weeks ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones ☆68 · Jan 26, 2026 · Updated 5 months ago
- Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient ☆67 · Aug 3, 2025 · Updated 10 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25] ☆190 · Jun 5, 2025 · Updated last year
- An Ultra-Long Output Reinforcement Learning Approach ☆23 · Jul 31, 2025 · Updated 11 months ago
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning ☆55 · Oct 23, 2025 · Updated 8 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆5,017 · Nov 13, 2025 · Updated 7 months ago
- ☆16 · Sep 4, 2025 · Updated 9 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective ☆1,262 · Aug 27, 2025 · Updated 10 months ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization" ☆24 · May 6, 2026 · Updated last month
- instruction-following benchmark for large reasoning models ☆48 · Apr 19, 2026 · Updated 2 months ago
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25] ☆40 · Feb 1, 2026 · Updated 5 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024] ☆22 · Jan 8, 2025 · Updated last year
- ☆17 · Aug 1, 2025 · Updated 11 months ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners ☆745 · Jun 6, 2025 · Updated last year
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆100 · Nov 8, 2025 · Updated 7 months ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆204 · Sep 13, 2025 · Updated 9 months ago
- Simple RL training for reasoning ☆3,871 · Dec 23, 2025 · Updated 6 months ago
- Democratizing Reinforcement Learning for LLMs ☆5,649 · Updated this week
- ☆47 · Jun 24, 2025 · Updated last year
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining" ☆29 · Oct 14, 2025 · Updated 8 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning ☆168 · Sep 19, 2025 · Updated 9 months ago