General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆224Nov 27, 2025Updated 4 months ago
Alternatives and similar repositories for General-Reasoner
Users that are interested in General-Reasoner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆48Mar 31, 2026Updated last week
- A series of technical report on Slow Thinking with LLM☆764Aug 13, 2025Updated 7 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 2 months ago
- ☆334May 31, 2025Updated 10 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆29Mar 1, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 6 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆225Jun 24, 2025Updated 9 months ago
- Official Repo for Open-Reasoner-Zero☆2,089Jun 2, 2025Updated 10 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆147Nov 13, 2025Updated 4 months ago
- [TMLR] Process Reward Models That Think☆84Nov 29, 2025Updated 4 months ago
- Scaling RL on advanced reasoning models☆677Oct 20, 2025Updated 5 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆100Apr 9, 2025Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75May 20, 2025Updated 10 months ago
- Async pipelined version of Verl☆124Apr 8, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆291Sep 25, 2025Updated 6 months ago
- Technical report of Kimina-Prover Preview.☆366Jul 10, 2025Updated 9 months ago
- A version of verl to support diverse tool use☆947Mar 2, 2026Updated last month
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆73Feb 25, 2025Updated last year
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆65Jan 26, 2026Updated 2 months ago
- Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient☆66Aug 3, 2025Updated 8 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆187Jun 5, 2025Updated 10 months ago
- An Ultra-Long Output Reinforcement Learning Approach☆23Jul 31, 2025Updated 8 months ago
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning☆53Oct 23, 2025Updated 5 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆16Sep 4, 2025Updated 7 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,241Aug 27, 2025Updated 7 months ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"☆22Nov 9, 2025Updated 5 months ago
- instruction-following benchmark for large reasoning models☆44Aug 9, 2025Updated 8 months ago
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25]☆39Feb 1, 2026Updated 2 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆21Jan 8, 2025Updated last year
- ☆17Aug 1, 2025Updated 8 months ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆744Jun 6, 2025Updated 10 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆95Nov 8, 2025Updated 5 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆197Sep 13, 2025Updated 6 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,422Nov 13, 2025Updated 4 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆163Sep 19, 2025Updated 6 months ago
- ☆46Jun 24, 2025Updated 9 months ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆27Oct 14, 2025Updated 5 months ago
- [ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping☆63May 22, 2025Updated 10 months ago
- Scalable RL solution for advanced reasoning of language models☆1,841Mar 18, 2025Updated last year