GraphPKU / number_cookbookLinks
Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.
☆17Updated 5 months ago
Alternatives and similar repositories for number_cookbook
Users that are interested in number_cookbook are comparing it to the libraries listed below
Sorting:
- A Sober Look at Language Model Reasoning☆81Updated 2 months ago
- ☆328Updated last month
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆69Updated 4 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆77Updated 5 months ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆167Updated 2 months ago
- Repo of paper "Free Process Rewards without Process Labels"☆162Updated 5 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆86Updated 6 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆140Updated last week
- Reinforcing General Reasoning without Verifiers☆80Updated 2 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆101Updated 3 weeks ago
- Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"☆214Updated 2 weeks ago
- Code for "Reasoning to Learn from Latent Thoughts"☆116Updated 5 months ago
- The code implementation of Symbolic-MoE☆38Updated 5 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated 9 months ago
- ☆128Updated 2 weeks ago
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"☆105Updated this week
- official implementation of paper "Process Reward Model with Q-value Rankings"☆60Updated 6 months ago
- Discriminative Constrained Optimization for Reinforcing Large Reasoning Models☆36Updated last week
- [ICLR 2025] This is the official implementation for the paper: "Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluat…☆32Updated 2 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆95Updated 8 months ago
- ☆47Updated 6 months ago
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)☆29Updated last month
- A repo for open research on building large reasoning models☆92Updated last week
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆167Updated last month
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆163Updated last month
- JudgeLRM: Large Reasoning Models as a Judge☆35Updated 4 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆52Updated 3 months ago
- ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆109Updated last week
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆122Updated 10 months ago
- Official Repository of LatentSeek☆60Updated 2 months ago