GraphPKU / number_cookbookLinks
Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.
☆17Updated 3 months ago
Alternatives and similar repositories for number_cookbook
Users that are interested in number_cookbook are comparing it to the libraries listed below
Sorting:
- ☆47Updated 5 months ago
- ☆318Updated last month
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆74Updated 3 months ago
- Reinforcing General Reasoning without Verifiers☆71Updated 3 weeks ago
- A Sober Look at Language Model Reasoning☆77Updated last month
- ☆122Updated last month
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆85Updated 4 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆112Updated 3 months ago
- ☆113Updated 4 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆130Updated this week
- [ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"☆30Updated 2 weeks ago
- Process Reward Models That Think☆45Updated last week
- A repo for open research on building large reasoning models☆71Updated this week
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆64Updated last month
- Official implementation of MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems☆43Updated 3 weeks ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆67Updated 2 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆46Updated 2 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆103Updated last week
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆144Updated this week
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆49Updated 8 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆137Updated last week
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆85Updated 6 months ago
- ☆30Updated 3 months ago
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆152Updated last week
- ☆33Updated 6 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆101Updated last month
- ☆210Updated 4 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆45Updated last month
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆98Updated 4 months ago
- ☆26Updated 3 months ago