Jiahao004 / DeepTheoremLinks
☆24Updated 6 months ago
Alternatives and similar repositories for DeepTheorem
Users that are interested in DeepTheorem are comparing it to the libraries listed below
Sorting:
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆30Updated 2 years ago
- Exploration of automated dataset selection approaches at large scales.☆53Updated 10 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Updated last year
- Codebase for Instruction Following without Instruction Tuning☆36Updated last year
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Updated 5 months ago
- ☆25Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆62Updated last year
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆48Updated 6 months ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆85Updated 7 months ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆19Updated 6 months ago
- ☆20Updated 9 months ago
- Directional Preference Alignment☆58Updated last year
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆56Updated 11 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆52Updated 7 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆112Updated 11 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆46Updated 8 months ago
- ☆30Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆41Updated last week
- ☆110Updated 8 months ago
- Replicating O1 inference-time scaling laws☆91Updated last year
- ☆15Updated last year
- ☆107Updated last year
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆145Updated 3 months ago
- ☆71Updated last year
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆70Updated 10 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆120Updated 8 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Updated 8 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Updated last year
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆130Updated 2 months ago