project-numina / aimo-progress-prizeLinks
☆439Updated 10 months ago
Alternatives and similar repositories for aimo-progress-prize
Users that are interested in aimo-progress-prize are comparing it to the libraries listed below
Sorting:
- ☆562Updated last month
- Technical report of Kimina-Prover Preview.☆285Updated 3 weeks ago
- ☆744Updated last month
- A project to improve skills of large language models☆415Updated this week
- [MathCoder, MathCoder-VL] Family of LLMs/LMMs for mathematical reasoning.☆289Updated 2 weeks ago
- ☆522Updated 9 months ago
- SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning☆375Updated this week
- [NeurlPS D&B 2024] Generative AI for Math: MathPile☆412Updated 2 months ago
- RewardBench: the first evaluation tool for reward models.☆590Updated this week
- ☆1,024Updated 5 months ago
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training☆272Updated last year
- ☆330Updated 3 months ago
- A bibliography and survey of the papers surrounding o1☆1,194Updated 6 months ago
- Large Reasoning Models☆803Updated 6 months ago
- Retrieval-Augmented Theorem Provers for Lean☆272Updated 4 months ago
- ☆180Updated 2 months ago
- Tina: Tiny Reasoning Models via LoRA☆252Updated last week
- Understanding R1-Zero-Like Training: A Critical Perspective☆973Updated 2 weeks ago
- ☆75Updated 10 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆232Updated 3 weeks ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆535Updated 2 months ago
- ☆153Updated last year
- Recipes to scale inference-time compute of open models☆1,090Updated 2 weeks ago
- Automatic evals for LLMs☆407Updated this week
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆220Updated last year
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆464Updated last week
- ☆295Updated this week
- procedural reasoning datasets☆770Updated this week
- ☆936Updated 4 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆333Updated 5 months ago