aw31 / openai-imo-2025-proofs
☆475 Updated last month
Alternatives and similar repositories for openai-imo-2025-proofs
Users who are interested in openai-imo-2025-proofs are comparing it to the repositories listed below
- Testing baseline LLM performance across various models ☆305 Updated 3 weeks ago
- Decentralized RL Training at Scale ☆546 Updated this week
- ☆224 Updated 2 months ago
- ☆448 Updated 3 months ago
- Open source interpretability artefacts for R1. ☆157 Updated 4 months ago
- open source interpretability platform 🧠 ☆375 Updated this week
- ☆282 Updated last month
- ☆465 Updated last year
- rl from zero pretrain, can it be done? yes. ☆261 Updated 2 weeks ago
- Technical report of Kimina-Prover Preview. ☆324 Updated last month
- ☆180 Updated 2 weeks ago
- Evaluation of LLMs on latest math competitions ☆160 Updated 3 weeks ago
- ☆198 Updated 5 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆328 Updated 9 months ago
- Long context evaluation for large language models ☆219 Updated 6 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ☆531 Updated last month
- Our solution for the arc challenge 2024 ☆174 Updated 2 months ago
- procedural reasoning datasets ☆1,069 Updated last week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse … ☆651 Updated last week
- ☆410 Updated 2 months ago
- Simple & Scalable Pretraining for Neural Architecture Research ☆290 Updated 2 weeks ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs" ☆519 Updated last month
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆549 Updated 3 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆346 Updated 8 months ago
- Scaling Data for SWE-agents ☆386 Updated last week
- ☆387 Updated this week
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning ☆251 Updated last week
- Releases from OpenAI Preparedness ☆854 Updated last week
- ☆119 Updated 8 months ago
- A collection of formalized statements of conjectures in Lean. ☆607 Updated this week