aw31 / openai-imo-2025-proofsLinks

☆478

Alternatives and similar repositories for openai-imo-2025-proofs

Users that are interested in openai-imo-2025-proofs are comparing it to the libraries listed below

Sorting:

arcprize / arc-agi-benchmarking
Testing baseline LLMs performance across various models
☆322Updated 2 weeks ago
google-deepmind / alphaevolve_results
☆241Updated 5 months ago
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆163Updated 7 months ago
ByteDance-Seed / Seed-Prover
☆308Updated 2 months ago
arcprize / ARC-AGI-2
☆544Updated 6 months ago
MoonshotAI / Kimina-Prover-Preview
Technical report of Kimina-Prover Preview.
☆346Updated 4 months ago
eth-sri / matharena
Evaluation of LLMs on latest math competitions
☆193Updated last month
anthropic-experimental / agentic-misalignment
☆527Updated 5 months ago
math-inc / strongpnt
☆262Updated 2 months ago
da-fr / arc-prize-2024
Our solution for the arc challenge 2024
☆185Updated 5 months ago
SakanaAI / ShinkaEvolve
ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution
☆684Updated last week
PrimeIntellect-ai / prime-rl
Async RL Training at Scale
☆867Updated this week
google-deepmind / alphaevolve_repository_of_problems
☆150Updated last week
LeonGuertler / TextArena
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
☆316Updated last month
iliao2345 / CompressARC
☆201Updated 3 months ago
thinking-machines-lab / tinker
Training API and CLI
☆238Updated this week
jerber / lang-jepa
☆128Updated 11 months ago
project-numina / aimo-progress-prize
☆474Updated last year
open-thought / reasoning-gym
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
☆1,233Updated 2 weeks ago
Goedel-LM / Goedel-Prover
☆214Updated 7 months ago
lyang36 / IMO25
An AI agent system for solving International Mathematical Olympiad (IMO) problems using Google's Gemini, OpenAI, and XAI APIs.
☆867Updated last month
marin-community / marin
Open-source framework for the research and development of foundation models.
☆640Updated this week
hijohnnylin / neuronpedia
open source interpretability platform 🧠
☆509Updated this week
McGill-NLP / nano-aha-moment
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
☆562Updated last month
magicproduct / hash-hop
Long context evaluation for large language models
☆224Updated 8 months ago
thinking-machines-lab / batch_invariant_ops
☆912Updated 3 weeks ago
google-deepmind / formal-conjectures
A collection of formalized statements of conjectures in Lean.
☆689Updated this week
SWE-Gym / SWE-Gym
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆579Updated 4 months ago
ekinakyurek / marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
☆340Updated 2 weeks ago
SakanaAI / RLT
Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.
☆349Updated 5 months ago