lyang36 / IMO25
An AI agent system for solving International Mathematical Olympiad (IMO) problems using Google's Gemini, OpenAI, and XAI APIs.
☆756 · Updated last month
Alternatives and similar repositories for IMO25
Users who are interested in IMO25 are comparing it to the repositories listed below.
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs" ☆528 · Updated 2 months ago
- ☆476 · Updated 2 months ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆596 · Updated 6 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction ☆548 · Updated 4 months ago
- Releases from OpenAI Preparedness ☆860 · Updated 3 weeks ago
- ☆1,233 · Updated last week
- Technical report of Kimina-Prover Preview. ☆327 · Updated 2 months ago
- ☆803 · Updated 3 weeks ago
- Atom of Thoughts for Markov LLM Test-Time Scaling ☆586 · Updated 3 months ago
- ☆428 · Updated 3 weeks ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆549 · Updated 3 months ago
- Prompt-to-Leaderboard ☆254 · Updated 4 months ago
- Repository for Zochi's Research ☆267 · Updated 3 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆340 · Updated 2 months ago
- Code for the paper: "Learning to Reason without External Rewards" ☆354 · Updated 2 months ago
- ☆293 · Updated this week
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ☆542 · Updated last month
- ☆466 · Updated last year
- Understanding R1-Zero-Like Training: A Critical Perspective ☆1,085 · Updated 3 weeks ago
- Testing baseline LLMs performance across various models ☆309 · Updated last month
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow. ☆689 · Updated last month
- Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving ☆358 · Updated last month
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling ☆443 · Updated 4 months ago
- Tina: Tiny Reasoning Models via LoRA ☆282 · Updated last month
- Scaling Data for SWE-agents ☆403 · Updated this week
- SkyRL: A Modular Full-stack RL Library for LLMs ☆862 · Updated last week
- ☆226 · Updated 3 months ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning ☆252 · Updated 3 months ago
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen… ☆342 · Updated last week
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆557 · Updated last month