openai / openai-icpc-2025
OpenAI 2025 ICPC Submissions
☆55 Updated last month
Alternatives and similar repositories for openai-icpc-2025
Users who are interested in openai-icpc-2025 are comparing it to the repositories listed below.
- ☆146 Updated 3 weeks ago
- ☆302 Updated last month
- Technical report of Kimina-Prover Preview. ☆340 Updated 4 months ago
- Large language models designed for formal theorem proving through tool-integrated reasoning. ☆29 Updated 2 months ago
- ☆477 Updated 3 months ago
- ☆211 Updated 7 months ago
- Evaluation of LLMs on latest math competitions ☆178 Updated 3 weeks ago
- ☆72 Updated 3 months ago
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management ☆71 Updated 9 months ago
- ☆122 Updated 2 months ago
- ☆44 Updated 3 months ago
- LeanEuclid is a benchmark for autoformalization in the domain of Euclidean geometry, targeting the proof assistant Lean. ☆112 Updated 6 months ago
- ☆37 Updated 2 months ago
- ☆42 Updated last year
- Retrieval-Augmented Theorem Provers for Lean ☆299 Updated 9 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents ☆442 Updated this week
- Solving Inequality Proofs with Large Language Models. ☆52 Updated last week
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live! ☆131 Updated last week
- SWE Arena ☆35 Updated 4 months ago
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models ☆132 Updated 2 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆111 Updated last month
- [NeurIPS 2025 Spotlight] ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning ☆131 Updated last month
- BigOBench assesses the capacity of Large Language Models (LLMs) to comprehend time-space computational complexity of input or generated c… ☆38 Updated 6 months ago
- An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language. ☆168 Updated 2 weeks ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset. ☆18 Updated 6 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning" ☆239 Updated last week
- LLMs + Lean, on your laptop or in the cloud ☆192 Updated 3 weeks ago
- This is the official repository for all the code of TheoremLlama ☆46 Updated 3 months ago
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508) ☆53 Updated 3 weeks ago
- [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline. ☆56 Updated 3 months ago