oripress / AlgoTune
AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is to write code that solves each problem and runs faster than existing implementations.
☆63Updated last week
Alternatives and similar repositories for AlgoTune
Users who are interested in AlgoTune are comparing it to the repositories listed below.
- ☆33Updated 9 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆84Updated 11 months ago
- RLP: Reinforcement as a Pretraining Objective☆182Updated 2 weeks ago
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆51Updated last week
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆101Updated last week
- Open source interpretability artefacts for R1.☆161Updated 5 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆55Updated 3 months ago
- Evaluation of LLMs on latest math competitions☆171Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆186Updated 7 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆147Updated 2 weeks ago
- Open Source Replication of Anthropic's Alignment Faking Paper☆52Updated 6 months ago
- ☆193Updated 2 months ago
- ☆51Updated 7 months ago
- Fluid Language Model Benchmarking☆18Updated last month
- EvaByte: Efficient Byte-level Language Models at Scale☆109Updated 5 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆156Updated last month
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆187Updated last week
- 📄Small Batch Size Training for Language Models☆63Updated 2 weeks ago
- Simple repository for training small reasoning models☆40Updated 8 months ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆134Updated 2 weeks ago
- Reinforcing General Reasoning without Verifiers☆90Updated 3 months ago
- Official repo of paper LM2☆46Updated 8 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆101Updated last month
- Official Code Release for "Training a Generally Curious Agent"☆34Updated 5 months ago
- Physics of Language Models, Part 4☆248Updated 2 months ago
- ☆30Updated 5 months ago
- ☆142Updated last month
- ☆32Updated 6 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆77Updated 7 months ago
- AIRA-dojo: a framework for developing and evaluating AI research agents☆101Updated 3 weeks ago