oripress / AlgoTuneLinks
AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is write code that solves each problem, and is faster than existing implementations.
☆66Updated this week
Alternatives and similar repositories for AlgoTune
Users that are interested in AlgoTune are comparing it to the libraries listed below
Sorting:
- ☆33Updated 10 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆84Updated last year
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆53Updated 3 weeks ago
- ☆51Updated 7 months ago
- RLP: Reinforcement as a Pretraining Objective☆198Updated last month
- ☆32Updated 7 months ago
- Reinforcing General Reasoning without Verifiers☆91Updated 4 months ago
- ☆31Updated 5 months ago
- ☆19Updated 7 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆239Updated last week
- Open source interpretability artefacts for R1.☆163Updated 6 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆111Updated last month
- ☆154Updated 2 months ago
- A MAD laboratory to improve AI architecture designs 🧪☆132Updated 10 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆122Updated 7 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆23Updated 8 months ago
- 📄Small Batch Size Training for Language Models☆63Updated last month
- The official github repo for "Diffusion Language Models are Super Data Learners".☆145Updated this week
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆56Updated 3 months ago
- Simple repository for training small reasoning models☆44Updated 9 months ago
- ☆86Updated last year
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆188Updated 8 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆103Updated last week
- [ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models☆81Updated 3 months ago
- nanoGPT-like codebase for LLM training☆110Updated this week
- Fluid Language Model Benchmarking☆19Updated last month
- ☆38Updated last year
- EvaByte: Efficient Byte-level Language Models at Scale☆110Updated 6 months ago
- AIRA-dojo: a framework for developing and evaluating AI research agents☆106Updated last month
- moodist☆22Updated last month