oripress / AlgoTuneLinks
AlgoTune is a benchmark made up of 155 math, physics, and computer science problems. The goal is write code that solves each problem, and is faster than existing implementations.
☆28Updated this week
Alternatives and similar repositories for AlgoTune
Users that are interested in AlgoTune are comparing it to the libraries listed below
Sorting:
- Efficient Scaling laws and collaborative pretraining.☆16Updated 5 months ago
- ☆34Updated 3 weeks ago
- Official Code Release for "Training a Generally Curious Agent"☆28Updated 2 months ago
- Simple repository for training small reasoning models☆33Updated 5 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Updated 5 months ago
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆35Updated this week
- ☆33Updated 6 months ago
- Learn online intrinsic rewards from LLM feedback☆41Updated 7 months ago
- Lottery Ticket Adaptation☆39Updated 7 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆19Updated 4 months ago
- Code for minimum-entropy coupling.☆32Updated last year
- ☆13Updated last year
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆89Updated 2 weeks ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆29Updated last year
- Jax like function transformation engine but micro, microjax☆33Updated 8 months ago
- Stochastic Parameter Decomposition☆27Updated this week
- ☆22Updated last month
- ☆20Updated last year
- ☆20Updated 3 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆24Updated 4 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆60Updated 4 months ago
- ☆29Updated 2 years ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆50Updated last week
- Code for☆27Updated 7 months ago
- Generative cellular automaton-like learning environments for RL.☆19Updated 5 months ago
- implementation of dualformer☆18Updated 4 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 8 months ago
- Measuring General Intelligence With Generated Games (Preprint)☆25Updated last month
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated 3 months ago
- ☆23Updated last month