commit-0 / commit0Links
Commit0: Library Generation from Scratch
☆160Updated 2 months ago
Alternatives and similar repositories for commit0
Users that are interested in commit0 are comparing it to the libraries listed below
Sorting:
- r2e: turn any github repository into a programming agent environment☆129Updated 3 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆175Updated 4 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆113Updated 9 months ago
- ☆99Updated 2 months ago
- Evaluation of LLMs on latest math competitions☆155Updated 2 weeks ago
- ☆41Updated 6 months ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆134Updated last week
- ☆130Updated 4 months ago
- Train your own SOTA deductive reasoning model☆103Updated 5 months ago
- Functional Benchmarks and the Reasoning Gap☆88Updated 10 months ago
- Long context evaluation for large language models☆220Updated 5 months ago
- Scaling Data for SWE-agents☆328Updated this week
- SWE Arena☆33Updated last month
- Evaluating LLMs with fewer examples☆160Updated last year
- ☆108Updated 2 months ago
- accompanying material for sleep-time compute paper☆99Updated 3 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆52Updated 3 weeks ago
- ☆118Updated 5 months ago
- A benchmark that challenges language models to code solutions for scientific problems☆127Updated last week
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆131Updated last year
- Open source interpretability artefacts for R1.☆157Updated 3 months ago
- ☆136Updated 4 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 6 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆95Updated 2 weeks ago
- ⚖️ Awesome LLM Judges ⚖️☆108Updated 3 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆49Updated 9 months ago
- ☆95Updated 3 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆68Updated 3 months ago
- Storing long contexts in tiny caches with self-study☆121Updated last week