commit-0 / commit0Links
Commit0: Library Generation from Scratch
☆162Updated 4 months ago
Alternatives and similar repositories for commit0
Users that are interested in commit0 are comparing it to the libraries listed below
Sorting:
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆182Updated 6 months ago
- r2e: turn any github repository into a programming agent environment☆129Updated 5 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆117Updated 10 months ago
- Long context evaluation for large language models☆221Updated 6 months ago
- ☆55Updated 7 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 10 months ago
- Evaluation of LLMs on latest math competitions☆164Updated this week
- Evaluating LLMs with CommonGen-Lite☆91Updated last year
- ☆81Updated 2 weeks ago
- Evaluating LLMs with fewer examples☆161Updated last year
- Train your own SOTA deductive reasoning model☆106Updated 6 months ago
- ☆133Updated 6 months ago
- ☆72Updated last year
- ☆111Updated 3 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 8 months ago
- Storing long contexts in tiny caches with self-study☆181Updated this week
- SWE Arena☆34Updated 2 months ago
- Functional Benchmarks and the Reasoning Gap☆88Updated 11 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- ☆99Updated 8 months ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆140Updated last week
- ⚖️ Awesome LLM Judges ⚖️☆127Updated 4 months ago
- Utilities for efficient fine-tuning, inference and evaluation of code generation models☆21Updated last year
- LILO: Library Induction with Language Observations☆88Updated last year
- ☆122Updated 6 months ago
- ☆101Updated this week
- accompanying material for sleep-time compute paper☆111Updated 4 months ago
- ☆54Updated last year
- Website for hosting the Open Foundation Models Cheat Sheet.☆268Updated 4 months ago
- Score LLM pretraining data with classifiers☆55Updated last year