WecoAI / weco-cliLinks
The Platform for Self-Improving Code. Ideal for GPU kernels, ML model development, feature engineering, prompt engineering, and other optimizable code.
☆26Updated this week
Alternatives and similar repositories for weco-cli
Users that are interested in weco-cli are comparing it to the libraries listed below
Sorting:
- ☆121Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 9 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆65Updated 10 months ago
- The history files when recording human interaction while solving ARC tasks☆118Updated this week
- ☆108Updated last week
- Draw more samples☆195Updated last year
- Open-source release accompanying Gao et al. 2025☆218Updated this week
- Inference-time scaling for LLMs-as-a-judge.☆316Updated last month
- ☆136Updated 8 months ago
- Training-Ready RL Environments + Evals☆190Updated this week
- SoTA Approach for ARC-AGI 2☆151Updated 2 months ago
- A reading list of relevant papers and projects on foundation model annotation☆28Updated 9 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆280Updated last month
- Learning Universal Predictors☆81Updated last year
- 🧬 The Huxley-Gödel Machine☆307Updated 2 weeks ago
- ☆67Updated 5 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆143Updated 8 months ago
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management☆75Updated 11 months ago
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆115Updated this week
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆120Updated 2 months ago
- Collection of LLM completions for reasoning-gym task datasets☆30Updated 5 months ago
- ☆14Updated last year
- This repository explains and provides examples for "concept anchoring" in GPT4.☆71Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆106Updated 2 weeks ago
- ☆59Updated 10 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆121Updated last month
- Open source interpretability artefacts for R1.☆164Updated 7 months ago
- Benchmarking Goal-Oriented Software Engineering☆60Updated this week
- ☆234Updated 5 months ago
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago