SWE-bench / SWE-smith
Scaling Data for SWE-agents
☆212 Updated this week
Alternatives and similar repositories for SWE-smith
Users who are interested in SWE-smith are comparing it to the libraries listed below.
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆173 Updated 2 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ☆477 Updated 3 weeks ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more. ☆204 Updated last week
- A benchmark for LLMs on complicated tasks in the terminal ☆134 Updated this week
- SWE Arena ☆33 Updated last month
- SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning ☆343 Updated last week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks ☆207 Updated 3 weeks ago
- RepoQA: Evaluating Long-Context Code Understanding ☆108 Updated 7 months ago
- Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents ☆73 Updated last month
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆175 Updated this week
- ☆126 Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆171 Updated 4 months ago
- AWM: Agent Workflow Memory ☆271 Updated 4 months ago
- ☆92 Updated 3 weeks ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory ☆61 Updated last week
- Simple extension on vLLM to help you speed up reasoning model without training. ☆152 Updated this week
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap… ☆206 Updated 3 weeks ago
- Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement ☆92 Updated 3 months ago
- accompanying material for sleep-time compute paper ☆90 Updated last month
- A simple unified framework for evaluating LLMs ☆215 Updated last month
- Prompt-to-Leaderboard ☆231 Updated 3 weeks ago
- ☆114 Updated 3 months ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆530 Updated 2 months ago
- ☆41 Updated 4 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆146 Updated 3 months ago
- Code for the paper 🌳 Tree Search for Language Model Agents ☆199 Updated 10 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc. ☆367 Updated this week
- Enhancing AI Software Engineering with Repository-level Code Graph ☆178 Updated 2 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning ☆169 Updated this week
- ☆93 Updated 10 months ago