SWE-bench/sb-cli

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SWE-bench/sb-cli)

SWE-bench / sb-cli

Run SWE-bench evaluations remotely

☆58

Alternatives and similar repositories for sb-cli

Users that are interested in sb-cli are comparing it to the libraries listed below

Sorting:

SWE-agent / SWE-ReX
View on GitHub
Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.
☆443Updated this week
SWE-bench / experiments
View on GitHub
Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.
☆247Updated this week
SWE-bench / SWE-smith
View on GitHub
[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents
☆577Updated this week
SalesforceAIResearch / swecomm
View on GitHub
☆28Nov 10, 2025Updated 3 months ago
InternLM / SWE-Fixer
View on GitHub
☆132May 8, 2025Updated 9 months ago
aorwall / SWE-bench-docker
View on GitHub
☆104Jul 17, 2024Updated last year
nju-websoft / TSQA
View on GitHub
TSQA: Tabular Scenario Based Question Answering (AAAI 2021)
☆18Dec 17, 2020Updated 5 years ago
google-research-datasets / wikifact
View on GitHub
Wikipedia based dataset to train relationship classifiers and fact extraction models
☆26May 25, 2021Updated 4 years ago
CodeClash-ai / CodeClash
View on GitHub
Benchmarking Goal-Oriented Software Engineering
☆114Jan 7, 2026Updated last month
CUHK-Shenzhen-SE / UTBoost
View on GitHub
[ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
☆35Aug 12, 2025Updated 6 months ago
aixiv-org / aiXiv
View on GitHub
Preprint server for AI Scientists and Robot Scientists
☆49Aug 25, 2025Updated 6 months ago
SWE-Perf / SWE-Perf
View on GitHub
☆47Oct 28, 2025Updated 4 months ago
zhenyuhe00 / SWE-Swiss
View on GitHub
SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution
☆104Sep 24, 2025Updated 5 months ago
Aider-AI / aider-swe-bench
View on GitHub
Harness used to benchmark aider against SWE Bench benchmarks
☆79Jun 27, 2024Updated last year
thesofakillers / aideml
View on GitHub
AIDE: the Machine Learning CodeGen Agent
☆25Oct 7, 2024Updated last year
Samuel-IG16 / alx-higher_level_programming
View on GitHub
Learning the basic fundamentals of high level programming with Python and JavaScript
☆11Mar 10, 2023Updated 2 years ago
SWE-Gym / SWE-Gym
View on GitHub
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆632Jul 29, 2025Updated 7 months ago
ahmadluay9 / travel-planner-app
View on GitHub
This project aims to build a traveling recommendation application using Google Places API and OpenAI LLM.
☆11Mar 19, 2024Updated last year
aws-samples / sample-data-analyst-bi
View on GitHub
A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…
☆27Feb 13, 2026Updated 2 weeks ago
microsoft / TechExcel-Designing-your-own-copilot-using-copilot-studio
View on GitHub
☆17Jan 23, 2026Updated last month
facebookresearch / swe-rl
View on GitHub
[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆678Mar 16, 2025Updated 11 months ago
SWE-bench / SWE-bench
View on GitHub
SWE-bench: Can Language Models Resolve Real-world Github Issues?
☆4,337Feb 19, 2026Updated last week
Azure / agent-innovator-lab
View on GitHub
Agent Innovator Lab – building AI agents on Azure, covering search optimization, agent design, evaluation, and RAG best practices.
☆52Feb 20, 2026Updated last week
spraakbanken / SuperLim-2
View on GitHub
A Swedish Natural Language Understanding Benchmark
☆11Dec 12, 2025Updated 2 months ago
microsoft / github-copilot-workshops-labs
View on GitHub
GitHub Copilot Adoption Plan - Workshops - Labs
☆19Sep 18, 2025Updated 5 months ago
zhimin-z / zhimin-z
View on GitHub
☆12Jan 11, 2026Updated last month
microsoft / github-copilot-workshops-full
View on GitHub
GitHub Copilot Adoption Plan - Workshops - Full Solution
☆18Feb 18, 2026Updated last week
ruvnet / openai-cost-estimator
View on GitHub
GPT API Cost Estimation for Enterprises
☆13Oct 24, 2023Updated 2 years ago
microsoft / BUILD25-LAB333
View on GitHub
This repository hosts the instructions and workshop materials for Lab 333 - Evaluate Reasoning Models for Your Generative AI Solutions
☆19May 21, 2025Updated 9 months ago
Mohit21GoJs / agent-orchestration-examples
View on GitHub
Example for agent orchestration
☆19Mar 31, 2025Updated 11 months ago
lyzs90 / e8tbot
View on GitHub
Food Recommendation ChatBot
☆10Dec 23, 2016Updated 9 years ago
chicks2014 / SQLGPT
View on GitHub
SQLGPT is an advanced SQL query generator powered by natural language processing. Seamlessly transforming plain English queries into comp…
☆10Oct 24, 2023Updated 2 years ago
cybertheory / AutonomousSphere
View on GitHub
AutonomousSphere is an agentic collaboration server. Agents talk, act, and use tools like teammates. Federated servers form an internet o…
☆16May 13, 2025Updated 9 months ago
redowul / reddit-purger
View on GitHub
A Python script to delete all comment and submission data from a given Reddit account.
☆11Jan 5, 2021Updated 5 years ago
sachs7 / flight_finder_and_trip_planner_crewai
View on GitHub
A CrewAI agent based app that helps you in finding flights and planning your itinerary at the destination with top recommended places to …
☆16Nov 30, 2024Updated last year
hokiedsp / matlab-hatchfill2
View on GitHub
MATLAB function to fill an area with hatching ~~or speckling~~
☆11Mar 4, 2018Updated 7 years ago
THU-KEG / PairJudgeRM
View on GitHub
☆14Apr 14, 2025Updated 10 months ago
Azure-Samples / azure-sql-db-developers-workshop
View on GitHub
A workshop for developing with the Azure SQL Database and Azure Services
☆13Feb 10, 2026Updated 2 weeks ago
Azure-Samples / aihlsignited-medindexer
View on GitHub
Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search
☆14Jun 18, 2025Updated 8 months ago