Run SWE-bench evaluations remotely
☆58 · Aug 14, 2025 · Updated 6 months ago
Alternatives and similar repositories for sb-cli
Users that are interested in sb-cli are comparing it to the libraries listed below
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more. ☆443 · Updated this week
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆247 · Updated this week
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents ☆577 · Updated this week
- ☆28 · Nov 10, 2025 · Updated 3 months ago
- ☆132 · May 8, 2025 · Updated 9 months ago
- ☆104 · Jul 17, 2024 · Updated last year
- TSQA: Tabular Scenario Based Question Answering (AAAI 2021) ☆18 · Dec 17, 2020 · Updated 5 years ago
- Wikipedia based dataset to train relationship classifiers and fact extraction models ☆26 · May 25, 2021 · Updated 4 years ago
- Benchmarking Goal-Oriented Software Engineering ☆114 · Jan 7, 2026 · Updated last month
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench ☆35 · Aug 12, 2025 · Updated 6 months ago
- Preprint server for AI Scientists and Robot Scientists ☆49 · Aug 25, 2025 · Updated 6 months ago
- ☆47 · Oct 28, 2025 · Updated 4 months ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution ☆104 · Sep 24, 2025 · Updated 5 months ago
- Harness used to benchmark aider against SWE Bench benchmarks ☆79 · Jun 27, 2024 · Updated last year
- AIDE: the Machine Learning CodeGen Agent ☆25 · Oct 7, 2024 · Updated last year
- Learning the basic fundamentals of high level programming with Python and JavaScript ☆11 · Mar 10, 2023 · Updated 2 years ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ☆632 · Jul 29, 2025 · Updated 7 months ago
- This project aims to build a traveling recommendation application using Google Places API and OpenAI LLM. ☆11 · Mar 19, 2024 · Updated last year
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte… ☆27 · Feb 13, 2026 · Updated 2 weeks ago
- ☆17 · Jan 23, 2026 · Updated last month
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆678 · Mar 16, 2025 · Updated 11 months ago
- SWE-bench: Can Language Models Resolve Real-world Github Issues? ☆4,337 · Feb 19, 2026 · Updated last week
- Agent Innovator Lab – building AI agents on Azure, covering search optimization, agent design, evaluation, and RAG best practices. ☆52 · Feb 20, 2026 · Updated last week
- A Swedish Natural Language Understanding Benchmark ☆11 · Dec 12, 2025 · Updated 2 months ago
- GitHub Copilot Adoption Plan - Workshops - Labs ☆19 · Sep 18, 2025 · Updated 5 months ago
- ☆12 · Jan 11, 2026 · Updated last month
- GitHub Copilot Adoption Plan - Workshops - Full Solution ☆18 · Feb 18, 2026 · Updated last week
- GPT API Cost Estimation for Enterprises ☆13 · Oct 24, 2023 · Updated 2 years ago
- This repository hosts the instructions and workshop materials for Lab 333 - Evaluate Reasoning Models for Your Generative AI Solutions ☆19 · May 21, 2025 · Updated 9 months ago
- Example for agent orchestration ☆19 · Mar 31, 2025 · Updated 11 months ago
- Food Recommendation ChatBot ☆10 · Dec 23, 2016 · Updated 9 years ago
- SQLGPT is an advanced SQL query generator powered by natural language processing. Seamlessly transforming plain English queries into comp… ☆10 · Oct 24, 2023 · Updated 2 years ago
- AutonomousSphere is an agentic collaboration server. Agents talk, act, and use tools like teammates. Federated servers form an internet o… ☆16 · May 13, 2025 · Updated 9 months ago
- A Python script to delete all comment and submission data from a given Reddit account. ☆11 · Jan 5, 2021 · Updated 5 years ago
- A CrewAI agent based app that helps you in finding flights and planning your itinerary at the destination with top recommended places to … ☆16 · Nov 30, 2024 · Updated last year
- MATLAB function to fill an area with hatching ~~or speckling~~ ☆11 · Mar 4, 2018 · Updated 7 years ago
- ☆14 · Apr 14, 2025 · Updated 10 months ago
- A workshop for developing with the Azure SQL Database and Azure Services ☆13 · Feb 10, 2026 · Updated 2 weeks ago
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search ☆14 · Jun 18, 2025 · Updated 8 months ago