SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents
☆85Jun 16, 2026Updated last week
Alternatives and similar repositories for SWE-PolyBench
Users that are interested in SWE-PolyBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288☆20Oct 30, 2024Updated last year
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving☆341Dec 18, 2025Updated 6 months ago
- ☆32Dec 11, 2024Updated last year
- Agentless Lite: RAG-based SWE-Bench software engineering scaffold☆49Apr 15, 2025Updated last year
- ☆79Jun 19, 2026Updated last week
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- An MCP server wrapper for reducing tokens consumed by MCP tools, available in typescript, python, and rust☆86Jun 22, 2026Updated last week
- Reproducing TinySleepNet for sleep stage prediction based on the signal channel EEG using PyTorch and implementing a new smaller and fast…☆10May 9, 2021Updated 5 years ago
- ☆28Aug 13, 2025Updated 10 months ago
- 《CMake Practice》☆15Oct 28, 2019Updated 6 years ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆272Mar 29, 2026Updated 3 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents☆682Jun 22, 2026Updated last week
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆696Jul 29, 2025Updated 11 months ago
- CLARA: Confidence of Labels and Raters☆11Jun 3, 2023Updated 3 years ago
- This repository contains a PyTorch implementation of the ICSE'26 paper "Scrub It Out! Erasing Sensitive Memorization in Code Language Mod…☆30Sep 18, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A collection of scripts and tools for analyzing SWE agents.☆16May 7, 2025Updated last year
- Code for the main RoboTutor app. Many sound and image assets not included.☆14Nov 5, 2019Updated 6 years ago
- This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software E…☆1,438Jul 18, 2025Updated 11 months ago
- SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner☆39Jun 29, 2025Updated last year
- ☆12Mar 15, 2024Updated 2 years ago
- Official repository for the paper "Fast Predictive Uncertainty for Classification with Bayesian Deep Networks". Accepted at UAI 2022. htt…☆13May 25, 2022Updated 4 years ago
- Monash Scalable Time Series Evaluation Repository☆20Aug 28, 2025Updated 10 months ago
- PyTorch implementation of Optimistic Adam proposed in Training GANs with Optimism (https://arxiv.org/pdf/1711.00141.pdf)☆20Jan 16, 2021Updated 5 years ago
- An easy way to view current and overall statistics for corona virus in your terminal☆11Jun 12, 2020Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Erlang DTrace consumer☆23May 26, 2017Updated 9 years ago
- Benchmarks for Macro Neural Architecture Search; used and described in the paper "Local Search is a Remarkably Strong Baseline for Neural…☆13Jul 25, 2024Updated last year
- CLI to extract article contents in bulk using Newspaper3k and multithreading.☆12Apr 15, 2018Updated 8 years ago
- Simple pub/sub architecture with AWS Copilot☆10Feb 20, 2026Updated 4 months ago
- ☆29Mar 2, 2026Updated 3 months ago
- Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.☆116Jan 14, 2026Updated 5 months ago
- Minimum DevSecOps with Monitoring Options on Amazon EKS☆13Jun 17, 2026Updated last week
- AI for Mathematics Paper List☆17Jan 14, 2025Updated last year
- Interpretable Deep Clustering for Tabular Data (ICML 2024)☆18Aug 26, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The very simple ETS wrapper simplifying cross-process ETS handling (like `Agent`, but `:ets`).☆13Jun 7, 2019Updated 7 years ago
- leveldb backed mail repl.☆10May 5, 2015Updated 11 years ago
- Post processing library used to analyze memory snapshots☆32May 29, 2026Updated last month
- A specification for OpenInference, a semantic mapping of ML inferences☆47Apr 17, 2024Updated 2 years ago
- Harness used to benchmark aider against SWE Bench benchmarks☆85Jun 27, 2024Updated 2 years ago
- Google I/O 2013 Experiment☆74Jun 15, 2015Updated 11 years ago
- ☆13May 8, 2026Updated last month