A benchmarking tool for evaluating AI coding assistants on real-world software engineering tasks from the SWE-Bench dataset.
☆63 · Jan 22, 2026 · Updated 2 months ago
Alternatives and similar repositories for refact-bench
Users that are interested in refact-bench are comparing it to the libraries listed below.
- [ICSE 2025] The Seeds of the FUTURE Sprout from History: Fuzzing for Unveiling Vulnerabilities in Prospective Deep-Learning Libraries (AC… ☆20 · Dec 22, 2025 · Updated 3 months ago
- Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment" ☆17 · Feb 26, 2026 · Updated last month
- ☆13 · May 19, 2024 · Updated last year
- ☆13 · Jun 27, 2025 · Updated 9 months ago
- Semi-automated modelling and Model-Based Testing for CosmWasm contracts ☆17 · Jun 28, 2024 · Updated last year
- ⚙️ Program slicer based on the Mozilla Lithium Tool for Java (also dubbed as Tandem-FL). ☆11 · Oct 21, 2024 · Updated last year
- Reproducing BugsInPy: Benchmarking Bugs in Python Projects ☆14 · Sep 4, 2023 · Updated 2 years ago
- A SCL Unit Testing library ☆11 · Nov 13, 2018 · Updated 7 years ago
- The CompCert formally-verified C compiler ☆11 · Apr 4, 2026 · Updated last week
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span … ☆14 · Aug 25, 2023 · Updated 2 years ago
- BiasFinder | IEEE TSE | Metamorphic Test Generation to Uncover Bias for Sentiment Analysis Systems ☆11 · Jan 18, 2022 · Updated 4 years ago
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering" ☆15 · Oct 2, 2025 · Updated 6 months ago
- ☆16 · Feb 28, 2024 · Updated 2 years ago
- LLM-based and retrieval-augmented Control Code Generation ☆23 · Oct 22, 2024 · Updated last year
- LLM-based mutation testing ☆14 · Feb 3, 2025 · Updated last year
- Nyx: Detecting Exploitable Front-Running Vulnerabilities in Smart Contracts ☆22 · May 11, 2024 · Updated last year
- Efficient APR with LLMs http://arxiv.org/pdf/2402.06598 ☆16 · May 28, 2024 · Updated last year
- Color palette and swatches for macOS's color picker. ☆20 · Jun 9, 2020 · Updated 5 years ago
- The TacTok automated Coq proof script synthesis tool ☆17 · Jan 9, 2024 · Updated 2 years ago
- [ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation ☆49 · Jan 28, 2026 · Updated 2 months ago
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior… ☆13 · Dec 5, 2023 · Updated 2 years ago
- ☆14 · Oct 11, 2017 · Updated 8 years ago
- ☆25 · Apr 7, 2026 · Updated last week
- ☆52 · Mar 9, 2026 · Updated last month
- Language models for Coq based on data collected from the coq lsp. ☆29 · Feb 23, 2026 · Updated last month
- [USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks ☆20 · Sep 18, 2025 · Updated 6 months ago
- BENZENE: A Practical Root Cause Analysis System with an Under-Constrained State Mutation ☆25 · Mar 28, 2024 · Updated 2 years ago
- Siren: Byzantine-robust Federated Learning via Proactive Alarming (SoCC '21) ☆11 · Mar 28, 2024 · Updated 2 years ago
- ☆17 · Jul 17, 2021 · Updated 4 years ago
- White-box Fairness Testing through Adversarial Sampling ☆14 · Apr 16, 2021 · Updated 5 years ago
- A naive interpreter for IR of NJU compiler principle lab3, to accelerate interpretation, the ir will be compiled to machine-friendly bina… ☆16 · Jun 17, 2020 · Updated 5 years ago
- ☆26 · Apr 9, 2026 · Updated last week
- Interview experience notes ☆14 · Sep 11, 2019 · Updated 6 years ago
- [CVPR'24] LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning ☆15 · Jan 15, 2025 · Updated last year
- Improving Machine Translation Systems via Isotopic Replacement ☆12 · Apr 14, 2023 · Updated 3 years ago
- ☆29 · Mar 18, 2024 · Updated 2 years ago
- ☆13 · May 17, 2025 · Updated 10 months ago
- Tests that check correctness of a single statement ☆14 · Nov 25, 2024 · Updated last year
- OpenCopilot flows editor ☆12 · Oct 31, 2023 · Updated 2 years ago