☆46Jan 21, 2026Updated 2 months ago
Alternatives and similar repositories for SWE-QA-Bench
Users that are interested in SWE-QA-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SWE-Exp: Experience-Driven Software Issue Resolution☆38Oct 17, 2025Updated 5 months ago
- something for paper agent☆11Dec 18, 2024Updated last year
- UGround: Towards Unified Visual Grounding with Unrolled Transformers☆22Feb 15, 2026Updated last month
- Multi-Granularity LLM Debugger [ICSE2026]☆96Jul 6, 2025Updated 8 months ago
- ☆48Oct 28, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- DependEval: a hierarchical benchmark for evaluating LLMs on repository-level code understanding across 8 programming languages.☆15Jul 28, 2025Updated 8 months ago
- Interface for GenAI-Arena [NeurIPS24]☆17Feb 27, 2024Updated 2 years ago
- ☆23Dec 25, 2025Updated 3 months ago
- Must-read papers on Repository-level Code Generation & Issue Resolution 🔥☆269Dec 22, 2025Updated 3 months ago
- ☆119Mar 18, 2026Updated last week
- AI powered coding Agent☆36Oct 22, 2025Updated 5 months ago
- This is the repository for the paper titled "ThinkRepair: Self-Directed Automated Program Repair" accepted by ISSTA'24.☆30Jan 10, 2026Updated 2 months ago
- Defect Library for LLM-enabled Software☆23Dec 31, 2025Updated 2 months ago
- Just sort out the photos and generate the photo wall with one click; Realize the alignment of a large number of photos of any scale witho…☆18Oct 3, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆20Feb 6, 2026Updated last month
- DSN jailbreak Attack & Evaluation Ensemble☆17Feb 7, 2026Updated last month
- TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD)☆27Sep 18, 2025Updated 6 months ago
- [NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs☆49Nov 29, 2024Updated last year
- 该仓库为2021届某位学长开源的XJTUSE的实验/作业☆24Aug 25, 2024Updated last year
- Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)☆52Feb 4, 2026Updated last month
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena…☆42Feb 18, 2026Updated last month
- ☆45Jan 6, 2025Updated last year
- Twinkle✨: Training workbench to make your model glow.☆198Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 全国计算机等级考试二级python的学习笔记(适用2020年)☆20Mar 15, 2020Updated 6 years ago
- ☆77Updated this week
- 基于论文摘要的文本分类与关键词抽取挑战赛—Task 1☆23Aug 10, 2023Updated 2 years ago
- official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries☆68Nov 10, 2025Updated 4 months ago
- ☆15Jun 22, 2022Updated 3 years ago
- ☆41Jul 19, 2025Updated 8 months ago
- ☆79Mar 6, 2026Updated 3 weeks ago
- Implementation of a multi-turn Chain of Thought (CoT) reasoning system, powered by the Llama 3.1 70B model on Groq.☆19Sep 22, 2024Updated last year
- MCE: Clone Human Souls with LLM Native Agent Skills | 基于 LLM Agent Skills 的心智克隆工程 | Agent Skills | Mind Skills | Mind Clone☆48Dec 21, 2025Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [EMNLP 2023] CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation☆59Nov 16, 2023Updated 2 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- ☆68Mar 12, 2026Updated 2 weeks ago
- Advances and Frontiers of LLM-based Issue Resolution in Software Engineering A Comprehensive Survey☆74Mar 19, 2026Updated last week
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- Repository of IPBench☆20Jan 4, 2026Updated 2 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago