☆47Apr 7, 2026Updated last week
Alternatives and similar repositories for SWE-QA-Bench
Users that are interested in SWE-QA-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution☆26Nov 11, 2025Updated 5 months ago
- SWE-Exp: Experience-Driven Software Issue Resolution☆38Oct 17, 2025Updated 6 months ago
- something for paper agent☆11Dec 18, 2024Updated last year
- ☆49Oct 28, 2025Updated 5 months ago
- Interface for GenAI-Arena [NeurIPS24]☆17Feb 27, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆26Apr 7, 2026Updated last week
- AI powered coding Agent☆36Oct 22, 2025Updated 5 months ago
- 香港vps推荐☆38Dec 11, 2025Updated 4 months ago
- ☆146Mar 18, 2026Updated last month
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆76Mar 23, 2026Updated 3 weeks ago
- [ICLR 2026] Official Implementation of "FeatureBench: Benchmarking Agentic Coding for Complex Feature Development"☆48Mar 31, 2026Updated 2 weeks ago
- A timer theme of Wallpaper Engine (13k Subscribers)☆13Oct 26, 2022Updated 3 years ago
- Detection of LLM-Generated Codes [ICSE2025]☆32Jul 5, 2025Updated 9 months ago
- ☆21Feb 6, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A simple but relative strong Reversi bot running on botzone.org☆15Mar 12, 2022Updated 4 years ago
- DSN jailbreak Attack & Evaluation Ensemble☆17Feb 7, 2026Updated 2 months ago
- [NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs☆49Nov 29, 2024Updated last year
- TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD)☆27Sep 18, 2025Updated 7 months ago
- Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)☆53Feb 4, 2026Updated 2 months ago
- ☆80Mar 30, 2026Updated 2 weeks ago
- Twinkle✨: Training workbench to make your model glow.☆210Updated this week
- 基于论文摘要的文本分类与关键词抽取挑战赛—Task 1☆23Aug 10, 2023Updated 2 years ago
- official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries☆69Nov 10, 2025Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [EMNLP 2023] CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation☆59Nov 16, 2023Updated 2 years ago
- ☆42Jul 19, 2025Updated 8 months ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text☆34Jul 26, 2023Updated 2 years ago
- ☆69Mar 12, 2026Updated last month
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- [ACL 2026] Repository of IPBench☆21Apr 6, 2026Updated last week
- ☆15Jul 26, 2022Updated 3 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The source code and the data for ACL 2022 paper "Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Dat…☆14Apr 21, 2023Updated 2 years ago
- EA-HAS-Bench: Energy-Aware Hyperparameter and Architecture Search Benchmark (ICLR Spotlight 2023)☆18Dec 8, 2024Updated last year
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"☆13Feb 14, 2025Updated last year
- A collection list for Large Language Model (LLM) Watermark☆60Mar 30, 2026Updated 2 weeks ago
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆73Aug 31, 2024Updated last year
- ☆12Feb 22, 2024Updated 2 years ago
- A PostgreSQL extension for collecting statistics about sorts, helping tuning work_mem☆13Jan 14, 2023Updated 3 years ago