☆47Apr 7, 2026Updated last month
Alternatives and similar repositories for SWE-QA-Bench
Users that are interested in SWE-QA-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution☆28Nov 11, 2025Updated 5 months ago
- SWE-Exp: Experience-Driven Software Issue Resolution☆40Oct 17, 2025Updated 6 months ago
- something for paper agent☆11Dec 18, 2024Updated last year
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- Multi-Granularity LLM Debugger [ICSE2026]☆98Jul 6, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- LongCodeZip: Compress Long Context for Code Language Models [ASE2025]☆155Feb 5, 2026Updated 3 months ago
- ☆50Oct 28, 2025Updated 6 months ago
- ☆27Apr 7, 2026Updated last month
- Must-read papers on Repository-level Code Generation & Issue Resolution 🔥☆291Apr 30, 2026Updated last week
- 香港vps推荐☆40Dec 11, 2025Updated 4 months ago
- Guide: from fragile multi-agent app to prod ready with orra - code and resources.☆14Mar 24, 2025Updated last year
- Reading notes on Speculative Decoding papers☆32Apr 16, 2026Updated 3 weeks ago
- AI powered coding Agent☆37Oct 22, 2025Updated 6 months ago
- This is the repository for the paper titled "ThinkRepair: Self-Directed Automated Program Repair" accepted by ISSTA'24.☆31Jan 10, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆25Oct 2, 2024Updated last year
- A timer theme of Wallpaper Engine (13k Subscribers)☆13Oct 26, 2022Updated 3 years ago
- [ICLR 2026] Official Implementation of "FeatureBench: Benchmarking Agentic Coding for Complex Feature Development"☆60Apr 27, 2026Updated last week
- ☆22Feb 6, 2026Updated 3 months ago
- ☆153Mar 18, 2026Updated last month
- DSN jailbreak Attack & Evaluation Ensemble☆17Feb 7, 2026Updated 3 months ago
- Control your smart home from the terminal☆12Jun 4, 2021Updated 4 years ago
- TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD)☆29Apr 28, 2026Updated last week
- A tool for simulating an arbitrary connection between two network endpoints☆19May 31, 2019Updated 6 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆45Jan 6, 2025Updated last year
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr…☆44Jan 8, 2026Updated 4 months ago
- A modern, blazing-fast SQL IDE for the cloud era. Query PostgreSQL, MySQL, SQLite & MongoDB from anywhere — your browser is your new data…☆45Mar 27, 2026Updated last month
- ☆15Jun 22, 2022Updated 3 years ago
- ☆17Feb 4, 2025Updated last year
- ☆84Mar 30, 2026Updated last month
- 基于论文摘要的文本分类与关键词抽取挑战赛—Task 1☆23Aug 10, 2023Updated 2 years ago
- official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries☆71Nov 10, 2025Updated 5 months ago
- Implementation of a multi-turn Chain of Thought (CoT) reasoning system, powered by the Llama 3.1 70B model on Groq.☆18Sep 22, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Robotic arm using dynamixels☆17Jan 17, 2021Updated 5 years ago
- MCE: Clone Human Souls with LLM Native Agent Skills | 基于 LLM Agent Skills 的心智克隆工程 | Agent Skills | Mind Skills | Mind Clone☆53Dec 21, 2025Updated 4 months ago
- Advanced Shodan-based scanner for discovering, verifying, and enumerating Model Context Protocol (MCP) servers and AI infrastructure tool…☆45Mar 31, 2026Updated last month
- A production-grade implementation of an Investment Portfolio Management System created for testing LLM translation of real world legacy a…☆22Oct 30, 2024Updated last year
- ☆69Mar 12, 2026Updated last month
- ☆45Jul 19, 2025Updated 9 months ago
- ☆105Mar 6, 2026Updated 2 months ago