TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD)
☆29Apr 28, 2026Updated last week
Alternatives and similar repositories for TDD-Bench-Verified
Users that are interested in TDD-Bench-Verified are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Improving Machine Translation Systems via Isotopic Replacement☆12Apr 14, 2023Updated 3 years ago
- A Deep Learning-Based Clone Detection Approach☆18Jul 27, 2017Updated 8 years ago
- ☆11Mar 25, 2021Updated 5 years ago
- Benchmarking Goal-Oriented Software Engineering☆149Jan 7, 2026Updated 3 months ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆76Apr 28, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆27Mar 13, 2024Updated 2 years ago
- ☆27Apr 7, 2026Updated 3 weeks ago
- A library for building intraprocedural PDGs for Java programs☆37Sep 28, 2023Updated 2 years ago
- 香港vps推荐☆40Dec 11, 2025Updated 4 months ago
- Building self-refined guardrails via DSPy☆14Jul 2, 2024Updated last year
- [EMNLP 2024] Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction☆17Nov 9, 2024Updated last year
- A curated list of software engineering research, data set, tool.☆32Dec 16, 2022Updated 3 years ago
- A dataset to shed light upon Serverless computing.☆14Mar 19, 2021Updated 5 years ago
- Clawdbot安装教程:从零开始到接入飞书☆42Apr 9, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆267Jul 13, 2025Updated 9 months ago
- AI powered coding Agent☆37Oct 22, 2025Updated 6 months ago
- The source code of paper "Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph" in KDD2022.☆15Jan 9, 2023Updated 3 years ago
- Formalization of Machine Learning Theory with Applications to Program Synthesis☆78Mar 31, 2026Updated last month
- ☆47Apr 7, 2026Updated 3 weeks ago
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution☆27Nov 11, 2025Updated 5 months ago
- 3D visualization for code structure and code quality☆16May 31, 2019Updated 6 years ago
- ☆27Updated this week
- An ANTLR4 grammar for ECMAScript 5.1☆16Jul 13, 2017Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A simple project to get information from the git repository of your project☆29Apr 12, 2023Updated 3 years ago
- Load Tensorflow pb file using Bert/TextCNNs, an ensemble model using Java.☆10Aug 20, 2021Updated 4 years ago
- Reproduction Package for the paper "Type-Constrained Code Generation with Language Models" [PLDI 2025]☆88Mar 11, 2026Updated last month
- ☆16Jan 23, 2026Updated 3 months ago
- ☆10Mar 24, 2026Updated last month
- 🔍 Code Search Tools & Experiments☆12Mar 1, 2026Updated 2 months ago
- Builds a WMT18-like corpus for word-level QE with annotations in the source and target words.☆10Sep 19, 2022Updated 3 years ago
- [ICLR 2026] Official Implementation of "FeatureBench: Benchmarking Agentic Coding for Complex Feature Development"☆58Apr 27, 2026Updated last week
- ☆11Jul 20, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆153Mar 18, 2026Updated last month
- ☆26May 30, 2023Updated 2 years ago
- 基于中心度的中文关键短语抽取工具☆11Sep 2, 2022Updated 3 years ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆204Aug 16, 2024Updated last year
- Python bindings for libsrcml☆17Aug 25, 2025Updated 8 months ago
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.☆59Jul 31, 2024Updated last year
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated last year