IBM/TDD-Bench-Verified

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IBM/TDD-Bench-Verified)

IBM / TDD-Bench-Verified

TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD)

☆33

Alternatives and similar repositories for TDD-Bench-Verified

Users that are interested in TDD-Bench-Verified are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

codellm-devkit / python-sdk
View on GitHub
The official Python SDK for Codellm-Devkit
☆22Updated this week
DeepDiagnosis / ICSE2022
View on GitHub
☆12Jun 13, 2022Updated 4 years ago
IBMStreams / OSStreams
View on GitHub
Open-source, Cloud-native Streams
☆12Apr 7, 2021Updated 5 years ago
whl97 / LS-Score
View on GitHub
☆15Nov 24, 2020Updated 5 years ago
zysszy / CAT
View on GitHub
Improving Machine Translation Systems via Isotopic Replacement
☆12Apr 14, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
SecureAIAutonomyLab / AutoSafeCoder
View on GitHub
☆28Sep 15, 2024Updated last year
OptimizeAIHub / Cypress-Copilot
View on GitHub
Cypress Copilot is a Visual Studio Code extension that accelerates BDD (Behavior Driven Development) testing with AI-generated code sugge…
☆38Jun 11, 2026Updated last month
Alan-Cha / graphql-complexity-paper-artifact
View on GitHub
☆12Aug 17, 2021Updated 4 years ago
liuqingli / CCLearner
View on GitHub
A Deep Learning-Based Clone Detection Approach
☆17Jul 27, 2017Updated 8 years ago
konveyor / tackle-config-discover
View on GitHub
☆15Sep 2, 2022Updated 3 years ago
logic-star-ai / swt-bench
View on GitHub
[NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation
☆85Updated this week
jeffsvajlenko / CloneWorks
View on GitHub
☆11Mar 25, 2021Updated 5 years ago
PrasannS / rlhf-length-biases
View on GitHub
☆27Mar 13, 2024Updated 2 years ago
Dacoband / stream-cart-mobile
View on GitHub
Stream Cart Mobile is a cutting-edge e-commerce mobile application built with Flutter, designed specifically for live streaming commerce.…
☆18Oct 3, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ptiger10 / tada
View on GitHub
Test-driven data pipelines in pure Go
☆11Jul 16, 2020Updated 6 years ago
schettino72 / import-deps
View on GitHub
find python module imports
☆25Feb 7, 2026Updated 5 months ago
microsoft / FEA-Bench
View on GitHub
[ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation
☆57Jan 28, 2026Updated 5 months ago
SERG-Delft / serg-delft.github.io
View on GitHub
SERG web site
☆11Jul 3, 2026Updated 3 weeks ago
IBM / AIMMX
View on GitHub
Automated AI Model Metadata eXtractor - automatically extracts and infers AI model-related from software repositories
☆11Sep 21, 2025Updated 10 months ago
deepppl / deepppl
View on GitHub
Deep Probabilistic Programming Language
☆19Jul 25, 2024Updated 2 years ago
konveyor / tackle-container-advisor
View on GitHub
Recommends containerization plan for legacy applications.
☆30Jul 12, 2023Updated 3 years ago
IBM / Lale.jl
View on GitHub
a Julia wrapper of Python's lale automl package
☆18Oct 11, 2021Updated 4 years ago
SWE-rebench / SWE-bench-fork
View on GitHub
Fork to run instances from SWE-rebench
☆31Jun 3, 2026Updated last month
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
YoshikiHigo / TinyPDG
View on GitHub
A library for building intraprocedural PDGs for Java programs
☆38Sep 28, 2023Updated 2 years ago
brickpop / serverless-tdd-starter
View on GitHub
Test Driven Development starter for Serverless and NodeJS. Featuring persistent MongoDB connections.
☆11Jan 4, 2023Updated 3 years ago
CosineAI / experiments
View on GitHub
Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.
☆14Sep 4, 2024Updated last year
ucsb-mlsec / PatchPilot
View on GitHub
PatchPilot: A Stable and Cost-Efficient Agentic Patching Framework
☆21Jun 5, 2025Updated last year
frantic / delphi-tdd-example
View on GitHub
Test Driven Development example with Delphi and DUnit
☆18Jul 13, 2013Updated 13 years ago
tangbc / js-test-workflows
View on GitHub
A collection of different JavaScript test workflows, helping you choose a suitable/preferable Test-Driven Development.
☆10Jul 30, 2018Updated 7 years ago
Hazem-Ben-Khalfallah / test-cherry
View on GitHub
An Intellij Plugin that generates unit test methods with meaningful names based in described behaviours with @should tags in methods ja…
☆10Dec 14, 2025Updated 7 months ago
dongjunKANG / VIM
View on GitHub
☆11Oct 16, 2023Updated 2 years ago
IBM / FormalML
View on GitHub
Formalization of Machine Learning Theory with Applications to Program Synthesis
☆77Mar 31, 2026Updated 3 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
whisperzqh / ProjectGen
View on GitHub
☆15Nov 28, 2025Updated 7 months ago
Freeky7819 / DragonMemory
View on GitHub
Neural Memory Compression System for RAG Applications
☆20Nov 20, 2025Updated 8 months ago
NoCode-bench / NoCode-bench
View on GitHub
☆21May 20, 2026Updated 2 months ago
Hambaobao / SWE-Flow
View on GitHub
SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner
☆40Jun 29, 2025Updated last year
CarperAI / CodeReviewSE
View on GitHub
Stuff related to scraping the Code Review StackExchange
☆11Jan 19, 2023Updated 3 years ago
awen-li / PyRTFuzz
View on GitHub
Two-Level Collaborative Fuzzing for Python Runtimes
☆18Nov 25, 2023Updated 2 years ago
CodeClash-ai / CodeClash
View on GitHub
Benchmarking Goal-Oriented Software Engineering
☆190Jul 16, 2026Updated last week