TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD)
☆26Sep 18, 2025Updated 6 months ago
Alternatives and similar repositories for TDD-Bench-Verified
Users that are interested in TDD-Bench-Verified are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Improving Machine Translation Systems via Isotopic Replacement☆12Apr 14, 2023Updated 3 years ago
- ☆27Sep 15, 2024Updated last year
- ☆11Mar 25, 2021Updated 5 years ago
- ☆12Aug 17, 2021Updated 4 years ago
- Benchmarking Goal-Oriented Software Engineering☆134Jan 7, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Automated AI Model Metadata eXtractor - automatically extracts and infers AI model-related from software repositories☆11Sep 21, 2025Updated 6 months ago
- a Julia wrapper of Python's lale automl package☆18Oct 11, 2021Updated 4 years ago
- SWE-Exp: Experience-Driven Software Issue Resolution☆38Oct 17, 2025Updated 5 months ago
- AI powered coding Agent☆36Oct 22, 2025Updated 5 months ago
- 香港vps推荐☆38Dec 11, 2025Updated 4 months ago
- Building self-refined guardrails via DSPy☆14Jul 2, 2024Updated last year
- [EMNLP 2024] Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction☆17Nov 9, 2024Updated last year
- A curated list of software engineering research, data set, tool.☆32Dec 16, 2022Updated 3 years ago
- A dataset to shed light upon Serverless computing.☆14Mar 19, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆260Jul 13, 2025Updated 9 months ago
- ICSE 2018 paper implement☆18Jan 8, 2019Updated 7 years ago
- ☆47Apr 7, 2026Updated last week
- Formalization of Machine Learning Theory with Applications to Program Synthesis☆78Mar 31, 2026Updated 2 weeks ago
- Stuff related to scraping the Code Review StackExchange☆12Jan 19, 2023Updated 3 years ago
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution☆26Nov 11, 2025Updated 5 months ago
- ☆27Jan 20, 2026Updated 2 months ago
- A timer theme of Wallpaper Engine (13k Subscribers)☆13Oct 26, 2022Updated 3 years ago
- Load Tensorflow pb file using Bert/TextCNNs, an ensemble model using Java.☆10Aug 20, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- MTTM: Metamorphic Testing for Textual Content Moderation Software☆32Feb 10, 2023Updated 3 years ago
- ☆16Jan 23, 2026Updated 2 months ago
- 🔍 Code Search Tools & Experiments☆12Mar 1, 2026Updated last month
- Builds a WMT18-like corpus for word-level QE with annotations in the source and target words.☆10Sep 19, 2022Updated 3 years ago
- ☆11Jul 20, 2021Updated 4 years ago
- ☆26May 30, 2023Updated 2 years ago
- 基于中心度的中文关键短语抽取工具☆11Sep 2, 2022Updated 3 years ago
- piggybacking on the Dafny language implementation to explore interactive semi-automated verified program synthesis, combining LLMs and sy…☆16Mar 26, 2026Updated 2 weeks ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆196Aug 16, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Dec 2, 2021Updated 4 years ago
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.☆59Jul 31, 2024Updated last year
- Code for EMNLP 2022 paper "A Unified Encoder-Decoder Framework with Entity Memory"☆15Apr 24, 2023Updated 2 years ago
- Hosts our tool for mining simple "stupid'' bugs (SStuBs).☆38May 20, 2022Updated 3 years ago
- ☆21Oct 6, 2021Updated 4 years ago
- An Intellij Plugin that generates unit test methods with meaningful names based in described behaviours with @should tags in methods ja…☆10Dec 14, 2025Updated 4 months ago
- ☆14Jun 11, 2025Updated 10 months ago