LLM4SoftwareTesting / TestEvalLinks

☆27

Alternatives and similar repositories for TestEval

Users that are interested in TestEval are comparing it to the libraries listed below

Sorting:

FudanSELab / ClassEval
Benchmark ClassEval for class-level code generation.
☆145Updated 9 months ago
GhabiX / SRepair
✅SRepair: Powerful LLM-based Program Repairer with $0.029/Fixed Bug
☆67Updated last year
gai4se / LLM4SE
Large Language Models for Software Engineering
☆239Updated 2 weeks ago
iSEngLab / AwesomeLLM4APR
A Systematic Literature Review on Large Language Models for Automated Program Repair
☆199Updated 8 months ago
EngineeringSoftware / teco
TeCo: an ML+Execution model for test completion
☆31Updated last year
Intelligent-CAT-Lab / PLTranslationEmpirical
Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", In P…
☆49Updated 3 months ago
allanj / repo-level-codegen-papers
Repo-Level Code generation papers
☆199Updated 3 weeks ago
afortunado-aceptado / Rudra
This repo is for our submission for ICSE 2025.
☆20Updated last year
oceaneLIU / GraphCoder
☆49Updated last year
CoderEval / CoderEval
A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI.
☆149Updated 7 months ago
ZJU-ACES-ISE / ChatUniTest
☆145Updated last week
iSEngLab / GAMMA
[2023 ASE] GAMMA: Revisiting Template-based Automated Program Repair via Mask Prediction
☆22Updated 2 years ago
iSEngLab / AwesomeLearningAPR
[TOSEM 2023] A Survey of Learning-based Automated Program Repair
☆70Updated last year
seketeam / EvoCodeBench
An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories
☆62Updated 11 months ago
sola-st / ExecutionAgent
LLM agent to automatically set up arbitrary projects and run their test suites
☆45Updated 3 weeks ago
facebookresearch / testgeneval
TestGenEval A Real World Unit Test Generation and Test Completion Benchmark
☆20Updated 7 months ago
DeepSoftwareAnalytics / RLCoder
Reinforcement Learning for Repository-Level Code Completion
☆35Updated 11 months ago
soarsmu / BugsInPy
BugsInPy: Benchmarking Bugs in Python Projects
☆105Updated last year
k4black / codebleu
Pip compatible CodeBLEU metric implementation available for linux/macos/win
☆102Updated 4 months ago
lin-tan / clm
For our ICSE23 paper "Impact of Code Language Models on Automated Program Repair" by Nan Jiang, Kevin Liu, Thibaud Lutellier, and Lin Tan
☆61Updated 9 months ago
githubhuyang / refactory
Refactory: Re-factoring based Program Repair applied to Programming Assignments
☆41Updated 2 years ago
jkoppel / QuixBugs
A multi-lingual program repair benchmark set based on the Quixey Challenge
☆125Updated 2 years ago
microsoft / codamosa
☆142Updated 2 months ago
nju-websoft / DraCo
Dataflow-guided retrieval augmentation for repository-level code completion, ACL 2024 (main)
☆26Updated 4 months ago
thunlp / DebugBench
The repository for paper "DebugBench: "Evaluating Debugging Capability of Large Language Models".
☆80Updated last year
iSEngLab / APPT
[TSE 2024] APPT: Boosting Automated Patch Correctness Prediction via Fine-tuning Pre-trained Models
☆14Updated last year
ASSERT-KTH / repairllama
RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program Repair http://arxiv.org/pdf/2312.15698
☆32Updated 2 months ago
iCSawyer / SEConfPaperList
List of research papers of ICSE, FSE, ASE, and ISSTA since 2020.
☆20Updated 3 months ago
JohnnyPeng18 / TypeGen
This is the tool released in the ASE'23 paper "Generative Type Inference for Python".
☆27Updated last year
microsoft / methods2test
methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositor…
☆160Updated last year