k4black / codebleu
Pip-compatible CodeBLEU metric implementation, available for Linux, macOS, and Windows
☆97Updated 3 months ago
Alternatives and similar repositories for codebleu
Users who are interested in codebleu are comparing it to the libraries listed below.
- Benchmark ClassEval for class-level code generation.☆144Updated 8 months ago
- Large Language Models for Software Engineering☆235Updated last week
- Repo-Level Code generation papers☆192Updated 3 months ago
- A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI.☆145Updated 6 months ago
- [TOSEM 2023] A Survey of Learning-based Automated Program Repair☆71Updated last year
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories☆61Updated 11 months ago
- ☆30Updated 2 years ago
- ☆25Updated 2 years ago
- For our ICSE23 paper "Impact of Code Language Models on Automated Program Repair" by Nan Jiang, Kevin Liu, Thibaud Lutellier, and Lin Tan☆61Updated 8 months ago
- ☆46Updated 2 years ago
- This repo is for our submission for ICSE 2025.☆20Updated last year
- ✅SRepair: Powerful LLM-based Program Repairer with $0.029/Fixed Bug☆67Updated last year
- This repo illustrates how to evaluate the artifacts in the paper An Extensive Study on Pre-trained Models for Program Understanding and G…☆25Updated 2 years ago
- ☆21Updated 7 months ago
- Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", In P…☆49Updated 3 months ago
- NaturalCC: An Open-Source Toolkit for Code Intelligence☆301Updated 3 weeks ago
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence☆74Updated 5 months ago
- [TSE 2024] APPT: Boosting Automated Patch Correctness Prediction via Fine-tuning Pre-trained Models☆14Updated last year
- List of research papers of ICSE, FSE, ASE, and ISSTA since 2020.☆19Updated 2 months ago
- Enhancing Code Pre-trained Models by Contrastive Learning☆35Updated 2 years ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆145Updated 11 months ago
- Replication Package for "Natural Attack for Pre-trained Models of Code", ICSE 2022☆46Updated 10 months ago
- ☆59Updated 2 years ago
- ☆34Updated last year
- EvoEval: Evolving Coding Benchmarks via LLM☆74Updated last year
- ☆41Updated 2 years ago
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".☆78Updated last year
- ☆23Updated 9 months ago
- ☆48Updated last year
- A multi-lingual program repair benchmark set based on the Quixey Challenge☆124Updated 2 years ago