wala / blanca
BLANCA - Benchmarks for LANguage models on Coding Artifacts
☆9 Updated 3 years ago
Alternatives and similar repositories for blanca:
Users who are interested in blanca are comparing it to the libraries listed below.
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆46 Updated last year
- ☆26 Updated 2 months ago
- ☆59 Updated 10 months ago
- ☆15 Updated 3 years ago
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding" ☆14 Updated 2 years ago
- code for "Natural Language to Code Translation with Execution" ☆40 Updated 2 years ago
- ☆21 Updated 4 months ago
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules" ☆44 Updated 2 months ago
- A plugin for code generation in PyCharm/IntelliJ using tranX ☆35 Updated 2 years ago
- ☆74 Updated last year
- PROSE Public Benchmark Suite ☆24 Updated 5 months ago
- Training and Benchmarking LLMs for Code Preference. ☆33 Updated 4 months ago
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so… ☆55 Updated 11 months ago
- Incremental Python parser for constrained generation of code by LLMs. ☆15 Updated 5 months ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code. ☆19 Updated 2 years ago
- ☆42 Updated last month
- Code for "Learning Structural Edits via Incremental Tree Transformations" (ICLR'21) ☆41 Updated 3 years ago
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022) ☆86 Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models ☆57 Updated 11 months ago
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W… ☆87 Updated 2 years ago
- code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models" ☆12 Updated 3 months ago
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f… ☆22 Updated 4 years ago
- Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question Text Improves Extractive Code Generation" ☆21 Updated 3 years ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models" ☆51 Updated 11 months ago
- This is the repository for the paper Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descripti… ☆25 Updated 2 years ago
- ML models often mispredict, and it is hard to tell when and why. We present a data mining based approach to discover whether there is a c… ☆18 Updated 2 years ago
- ☆14 Updated last year
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning" ☆108 Updated last year
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation ☆36 Updated this week