wala / blancaLinks
BLANCA - Benchmarks for LANguage models on Coding Artifacts
☆9Updated 3 years ago
Alternatives and similar repositories for blanca
Users that are interested in blanca are comparing it to the libraries listed below
Sorting:
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆48Updated last year
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆45Updated 5 months ago
- ☆27Updated 5 months ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆29Updated 10 months ago
- ☆26Updated this week
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆73Updated 2 years ago
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)☆86Updated 2 years ago
- Web queries dataset for code search☆32Updated 2 years ago
- ☆15Updated 3 years ago
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Updated last year
- ☆75Updated 3 months ago
- Training and Benchmarking LLMs for Code Preference.☆33Updated 7 months ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆53Updated last year
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆50Updated 3 weeks ago
- ☆45Updated 4 months ago
- We introduce FixEval , a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…☆23Updated 2 years ago
- ☆44Updated last year
- ☆29Updated 2 years ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆24Updated 3 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆78Updated last year
- Documenting large text datasets 🖼️ 📚☆12Updated 6 months ago
- A plugin for code generation in PyCharm/IntelliJ using tranX☆36Updated 3 years ago
- ☆14Updated last year
- code for "Natural Language to Code Translation with Execution"☆41Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Weakly Supervised Text-to-SQL Parsing through Question Decomposition☆22Updated last year
- ☆34Updated this week
- ☆47Updated last year
- Models and datasets for annotated code search.☆35Updated 2 years ago