microsoft / prose-benchmarksLinks
PROSE Public Benchmark Suite
☆26Updated 8 months ago
Alternatives and similar repositories for prose-benchmarks
Users that are interested in prose-benchmarks are comparing it to the libraries listed below
Sorting:
- NaturalCodeBench (Findings of ACL 2024)☆65Updated 8 months ago
- ☆47Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆58Updated last year
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆61Updated 8 months ago
- ☆26Updated last week
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆48Updated last year
- xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval☆83Updated 9 months ago
- Training and Benchmarking LLMs for Code Preference.☆33Updated 7 months ago
- ☆76Updated 3 months ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆39Updated last month
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week
- ☆41Updated last year
- ☆1Updated 9 months ago
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆65Updated 10 months ago
- ☆45Updated this week
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆58Updated 2 years ago
- CodeUltraFeedback: aligning large language models to coding preferences☆71Updated last year
- ☆35Updated last year
- ☆27Updated 5 months ago
- Semantic Code Search☆35Updated 2 years ago
- We introduce FixEval , a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…☆23Updated 2 years ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆50Updated 3 weeks ago
- evol augment any dataset online☆59Updated last year
- ☆34Updated last week
- ☆31Updated last week
- code for "Natural Language to Code Translation with Execution"☆41Updated 2 years ago
- Code for "RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆22Updated 3 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- ☆20Updated 2 months ago
- ☆35Updated last year