jszheng21 / RACE
RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.
☆10 Updated 4 months ago
Alternatives and similar repositories for RACE:
Users who are interested in RACE are comparing it to the repositories listed below:
- ☆28 Updated last month
- ☆14 Updated last year
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24) ☆22 Updated 3 months ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models ☆19 Updated last year
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)" ☆34 Updated last month
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts ☆37 Updated 4 months ago
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?" ☆30 Updated last year
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433 ☆21 Updated 2 months ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning ☆36 Updated last year
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆30 Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment ☆14 Updated 2 months ago
- [EMNLP 2024] A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models. ☆16 Updated 4 months ago
- Towards Systematic Measurement for Long Text Quality ☆31 Updated 5 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆58 Updated 11 months ago
- ☆13 Updated last year
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023) ☆25 Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆44 Updated last month
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023) ☆13 Updated last year
- ☆40 Updated last year
- L-CITEEVAL: DO LONG-CONTEXT MODELS TRULY LEVERAGE CONTEXT FOR RESPONDING? ☆23 Updated 4 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆39 Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems. ☆55 Updated 7 months ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs" ☆9 Updated 2 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH ☆16 Updated last month
- The official code and dataset for EMNLP 2022 paper "COPEN: Probing Conceptual Knowledge in Pre-trained Language Models". ☆19 Updated last year
- Code for embedding and retrieval research. ☆16 Updated last year
- Repo for paper: Controllable Text Generation with Language Constraints ☆19 Updated last year
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following ☆15 Updated 3 months ago