feyzaakyurek / dune
Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.
☆23Updated 8 months ago
Alternatives and similar repositories for dune:
Users that are interested in dune are comparing it to the libraries listed below
- ☆34Updated 2 years ago
- Constrained Decoding Project☆17Updated last year
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Updated 2 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆30Updated last year
- Methods and evaluation for aligning language models temporally☆29Updated last year
- ☆44Updated last year
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Updated last month
- ☆21Updated last year
- ☆41Updated last year
- ☆17Updated last year
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆21Updated 2 years ago
- [EMNLP 2022] Summarization as Indirect Supervision for Relation Extraction (SuRE)☆28Updated 2 years ago
- ☆15Updated last year
- ☆49Updated last year
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆23Updated last month
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models☆42Updated 3 years ago
- ☆82Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆59Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆31Updated 8 months ago
- ☆42Updated last year
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆16Updated 3 years ago
- ☆16Updated 2 months ago
- ☆36Updated last year
- ☆21Updated 3 years ago
- ☆17Updated last year
- code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification☆27Updated 3 years ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models☆23Updated 9 months ago
- ☆13Updated last year
- Restore safety in fine-tuned language models through task arithmetic☆28Updated last year