Betswish / MIRAGE
Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/
☆22Updated last month
Alternatives and similar repositories for MIRAGE:
Users that are interested in MIRAGE are comparing it to the libraries listed below
- ☆44Updated last year
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆58Updated last year
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆39Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆37Updated 2 years ago
- ☆44Updated 7 months ago
- ☆34Updated 3 years ago
- Token-level Reference-free Hallucination Detection☆94Updated last year
- ☆48Updated 2 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆78Updated last year
- ☆47Updated last year
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆13Updated last year
- Supporting code for ReCEval paper☆28Updated 7 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆20Updated 7 months ago
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆52Updated 8 months ago
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆16Updated 3 years ago
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆15Updated 2 years ago
- ☆82Updated 2 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- This repository contains data, code and models for contextual noncompliance.☆21Updated 8 months ago
- First explanation metric (diagnostic report) for text generation evaluation☆61Updated last month
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 2 years ago
- ☆21Updated 2 years ago
- Evaluate the Quality of Critique☆34Updated 10 months ago
- ☆73Updated last year
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆19Updated last year
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆17Updated 2 years ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆25Updated 7 months ago
- AbstainQA, ACL 2024☆25Updated 6 months ago