carriex / lfqa_eval
ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"
☆21 Updated last year
Alternatives and similar repositories for lfqa_eval
Users who are interested in lfqa_eval are comparing it to the repositories listed below.
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)" ☆62 Updated 3 years ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/ ☆26 Updated 10 months ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆42 Updated 2 years ago
- Code for Editing Factual Knowledge in Language Models ☆142 Updated 4 years ago
- ☆187 Updated 7 months ago
- ☆84 Updated last week
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)" ☆100 Updated 3 years ago
- Companion repo for "Evaluating Verifiability in Generative Search Engines". ☆85 Updated 2 years ago
- Codebase, data and models for the SummaC paper in TACL ☆108 Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆86 Updated last year
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he… ☆27 Updated 6 months ago
- code associated with ACL 2021 DExperts paper ☆118 Updated 2 years ago
- ☆50 Updated 2 years ago
- ☆48 Updated 2 years ago
- Train Dense Passage Retriever (DPR) with a single GPU ☆136 Updated 4 years ago
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions" ☆75 Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆85 Updated 3 years ago
- ☆83 Updated 2 years ago
- Entity-Based Knowledge Conflicts in Question Answering. Code repo for EMNLP2021 paper: https://aclanthology.org/2021.emnlp-main.565/ ☆76 Updated 3 years ago
- ☆177 Updated last year
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation". ☆83 Updated 3 weeks ago
- ☆50 Updated 3 years ago
- Token-level Reference-free Hallucination Detection ☆98 Updated 2 years ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an… ☆285 Updated 3 years ago
- FRANK: Factuality Evaluation Benchmark ☆59 Updated 3 years ago
- Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835) ☆113 Updated 3 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes. ☆34 Updated 2 years ago
- ☆35 Updated 4 years ago
- ☆39 Updated 2 years ago
- ☆103 Updated last year