RARR: Researching and Revising What Language Models Say, Using Language Models
☆52Jun 22, 2023Updated 2 years ago
Alternatives and similar repositories for RARR
Users who are interested in RARR are comparing it to the repositories listed below
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆63Dec 25, 2023Updated 2 years ago
- Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild☆13Nov 27, 2023Updated 2 years ago
- ☆32May 10, 2024Updated last year
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Jul 3, 2023Updated 2 years ago
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆18Nov 15, 2024Updated last year
- ☆71Nov 27, 2024Updated last year
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆152Mar 11, 2024Updated last year
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆511Oct 9, 2024Updated last year
- ☆43Sep 3, 2024Updated last year
- Data and code for the EMNLP 2023 System Demo paper "QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking"☆19Dec 19, 2023Updated 2 years ago
- ☆76Feb 16, 2024Updated 2 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆23Dec 21, 2023Updated 2 years ago
- ☆19Nov 8, 2023Updated 2 years ago
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆85May 12, 2023Updated 2 years ago
- fact checking of GPT and other LLMs☆22Jul 18, 2024Updated last year
- ☆22Dec 9, 2023Updated 2 years ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆229Dec 2, 2024Updated last year
- ☆16Jun 5, 2023Updated 2 years ago
- EACL 2017☆26Apr 22, 2018Updated 7 years ago
- Review of papers I read☆14Dec 11, 2020Updated 5 years ago
- Data and Code for EMNLP 2023 paper "QTSumm: Query-Focused Summarization over Tabular Data"☆22Mar 29, 2024Updated last year
- A collection of publicly available Korean instruction datasets for training language models.☆19Jul 16, 2023Updated 2 years ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆31Aug 18, 2024Updated last year
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆417Apr 13, 2025Updated 10 months ago
- Code and dataset for the paper: Generating Literal and Implied Subquestions to Fact-check Complex Claims☆30May 30, 2023Updated 2 years ago
- Forward-Looking Active REtrieval-augmented generation (FLARE)☆667Nov 20, 2023Updated 2 years ago
- Model Selection with Large Language Models for Reasoning (EMNLP2023 Findings)☆30Dec 23, 2023Updated 2 years ago
- FacTool: Factuality Detection in Generative AI☆913Aug 19, 2024Updated last year
- The code base for paper: "ReAcTable: Enhancing ReAct for Table Question Answering"☆35Apr 28, 2024Updated last year
- code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"☆10Oct 20, 2022Updated 3 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆121Apr 23, 2022Updated 3 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Jul 25, 2023Updated 2 years ago
- Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering☆30Dec 2, 2022Updated 3 years ago
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆309May 1, 2025Updated 10 months ago
- ☆33Aug 30, 2023Updated 2 years ago
- A virtual caregiver system that extracts the expression of mental and physical health states through dialogue-based human-computer intera…☆14Jan 29, 2023Updated 3 years ago
- Token-level Reference-free Hallucination Detection☆98Jul 25, 2023Updated 2 years ago