salesforce / factualNLG

Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"

☆60

Alternatives and similar repositories for factualNLG

Users who are interested in factualNLG are comparing it to the repositories listed below.

allenai / Lila
A unified benchmark for math reasoning
☆89 Updated 2 years ago
OSU-NLP-Group / AttrScore
Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"
☆56 Updated 2 years ago
microsoft / HaDes
Token-level Reference-free Hallucination Detection
☆97 Updated 2 years ago
google-research / true
Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".
☆81 Updated last month
McGill-NLP / instruct-qa
Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"
☆86 Updated last year
yixinL7 / SumLLM
Repo for "On Learning to Summarize with Large Language Models as References"
☆43 Updated 2 years ago
yizhongw / Tk-Instruct
Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
☆182 Updated 3 years ago
allenai / DecomP
Repository for Decomposed Prompting
☆95 Updated 2 years ago
seonghyeonye / TAPP
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
☆78 Updated last year
chaitanyamalaviya / ExpertQA
[Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers
☆135 Updated last year
orhonovich / instruction-induction
☆67 Updated 3 years ago
wzhouad / context-faithful-llm
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆41 Updated 2 years ago
ryokamoi / wice
This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.
☆42 Updated last year
GXimingLu / neurologic_decoding
☆82 Updated 2 years ago
eladsegal / strategyqa
The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".
☆81 Updated 3 years ago
nelson-liu / evaluating-verifiability-in-generative-search-engines
Companion repo for "Evaluating Verifiability in Generative Search Engines".
☆86 Updated 2 years ago
nkandpa2 / long_tail_knowledge
Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"
☆78 Updated 2 years ago
princeton-nlp / TRIME
[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674
☆196 Updated 2 years ago
nicola-decao / KnowledgeEditor
Code for Editing Factual Knowledge in Language Models
☆142 Updated 3 years ago
jzbjyb / ReAtt
Retrieval as Attention
☆82 Updated 2 years ago
google-deepmind / streamingqa
☆49 Updated 2 years ago
joeljang / ELM
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆98 Updated 2 years ago
violet-zct / fairseq-detect-hallucination
Detect hallucinated tokens for conditional sequence generation.
☆64 Updated 3 years ago
oriyor / reasoning-on-cots
Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"
☆96 Updated last year
google-research / dialog-inpainting
☆97 Updated 3 years ago
archiki / ReCEval
Supporting code for ReCEval paper
☆30 Updated last year
kayoyin / interpret-lm
Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)
☆62 Updated 3 years ago
OpenMatch / COCO-DR
[EMNLP 2022] This is the code repo for our EMNLP'22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contr…
☆50 Updated 2 years ago
Leezekun / dialogic
[EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning"
☆35 Updated 2 years ago
jingtaozhan / disentangled-retriever
An easy-to-use Python toolkit for flexibly adapting various neural ranking models to a target domain.
☆60 Updated 2 years ago