voidism / EARLinks

Code for the ACL 2023 long paper - Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering

☆37

Alternatives and similar repositories for EAR

Users that are interested in EAR are comparing it to the libraries listed below

Sorting:

microsoft / HaDes
Token-level Reference-free Hallucination Detection
☆94Updated last year
xu1998hz / InstructScore_SEScore3
First explanation metric (diagnostic report) for text generation evaluation
☆62Updated 4 months ago
yixinL7 / SumLLM
Repo for "On Learning to Summarize with Large Language Models as References"
☆44Updated 2 years ago
wzhouad / context-faithful-llm
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆40Updated 2 years ago
Alsace08 / SumCoT
[ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"
☆54Updated last year
yumeng5 / SuperGen
[NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
☆67Updated 2 years ago
sunlab-osu / Understanding-CoT
☆87Updated 2 years ago
yuzhaouoe / pretraining-data-packing
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆22Updated 10 months ago
McGill-NLP / instruct-qa
Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"
☆86Updated 11 months ago
csitfun / LogiCoT
the instructions and demonstrations for building a formal logical reasoning capable GLM
☆53Updated 10 months ago
ielab / PromptReps
Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval
☆50Updated 3 weeks ago
sean0042 / Open_WikiTable
Open-WikiTable :Dataset for Open Domain Question Answering with Complex Reasoning over Table
☆24Updated 2 years ago
wenhuchen / Time-Sensitive-QA
Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"
☆72Updated 3 years ago
BeastyZ / LLM-Verified-Retrieval
Repo for Llatrieval
☆30Updated 10 months ago
XinyuanLu00 / SciTab
The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"
☆22Updated last year
jzbjyb / ReAtt
Retrieval as Attention
☆83Updated 2 years ago
OSU-NLP-Group / AttrScore
Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"
☆56Updated 2 years ago
WENGSYX / Self-Verification
We have released the code and demo program required for LLM with self-verification
☆60Updated last year
google-research-datasets / GSM-IC
Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…
☆60Updated 2 years ago
littlehacker26 / Discriminator-Cooperative-Unlikelihood-Prompt-Tuning
The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…
☆26Updated last year
Betswish / MIRAGE
Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/
☆24Updated 4 months ago
DAMO-NLP-SG / TempReason
☆33Updated last year
DAMO-NLP-SG / contrastive-cot
Contrastive Chain-of-Thought Prompting
☆64Updated last year
Yushi-Hu / IC-DST
Code base of In-Context Learning for Dialogue State tracking
☆45Updated last year
Nanami18 / Snowballed_Hallucination
☆44Updated 10 months ago
violet-zct / fairseq-detect-hallucination
Detect hallucinated tokens for conditional sequence generation.
☆64Updated 3 years ago
eladsegal / strategyqa
The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".
☆75Updated 2 years ago
luohongyin / UniLC
Interpretable unified language safety checking with large language models
☆31Updated 2 years ago
matt-seb-ho / WikiWhy
WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…
☆47Updated last year
DevSinghSachan / unsupervised-passage-reranking
Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"
☆101Updated 2 years ago