nelson-liu/evaluating-verifiability-in-generative-search-engines

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nelson-liu/evaluating-verifiability-in-generative-search-engines)

nelson-liu / evaluating-verifiability-in-generative-search-engines

Companion repo for "Evaluating Verifiability in Generative Search Engines".

☆87

Alternatives and similar repositories for evaluating-verifiability-in-generative-search-engines

Users that are interested in evaluating-verifiability-in-generative-search-engines are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

utcsnlp / lfqa_discourse
View on GitHub
A repository for ACL 2022 paper "How do we answer complex questions: Discourse structure of long form answers"
☆19May 31, 2025Updated last year
tingofurro / summac
View on GitHub
Codebase, data and models for the SummaC paper in TACL
☆110Jan 30, 2025Updated last year
Alibaba-NLP / HLATR
View on GitHub
Hybrid List Aware Transformer Reranking
☆19Oct 25, 2022Updated 3 years ago
FreedomIntelligence / DPTDR
View on GitHub
Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval
☆26Aug 7, 2023Updated 2 years ago
martiansideofthemoon / longeval-summarization
View on GitHub
Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…
☆45Aug 10, 2024Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
BeastyZ / LLM-Verified-Retrieval
View on GitHub
Repo for Llatrieval
☆32Aug 21, 2024Updated last year
yuhongqian / ANCE-PRF
View on GitHub
☆12May 17, 2022Updated 4 years ago
da03 / criticize_text_generation
View on GitHub
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12Mar 18, 2023Updated 3 years ago
nelson-liu / website
View on GitHub
☆13Feb 5, 2022Updated 4 years ago
princeton-nlp / ALCE
View on GitHub
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
☆523Oct 9, 2024Updated last year
lemurproject / ClueWeb22
View on GitHub
☆17Dec 11, 2024Updated last year
NEUIR / ConAE
View on GitHub
[EMNLP 2022] This is the code repo for our EMNLP‘22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"…
☆13Oct 20, 2022Updated 3 years ago
arian-askari / ChatGPT-RetrievalQA-CIKM2023
View on GitHub
A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on…
☆141Jan 15, 2024Updated 2 years ago
cdegroc / warc-clueweb
View on GitHub
Python library for reading ClueWeb09's warc files
☆21Sep 6, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AlexWan0 / rag-convincingness
View on GitHub
☆29Feb 26, 2024Updated 2 years ago
danielsc / dogbreeds
View on GitHub
☆21Feb 19, 2019Updated 7 years ago
shmsw25 / FActScore
View on GitHub
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…
☆452Apr 13, 2025Updated last year
danieldeutsch / qaeval
View on GitHub
☆15Aug 3, 2021Updated 4 years ago
lovodkin93 / attribute-first-then-generate
View on GitHub
Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024
☆30Dec 19, 2024Updated last year
project-miracl / hagrid
View on GitHub
A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution
☆36Aug 2, 2023Updated 2 years ago
thunlp / ReInfoSelect
View on GitHub
☆36Jun 12, 2023Updated 3 years ago
henryzhao5852 / BeamDR
View on GitHub
☆15Oct 10, 2021Updated 4 years ago
OSU-NLP-Group / AttrScore
View on GitHub
Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"
☆56Jul 3, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
DaoD / DCL
View on GitHub
From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking
☆14Oct 25, 2022Updated 3 years ago
amazon-science / tofueval
View on GitHub
☆32May 10, 2024Updated 2 years ago
RUCAIBox / HaluAgent
View on GitHub
☆23Jul 1, 2024Updated 2 years ago
anthonywchen / RARR
View on GitHub
RARR: Researching and Revising What Language Models Say, Using Language Models
☆54Jun 22, 2023Updated 3 years ago
yale-nlp / ODSum
View on GitHub
Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"
☆11Sep 20, 2024Updated last year
NEUIR / P3Ranker
View on GitHub
[SIGIR '22] Code for our SIGIR 2022 accepted paper : P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Pr…
☆18Sep 24, 2023Updated 2 years ago
LeeSureman / MoT
View on GitHub
code for Preprint paper at Arxiv: MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts
☆24Nov 29, 2023Updated 2 years ago
krishnap25 / mauve
View on GitHub
Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
☆315Jul 12, 2024Updated 2 years ago
wellecks / naturalprover
View on GitHub
NaturalProver: Grounded Mathematical Proof Generation with Language Models
☆40Mar 24, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
amazon-science / irgr
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
castorini / TREC-COVID
View on GitHub
TREC-COVID results - this is a mirror of data on the TREC website in a more convenient format.
☆15Aug 31, 2020Updated 5 years ago
ZeweiChu / DiscoEval
View on GitHub
EMNLP DiscoEval paper
☆43Nov 12, 2019Updated 6 years ago
iai-group / UserSimCRS
View on GitHub
Conversational Recommender System Evaluation via Simulation
☆22Jul 21, 2026Updated last week
sebastian-hofstaetter / matchmaker
View on GitHub
Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch
☆265Jan 27, 2023Updated 3 years ago
thunlp / ConvDR
View on GitHub
Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"
☆43Dec 9, 2021Updated 4 years ago
najoungkim / COGS
View on GitHub
☆63Sep 13, 2022Updated 3 years ago