Companion repo for "Evaluating Verifiability in Generative Search Engines".
☆85May 12, 2023Updated 2 years ago
Alternatives and similar repositories for evaluating-verifiability-in-generative-search-engines
Users that are interested in evaluating-verifiability-in-generative-search-engines are comparing it to the libraries listed below
Sorting:
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆137Mar 14, 2024Updated last year
- Repo for Llatrieval☆31Aug 21, 2024Updated last year
- Codebase, data and models for the SummaC paper in TACL☆109Jan 30, 2025Updated last year
- ☆21Feb 19, 2019Updated 7 years ago
- A repository for ACL 2022 paper "How do we answer complex questions: Discourse structure of long form answers"☆19May 31, 2025Updated 9 months ago
- Hybrid List Aware Transformer Reranking☆19Oct 25, 2022Updated 3 years ago
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval☆26Aug 7, 2023Updated 2 years ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆511Oct 9, 2024Updated last year
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Aug 10, 2024Updated last year
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- Image recommendation service with image on the input that outputs most similar images from database.☆14Sep 19, 2020Updated 5 years ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆417Apr 13, 2025Updated 10 months ago
- ☆32May 10, 2024Updated last year
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- ☆12Jul 6, 2023Updated 2 years ago
- Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.☆308Jul 12, 2024Updated last year
- Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024☆29Dec 19, 2024Updated last year
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Jul 3, 2023Updated 2 years ago
- From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking☆14Oct 25, 2022Updated 3 years ago
- RARR: Researching and Revising What Language Models Say, Using Language Models☆53Jun 22, 2023Updated 2 years ago
- Conversational Recommender System Evaluation via Simulation☆19Updated this week
- ☆17Dec 11, 2024Updated last year
- Code and Data for "Language Modeling with Editable External Knowledge"☆36Jun 19, 2024Updated last year
- ☆32Mar 31, 2020Updated 5 years ago
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution☆35Aug 2, 2023Updated 2 years ago
- ☆13Feb 5, 2022Updated 4 years ago
- Repository for DEMETR: Diagnosing Evaluation Metrics for Translation☆17Nov 29, 2022Updated 3 years ago
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆39Mar 24, 2023Updated 2 years ago
- ☆62Sep 13, 2022Updated 3 years ago
- A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on…☆142Jan 15, 2024Updated 2 years ago
- Knowledge graph based information retrieval☆14Dec 26, 2018Updated 7 years ago
- TREC-COVID results - this is a mirror of data on the TREC website in a more convenient format.☆15Aug 31, 2020Updated 5 years ago
- All materials that accompany/are needed to reproduce ACL 2020 paper - Interpreting Pretrained Contextualized Representations via Reductio…☆19Apr 25, 2020Updated 5 years ago
- ICLR 2022 (Spolight): Continual Learning With Filter Atom Swapping☆16Jul 5, 2023Updated 2 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆18Apr 25, 2021Updated 4 years ago
- [ICML‘2024] "LoCoCo: Dropping In Convolutions for Long Context Compression", Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen☆17Sep 7, 2024Updated last year
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 3 years ago
- ☆18Aug 21, 2025Updated 6 months ago