Companion repo for "Evaluating Verifiability in Generative Search Engines".
☆86 · May 12, 2023 · Updated 3 years ago
Alternatives and similar repositories for evaluating-verifiability-in-generative-search-engines
Users that are interested in evaluating-verifiability-in-generative-search-engines are comparing it to the libraries listed below.
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers (☆138 · Mar 14, 2024 · Updated 2 years ago)
- A repository for ACL 2022 paper "How do we answer complex questions: Discourse structure of long form answers" (☆19 · May 31, 2025 · Updated 11 months ago)
- Hybrid List Aware Transformer Reranking (☆20 · Oct 25, 2022 · Updated 3 years ago)
- Codebase, data and models for the SummaC paper in TACL (☆109 · Jan 30, 2025 · Updated last year)
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval (☆26 · Aug 7, 2023 · Updated 2 years ago)
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… (☆45 · Aug 10, 2024 · Updated last year)
- Repo for Llatrieval (☆31 · Aug 21, 2024 · Updated last year)
- (☆12 · May 17, 2022 · Updated 4 years ago)
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based … (☆12 · Mar 18, 2023 · Updated 3 years ago)
- (☆17 · Dec 11, 2024 · Updated last year)
- (☆13 · Feb 5, 2022 · Updated 4 years ago)
- (☆29 · Feb 26, 2024 · Updated 2 years ago)
- Python library for reading ClueWeb09's warc files (☆21 · Sep 6, 2018 · Updated 7 years ago)
- A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on… (☆141 · Jan 15, 2024 · Updated 2 years ago)
- All materials that accompany/are needed to reproduce ACL 2020 paper - Interpreting Pretrained Contextualized Representations via Reductio… (☆19 · Apr 25, 2020 · Updated 6 years ago)
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic… (☆438 · Apr 13, 2025 · Updated last year)
- Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024 (☆30 · Dec 19, 2024 · Updated last year)
- [EMNLP 2022] This is the code repo for our EMNLP'22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"… (☆13 · Oct 20, 2022 · Updated 3 years ago)
- (☆15 · Aug 3, 2021 · Updated 4 years ago)
- (☆15 · Oct 10, 2021 · Updated 4 years ago)
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution (☆36 · Aug 2, 2023 · Updated 2 years ago)
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models" (☆56 · Jul 3, 2023 · Updated 2 years ago)
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization" (☆11 · Sep 20, 2024 · Updated last year)
- (☆33 · May 10, 2024 · Updated 2 years ago)
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding" (☆63 · Dec 26, 2025 · Updated 5 months ago)
- RARR: Researching and Revising What Language Models Say, Using Language Models (☆53 · Jun 22, 2023 · Updated 2 years ago)
- Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`. (☆311 · Jul 12, 2024 · Updated last year)
- [SIGIR '22] Code for our SIGIR 2022 accepted paper: P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Pr… (☆18 · Sep 24, 2023 · Updated 2 years ago)
- Code for preprint paper on arXiv: MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts (☆24 · Nov 29, 2023 · Updated 2 years ago)
- (☆12 · Jul 6, 2023 · Updated 2 years ago)
- EMNLP DiscoEval paper (☆43 · Nov 12, 2019 · Updated 6 years ago)
- (☆63 · Sep 13, 2022 · Updated 3 years ago)
- Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval" (☆43 · Dec 9, 2021 · Updated 4 years ago)
- NaturalProver: Grounded Mathematical Proof Generation with Language Models (☆39 · Mar 24, 2023 · Updated 3 years ago)
- (☆24 · Jun 28, 2023 · Updated 2 years ago)
- Dense Passage Retriever - is a set of tools and models for open domain Q&A task. (☆1,866 · Apr 6, 2023 · Updated 3 years ago)
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023) (☆28 · Mar 26, 2024 · Updated 2 years ago)
- INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions (☆16 · Jan 21, 2025 · Updated last year)
- (☆11 · Sep 18, 2017 · Updated 8 years ago)