This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.
☆42 · Dec 15, 2023 · Updated 2 years ago
Alternatives and similar repositories for wice
Users that are interested in wice are comparing it to the libraries listed below.
- Code and dataset for the paper: Generating Literal and Implied Subquestions to Fact-check Complex Claims ☆28 · May 30, 2023 · Updated 3 years ago
- ☆46 · May 26, 2023 · Updated 3 years ago
- Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction" ☆17 · Aug 14, 2023 · Updated 2 years ago
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023) ☆28 · Mar 26, 2024 · Updated 2 years ago
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆139 · Mar 14, 2024 · Updated 2 years ago
- Toolkit for building prompt templates for language models ☆12 · Sep 30, 2022 · Updated 3 years ago
- A zero-shot faithfulness evaluation metric for text summarization ☆11 · Oct 17, 2023 · Updated 2 years ago
- ☆19 · Oct 2, 2023 · Updated 2 years ago
- Dataset and model in the paper "SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation" ☆13 · Feb 14, 2022 · Updated 4 years ago
- Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models" ☆19 · Oct 3, 2024 · Updated last year
- XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation… ☆10 · Nov 4, 2022 · Updated 3 years ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding" ☆14 · Sep 8, 2022 · Updated 3 years ago
- Masking tokens to modify the predictions of a pretrained sentence classifier ☆16 · Feb 4, 2020 · Updated 6 years ago
- ☆22 · Jun 1, 2023 · Updated 3 years ago
- Implementation of the paper "Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners" ☆20 · Aug 17, 2023 · Updated 2 years ago
- Public repo for ESTER dataset and modeling (EMNLP'21) ☆20 · Feb 2, 2022 · Updated 4 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h… ☆84 · Nov 26, 2020 · Updated 5 years ago
- ☆12 · Feb 27, 2022 · Updated 4 years ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic… ☆442 · Apr 13, 2025 · Updated last year
- ☆11 · Sep 24, 2024 · Updated last year
- ☆20 · Feb 17, 2024 · Updated 2 years ago
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations… ☆21 · Dec 21, 2022 · Updated 3 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes. ☆34 · Jul 25, 2023 · Updated 2 years ago
- ☆22 · Dec 13, 2023 · Updated 2 years ago
- ☆24 · May 31, 2024 · Updated 2 years ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000… ☆49 · Dec 7, 2023 · Updated 2 years ago
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020) ☆48 · Jun 12, 2023 · Updated 3 years ago
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables" ☆23 · Dec 21, 2023 · Updated 2 years ago
- Repro is a library for easily running code from published papers via Docker. ☆42 · Sep 22, 2023 · Updated 2 years ago
- ☆69 · May 1, 2025 · Updated last year
- Code and model checkpoints for the MultiVerS model for scientific claim verification. ☆54 · Aug 16, 2023 · Updated 2 years ago
- FRANK: Factuality Evaluation Benchmark ☆59 · Dec 13, 2022 · Updated 3 years ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024] ☆212 · Aug 27, 2025 · Updated 9 months ago
- Official repository of FactKG ☆67 · Apr 22, 2025 · Updated last year
- ☆19 · Nov 30, 2020 · Updated 5 years ago
- ☆22 · Feb 1, 2024 · Updated 2 years ago
- Learning and research after DeepSeek-R1, around test-time computing, resurgence of RL, and new LLM learning/application paradigms. ☆23 · Apr 23, 2026 · Updated last month
- Repository of data and code to use the models described in the paper "Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia… ☆11 · Nov 21, 2022 · Updated 3 years ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization ☆32 · Jan 7, 2026 · Updated 5 months ago