JiaQiSJTU / FaithEval-FFLM
A zero-shot faithfulness evaluation metric for text summarization
☆11Updated last year
Alternatives and similar repositories for FaithEval-FFLM:
Users that are interested in FaithEval-FFLM are comparing it to the libraries listed below
- ☆9Updated 4 months ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆11Updated last year
- ☆52Updated 5 months ago
- ☆13Updated 3 weeks ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆39Updated last year
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆36Updated 4 months ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆23Updated 4 months ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆12Updated 2 years ago
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…☆25Updated last year
- ☆33Updated 2 years ago
- GPT as Human☆18Updated last month
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆48Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆31Updated 5 months ago
- First explanation metric (diagnostic report) for text generation evaluation☆63Updated 6 months ago
- Constrained Decoding Project☆18Updated last year
- Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. T…☆30Updated 2 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Updated last year
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Updated last year
- ☆57Updated last month
- ☆34Updated last year
- Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)☆13Updated 6 months ago
- Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction"☆17Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆77Updated 11 months ago
- ☆17Updated last year
- Visual and Embodied Concepts evaluation benchmark☆21Updated last year
- Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022☆14Updated last year
- ☆17Updated 2 years ago
- ☆13Updated 2 years ago
- Methods and evaluation for aligning language models temporally☆27Updated 10 months ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆14Updated 2 years ago