JiaQiSJTU / FaithEval-FFLM
A zero-shot faithfulness evaluation metric for text summarization
☆11Updated last year
Alternatives and similar repositories for FaithEval-FFLM
Users who are interested in FaithEval-FFLM are comparing it to the repositories listed below.
- ☆55Updated 10 months ago
- ☆37Updated last year
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Updated 3 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆59Updated last year
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 4 months ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆23Updated 10 months ago
- ☆10Updated 5 months ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Updated 2 years ago
- ☆42Updated last year
- 🩺 A collection of ChatGPT evaluation reports on various benchmarks.☆49Updated 2 years ago
- ☆16Updated 3 years ago
- ☆17Updated 4 months ago
- ☆75Updated 6 months ago
- ☆33Updated 2 years ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆46Updated 2 months ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆39Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆66Updated last year
- Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts☆17Updated 10 months ago
- ☆21Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆34Updated 11 months ago
- LongMIT: Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets☆36Updated 9 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆62Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆81Updated last year
- ☆32Updated last year
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆178Updated last year
- EMNLP2022 "Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment"☆18Updated 2 years ago
- Personality Alignment of Language Models☆37Updated 2 weeks ago
- ☆15Updated 2 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆33Updated last year
- ☆64Updated 2 years ago