ssu-humane / HerO
The code for HerO: a fact-checking pipeline based on open LLMs (the runner-up in AVeriTeC)
☆10Updated last week
Alternatives and similar repositories for HerO:
Users that are interested in HerO are comparing it to the libraries listed below
- ☆57Updated 4 months ago
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆28Updated 4 months ago
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]☆15Updated 11 months ago
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"☆30Updated 5 months ago
- ☆128Updated last year
- Code and data for Marked Personas (ACL 2023)☆23Updated last year
- Codebase, data and models for the SummaC paper in TACL☆89Updated 2 months ago
- Awesome LLM for NLG Evaluation Papers☆23Updated last year
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆28Updated 3 months ago
- BARTScore: Evaluating Generated Text as Text Generation☆345Updated 2 years ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆198Updated last year
- ☆68Updated 3 months ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆77Updated 4 years ago
- Code and data of the EMNLP2023 paper "Detection of Multiple Mental Disorders from Social Media with Two-Stream Psychiatric Experts"☆8Updated last year
- Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023☆23Updated last year
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models☆80Updated 6 months ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆134Updated 3 months ago
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆242Updated 2 years ago
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Updated last year
- ☆16Updated last month
- Code and data for Koo et al's ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"☆19Updated last year
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"☆62Updated 2 years ago
- [ACL 2024] FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model☆14Updated 7 months ago
- 🌲 Code for our EMNLP 2023 paper - 🎄 "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Mode…☆48Updated last year
- ☆25Updated 2 years ago
- Official Code for EMNLP 2023 paper: "Unveiling the Implicit Toxicity in Large Language Models""☆11Updated last year
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆331Updated 10 months ago
- EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural langu…☆105Updated 10 months ago
- Multilingual Large Language Models Evaluation Benchmark☆119Updated 7 months ago
- Repository for the Bias Benchmark for QA dataset.☆105Updated last year