ssu-humane / HerOLinks
The code for HerO: a fact-checking pipeline based on open LLMs (the runner-up in AVeriTeC)
☆10Updated 3 months ago
Alternatives and similar repositories for HerO
Users that are interested in HerO are comparing it to the libraries listed below
Sorting:
- ☆59Updated 7 months ago
- ☆75Updated 6 months ago
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆252Updated 2 years ago
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]☆16Updated last year
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆30Updated 7 months ago
- ☆179Updated 2 weeks ago
- Codebase, data and models for the SummaC paper in TACL☆96Updated 4 months ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆101Updated 2 years ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆202Updated last year
- Awesome LLM for NLG Evaluation Papers☆24Updated last year
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆167Updated 3 years ago
- Code and data for Marked Personas (ACL 2023)☆26Updated 2 years ago
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆34Updated last month
- Multilingual Large Language Models Evaluation Benchmark☆124Updated 10 months ago
- ☆26Updated 2 years ago
- This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linked…☆162Updated 11 months ago
- Repository for the Bias Benchmark for QA dataset.☆118Updated last year
- Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New …☆52Updated last year
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Updated 2 years ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆80Updated 4 years ago
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"☆30Updated 2 months ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆120Updated last year
- ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models☆47Updated last year
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆15Updated last year
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"☆126Updated last year
- ☆19Updated last year
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆138Updated 6 months ago
- BARTScore: Evaluating Generated Text as Text Generation☆352Updated 3 years ago
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model☆13Updated last year
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da…☆13Updated 3 months ago