ssu-humane / HerOLinks
The code for HerO: a fact-checking pipeline based on open LLMs (the runner-up in AVeriTeC)
☆10Updated 4 months ago
Alternatives and similar repositories for HerO
Users that are interested in HerO are comparing it to the libraries listed below
Sorting:
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆252Updated 2 years ago
- Tools for checking ACL paper submissions☆764Updated 2 months ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆363Updated 3 months ago
- ☆75Updated 6 months ago
- Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New …☆52Updated last year
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"☆31Updated 3 months ago
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆30Updated 7 months ago
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]☆16Updated last year
- BARTScore: Evaluating Generated Text as Text Generation☆353Updated 3 years ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆139Updated 7 months ago
- ☆60Updated 7 months ago
- Multilingual Large Language Models Evaluation Benchmark☆127Updated 11 months ago
- Code and data for Koo et al's ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"☆20Updated last year
- Code and data for Marked Personas (ACL 2023)☆26Updated 2 years ago
- Awesome LLM for NLG Evaluation Papers☆24Updated last year
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆79Updated 4 years ago
- ☆242Updated last year
- Codebase, data and models for the SummaC paper in TACL☆97Updated 5 months ago
- ☆182Updated 2 weeks ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆122Updated last year
- This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linked…☆163Updated last year
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆488Updated last year
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆203Updated last year
- ☆140Updated last year
- Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"☆363Updated last year
- Fusion-in-Decoder☆574Updated last year
- SemEval2024-task8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection☆76Updated last year
- Repository for the Bias Benchmark for QA dataset.☆123Updated last year
- [ACL 2024] Unveiling Linguistic Regions in Large Language Models☆31Updated last year
- EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural langu…☆109Updated last year