ssu-humane / HerOLinks
The code for HerO: a fact-checking pipeline based on open LLMs (the runner-up in AVeriTeC)
☆10Updated 5 months ago
Alternatives and similar repositories for HerO
Users that are interested in HerO are comparing it to the libraries listed below
Sorting:
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆255Updated 2 years ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆376Updated 4 months ago
- Awesome LLM for NLG Evaluation Papers☆25Updated last year
- ☆83Updated 8 months ago
- Multilingual Large Language Models Evaluation Benchmark☆130Updated last year
- This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linked…☆166Updated last year
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆31Updated 9 months ago
- Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"☆373Updated last year
- ☆185Updated 2 months ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆144Updated 2 weeks ago
- ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models☆47Updated last year
- ☆62Updated 9 months ago
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]☆16Updated last year
- BARTScore: Evaluating Generated Text as Text Generation☆357Updated 3 years ago
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"☆31Updated 5 months ago
- Codebase, data and models for the SummaC paper in TACL☆99Updated 7 months ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆123Updated last year
- Code and data for Marked Personas (ACL 2023)☆28Updated 2 years ago
- Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New …☆54Updated last year
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆209Updated last year
- ConvGQR: Generative Query Reformulation for Conversational Search. A codebase for ACL 2023 accepted paper.☆32Updated last year
- ☆244Updated last year
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆196Updated 9 months ago
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆499Updated last year
- Tools for checking ACL paper submissions☆774Updated last week
- ☆45Updated last year
- Automatically Update NLP Papers Daily using Github Actions (ref: https://github.com/Vincentqyw/cv-arxiv-daily)☆100Updated this week
- ☆286Updated last year
- Simple replication of DPR (Dense Passage Retrieval)☆47Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆93Updated 10 months ago