ssu-humane / HerOLinks
The code for HerO: a fact-checking pipeline based on open LLMs (the runner-up in AVeriTeC)
☆12Updated 10 months ago
Alternatives and similar repositories for HerO
Users that are interested in HerO are comparing it to the libraries listed below
Sorting:
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆258Updated 2 years ago
- Code and data for Marked Personas (ACL 2023)☆28Updated 2 years ago
- ☆71Updated last year
- Awesome LLM for NLG Evaluation Papers☆25Updated 2 years ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆413Updated 9 months ago
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆44Updated 5 months ago
- Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"☆404Updated last year
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆33Updated last year
- Codebase, data and models for the SummaC paper in TACL☆108Updated last year
- ☆89Updated last year
- BARTScore: Evaluating Generated Text as Text Generation☆366Updated 3 years ago
- Multilingual Large Language Models Evaluation Benchmark☆133Updated last year
- Automatically Update NLP Papers Daily using Github Actions (ref: https://github.com/Vincentqyw/cv-arxiv-daily)☆104Updated this week
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"☆33Updated 9 months ago
- ☆188Updated 6 months ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆214Updated last year
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆153Updated 5 months ago
- Tools for checking ACL paper submissions☆891Updated last month
- This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linked…☆169Updated last year
- SemEval2024-task8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection☆80Updated last year
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]☆16Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆95Updated last year
- Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild☆13Updated 2 years ago
- Simple replication of DPR (Dense Passage Retrieval)☆54Updated 2 years ago
- Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New …☆59Updated last year
- ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models☆47Updated 2 years ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆128Updated last year
- Code and data for Koo et al's ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"☆21Updated last year
- ☆158Updated 2 years ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆85Updated 4 years ago