ssu-humane / HerO
The code for HerO: a fact-checking pipeline based on open LLMs (the runner-up in AVeriTeC)
☆12Updated 9 months ago
Alternatives and similar repositories for HerO
Users who are interested in HerO are comparing it to the repositories listed below
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆257Updated 2 years ago
- ☆89Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆133Updated last year
- Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"☆401Updated last year
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆414Updated 8 months ago
- Code and data for Marked Personas (ACL 2023)☆28Updated 2 years ago
- Awesome LLM for NLG Evaluation Papers☆25Updated last year
- Tools for checking ACL paper submissions☆870Updated last month
- BARTScore: Evaluating Generated Text as Text Generation☆367Updated 3 years ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆153Updated 4 months ago
- Codebase, data and models for the SummaC paper in TACL☆107Updated 11 months ago
- SemEval2024-task8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection☆79Updated last year
- ☆71Updated last year
- ☆47Updated 3 months ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆214Updated last year
- Automatically Update NLP Papers Daily using Github Actions (ref: https://github.com/Vincentqyw/cv-arxiv-daily)☆103Updated this week
- This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linked…☆167Updated last year
- ☆188Updated 6 months ago
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]☆16Updated last year
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆43Updated 5 months ago
- ☆254Updated last year
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆33Updated last year
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆127Updated last year
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"☆33Updated 9 months ago
- Code and data for Koo et al's ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"☆21Updated last year
- Official Repository for "BlendX: Complex Multi-intent Detection with Blended Patterns"☆27Updated 5 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆215Updated last year
- Simple replication of DPR (Dense Passage Retrieval)☆51Updated 2 years ago
- Official repository for "Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory" accepted at EMNLP Find…☆31Updated last year
- ☆294Updated 2 years ago