jinlanfu / GPTScore
Source Code of Paper "GPTScore: Evaluate as You Desire"
☆257 Updated 2 years ago
Alternatives and similar repositories for GPTScore
Users who are interested in GPTScore are comparing it to the libraries listed below.
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation ☆213 Updated last year
- ☆189 Updated 4 months ago
- ☆293 Updated last year
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic… ☆398 Updated 7 months ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627 ☆498 Updated last year
- ☆176 Updated last year
- Scaling Sentence Embeddings with Large Language Models ☆110 Updated last year
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning" ☆167 Updated 4 years ago
- ☆141 Updated 2 years ago
- BARTScore: Evaluating Generated Text as Text Generation ☆360 Updated 3 years ago
- contrastive decoding ☆204 Updated 3 years ago
- RARR: Researching and Revising What Language Models Say, Using Language Models ☆49 Updated 2 years ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation. ☆140 Updated last year
- ☆280 Updated 10 months ago
- Prod Env ☆433 Updated 2 years ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions ☆117 Updated last year
- ☆351 Updated 4 years ago
- ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models ☆47 Updated last year
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models. ☆523 Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆60 Updated last year
- Multilingual Large Language Models Evaluation Benchmark ☆133 Updated last year
- An original implementation of "MetaICL: Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi ☆271 Updated 2 years ago
- A Survey of Attributions for Large Language Models ☆218 Updated last year
- ☆99 Updated 2 years ago
- Do Large Language Models Know What They Don’t Know? ☆101 Updated last year
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆77 Updated last year
- Accompanying repo for the RLPrompt paper ☆357 Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)" ☆100 Updated 2 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation ☆153 Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation ☆62 Updated 8 months ago