Source Code of Paper "GPTScore: Evaluate as You Desire"
☆259 · Feb 21, 2023 · Updated 3 years ago
Alternatives and similar repositories for GPTScore
Users that are interested in GPTScore are comparing it to the libraries listed below.
- Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment" ☆416 · Feb 4, 2024 · Updated 2 years ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study ☆43 · Mar 8, 2023 · Updated 3 years ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation ☆216 · Feb 10, 2024 · Updated 2 years ago
- ☆40 · Jun 7, 2023 · Updated 2 years ago
- A benchmark dataset for evaluating dialog system and natural language generation metrics. ☆39 · Jun 13, 2022 · Updated 3 years ago
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets ☆217 · Dec 24, 2023 · Updated 2 years ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic… ☆425 · Apr 13, 2025 · Updated 11 months ago
- ☆62 · Oct 30, 2022 · Updated 3 years ago
- BARTScore: Evaluating Generated Text as Text Generation ☆368 · Jun 27, 2022 · Updated 3 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes. ☆34 · Jul 25, 2023 · Updated 2 years ago
- Code for the ICLR 2019 paper "Learning to Represent Edits" ☆13 · Dec 8, 2022 · Updated 3 years ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models ☆608 · Jun 26, 2024 · Updated last year
- ☆144 · Sep 10, 2023 · Updated 2 years ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation. ☆157 · Mar 11, 2024 · Updated 2 years ago
- Codebase, data and models for the SummaC paper in TACL