krystalan / chatgpt_as_nlg_evaluatorView external linksLinks
Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study
☆43Mar 8, 2023Updated 2 years ago
Alternatives and similar repositories for chatgpt_as_nlg_evaluator
Users that are interested in chatgpt_as_nlg_evaluator are comparing it to the libraries listed below
Sorting:
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆258Feb 21, 2023Updated 2 years ago
- The implementation of the paper "Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters".☆17May 24, 2022Updated 3 years ago
- Benchmark for evaluating open-ended generation☆50Nov 6, 2024Updated last year
- Codes for paper "Stylized Story Generation with Style-Guided Planning"☆12May 9, 2021Updated 4 years ago
- AMI Meeting Parallel Corpus☆11Dec 11, 2020Updated 5 years ago
- Script for generating the rotowire-modified dataset (Iso et al; ACL 2019)☆12Sep 19, 2021Updated 4 years ago
- Resource, Evaluation and Detection Papers for ChatGPT☆456Mar 21, 2024Updated last year
- GEMBA — GPT Estimation Metric Based Assessment☆145Dec 15, 2025Updated 2 months ago
- HANNA, a large annotated dataset of Human-ANnotated NArratives for ASG evaluation.☆35Oct 15, 2024Updated last year
- Fine grained Empathy Direction Detection☆15Dec 11, 2020Updated 5 years ago
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning"☆34Feb 22, 2023Updated 2 years ago
- EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization☆36Jan 13, 2024Updated 2 years ago
- ☆10Nov 29, 2021Updated 4 years ago
- Implementation of DTMT with incremental decoding☆13Feb 20, 2021Updated 4 years ago
- Code for WikiAsp: Multi-document aspect-based summarization.☆43Dec 9, 2020Updated 5 years ago
- ☆35May 31, 2019Updated 6 years ago
- WSDM‘2022: Knowledge Enhanced Sports Game Summarization☆18Jun 16, 2022Updated 3 years ago
- ☆20Jan 15, 2024Updated 2 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Feb 15, 2024Updated 2 years ago
- ☆18May 13, 2021Updated 4 years ago
- The official code of our paper at EMNLP 2022: Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Mo…☆16Feb 17, 2023Updated 3 years ago
- Code for the ACL2022 main conference paper "A Variational Hierarchical Model for Neural Cross-Lingual Summarization"☆18Sep 5, 2022Updated 3 years ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆50Mar 28, 2023Updated 2 years ago
- Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"☆19Jun 12, 2025Updated 8 months ago
- Language Understanding Augmentation Toolkit for Robustness Testing☆20Jan 22, 2023Updated 3 years ago
- Codes for our CCL 2021 paper: Incorporating Commonsense Knowledge into Abstractive Dialogue Summarization via Heterogeneous Graph Network…☆26Jul 28, 2021Updated 4 years ago
- The official repository for our EMNLP 2024 paper, Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretab…☆21Feb 23, 2025Updated 11 months ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Jan 27, 2023Updated 3 years ago
- Code for the paper "Rule induction for global explanation of trained models"☆21Jul 25, 2024Updated last year
- [ACL 2023] The code for our ACL'23 paper Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Pr…☆24Jun 1, 2024Updated last year
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Jun 7, 2024Updated last year
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆215Feb 10, 2024Updated 2 years ago
- [ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"☆53Sep 21, 2023Updated 2 years ago
- ☆31Jan 26, 2026Updated 3 weeks ago
- codes for the IJCAI 2022 paper "Psychiatric Scale Guided Risky Post Screening for Early Detection of Depression"☆21Mar 17, 2023Updated 2 years ago
- This repository contains PyTorch implementations of the models from the paper An Empirical Study MIME: MIMicking Emotions for Empathetic …☆45Mar 14, 2023Updated 2 years ago
- Code for paper "Prompt-Based Metric Learning for Few-shot NER".☆23Nov 14, 2023Updated 2 years ago
- Multi-turn response selection using dialogue dependency relations☆24Sep 1, 2021Updated 4 years ago
- [NeurIPS 23' Oral] Emergence of Shape Bias in Convolutional Neural Networks through Activation Sparsity☆28Apr 24, 2024Updated last year