HLTCHKUST / chatgpt-evaluation
This repository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity"
☆77 · Updated last year
Alternatives and similar repositories for chatgpt-evaluation:
Users who are interested in chatgpt-evaluation are comparing it to the repositories listed below.
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study ☆43 · Updated 2 years ago
- ☆116 · Updated 2 years ago
- ☆36 · Updated last year
- reStructured Pre-training ☆98 · Updated 2 years ago
- Code for Editing Factual Knowledge in Language Models ☆136 · Updated 3 years ago
- DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog ☆50 · Updated 2 years ago
- Official code for "Continual Prompt Tuning for Dialog State Tracking" (ACL 2022). ☆27 · Updated 2 years ago
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey" ☆59 · Updated last year
- This project maintains a reading list for general text generation tasks ☆65 · Updated 3 years ago
- Lexically constrained text generation with CBART. ☆48 · Updated 2 years ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding ☆64 · Updated 2 years ago
- ☆66 · Updated 2 years ago
- Official Code for "PPT: Pre-trained Prompt Tuning for Few-shot Learning". ACL 2022 ☆108 · Updated 2 years ago
- ☆78 · Updated 2 years ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation ☆41 · Updated 2 years ago
- GIFT (ACL 2023) & MPC-BERT (ACL 2021) for Multi-Party Conversation Understanding ☆41 · Updated last year
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning" ☆57 · Updated 2 years ago
- ☆39 · Updated last year
- ☆61 · Updated 2 years ago
- Interpretable unified language safety checking with large language models ☆30 · Updated last year
- Official repository of the AAAI'2022 paper "GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning… ☆107 · Updated 2 years ago
- Codes for our ACL21 paper: Language Model as an Annotator: Exploring DialoGPT for Dialogue Summarization ☆94 · Updated 3 years ago
- Benchmark for evaluating open-ended generation ☆48 · Updated 4 months ago
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models ☆42 · Updated 3 years ago
- An Interpretable Neuro-Symbolic Framework for Task-Oriented Dialogue Generation ☆23 · Updated 3 years ago
- Source code for paper on commonsense reasoning for 2020 Annual Conference of the Association for Computational Linguistics (ACL) 2020. ☆28 · Updated 7 months ago
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning" ☆34 · Updated 2 years ago
- Code base of In-Context Learning for Dialogue State tracking ☆45 · Updated last year
- ACL'23: Unified Demonstration Retriever for In-Context Learning ☆36 · Updated last year
- ☆26 · Updated 2 years ago