CLARIN-PL / chatgpt-evaluation-01-2023
Code, datasets, and results of the ChatGPT evaluation presented in the paper "ChatGPT: Jack of all trades, master of none"
☆29 · Updated last year
Related projects
Alternatives and complementary repositories for chatgpt-evaluation-01-2023
- TBC ☆26 · Updated 2 years ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study ☆42 · Updated last year
- ☆55 · Updated last year
- A benchmark dataset for evaluating dialog system and natural language generation metrics. ☆35 · Updated 2 years ago
- Influence Experiments ☆35 · Updated last year
- Code for the ACL2022 paper "Synthetic Question Value Estimation for Domain Adaptation of Question Answering" ☆16 · Updated 2 years ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond" ☆59 · Updated 7 months ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models" ☆52 · Updated last year
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding ☆62 · Updated 2 years ago
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning" ☆34 · Updated last year