HLTCHKUST / chatgpt-evaluation
This respository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity"
☆77Updated last year
Alternatives and similar repositories for chatgpt-evaluation:
Users that are interested in chatgpt-evaluation are comparing it to the libraries listed below
- [EMNLP 2023] C-STS: Conditional Semantic Textual Similarity☆70Updated 9 months ago
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"☆59Updated last year
- Lexically constrained text generation with CBART.☆48Updated 2 years ago
- ☆61Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated last week
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers☆36Updated 2 years ago
- This project maintains a reading list for general text generation tasks☆65Updated 3 years ago
- ☆78Updated 2 years ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Updated 2 years ago
- ⚡Research papers about leveraging the capabilities of language models⚡☆52Updated last year
- Source code for ACL 2022 Paper "Prompt-based Data Augmentation for Low-Resource NLU Tasks"☆68Updated last year
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆99Updated last year
- ☆35Updated last year
- Code for Aesop: Paraphrase Generation with Adaptive Syntactic Control (EMNLP 2021)☆27Updated 3 years ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆58Updated last year
- ☆173Updated 7 months ago
- reStructured Pre-training☆98Updated 2 years ago
- Code for Editing Factual Knowledge in Language Models☆136Updated 3 years ago
- ☆116Updated 2 years ago
- ☆39Updated last year
- ☆26Updated 2 years ago
- ConTextual Mask Auto-Encoder for Dense Passage Retrieval☆35Updated 4 months ago
- Code and data for the FACTOR paper☆44Updated last year
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding☆64Updated 2 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation☆151Updated last year
- ☆82Updated last year
- Implementation of ICLR 2022 paper "Enhancing Cross-lingual Transfer by Manifold Mixup".☆21Updated 2 years ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings☆55Updated 9 months ago
- EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization☆35Updated last year
- Official code for "Continual Prompt Tuning for Dialog State Tracking" (ACL 2022).☆27Updated 2 years ago