HLTCHKUST / chatgpt-evaluation
This repository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity".
☆77 Updated last year
Alternatives and similar repositories for chatgpt-evaluation:
Users who are interested in chatgpt-evaluation are comparing it to the repositories listed below:
- ☆60 Updated 2 years ago
- This project maintains a reading list for general text generation tasks ☆65 Updated 3 years ago
- GIFT (ACL 2023) & MPC-BERT (ACL 2021) for Multi-Party Conversation Understanding ☆41 Updated last year
- ☆39 Updated last year
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding ☆64 Updated 2 years ago
- ☆85 Updated 2 years ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study ☆43 Updated last year
- TBC ☆26 Updated 2 years ago
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey" ☆59 Updated last year
- Official code for "Continual Prompt Tuning for Dialog State Tracking" (ACL 2022). ☆27 Updated last year
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”. ☆97 Updated last year
- ☆90 Updated 10 months ago
- First explanation metric (diagnostic report) for text generation evaluation ☆63 Updated 7 months ago
- ☆36 Updated 10 months ago
- A toolkit for evaluation of natural language generation (NLG), including BLEU, ROUGE, METEOR, and CIDEr. ☆31 Updated 4 years ago
- ☆116 Updated 2 years ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆57 Updated last year
- EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443 ☆83 Updated 5 months ago
- An Interpretable Neuro-Symbolic Framework for Task-Oriented Dialogue Generation ☆23 Updated 2 years ago
- Interpretable unified language safety checking with large language models ☆30 Updated last year
- ☆82 Updated last year
- ⚡Research papers about leveraging the capabilities of language models⚡ ☆52 Updated last year
- [EMNLP 2023] C-STS: Conditional Semantic Textual Similarity ☆69 Updated 8 months ago
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers ☆36 Updated 2 years ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings ☆55 Updated 8 months ago
- Do Large Language Models Know What They Don’t Know? ☆91 Updated 3 months ago
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification" ☆131 Updated 2 years ago
- Official Code for "PPT: Pre-trained Prompt Tuning for Few-shot Learning". ACL 2022 ☆109 Updated 2 years ago
- ☆28 Updated last year
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021). ☆28 Updated 3 years ago