HLTCHKUST / chatgpt-evaluation
This repository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity".
☆77 Updated 11 months ago
Related projects
Alternatives and complementary repositories for chatgpt-evaluation
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey" ☆61 Updated last year
- TBC ☆26 Updated 2 years ago
- ☆36 Updated 7 months ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study ☆42 Updated last year
- This project maintains a reading list for general text generation tasks ☆65 Updated 2 years ago
- ☆46 Updated last month
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆54 Updated 10 months ago
- Code for "Small Models are Valuable Plug-ins for Large Language Models" ☆122 Updated last year
- domain adaptation in NLP ☆52 Updated 3 years ago
- ☆59 Updated last year
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation ☆150 Updated last year
- [EMNLP 2023] C-STS: Conditional Semantic Textual Similarity ☆66 Updated 5 months ago
- ☆80 Updated 2 years ago
- ☆83 Updated last year
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models ☆41 Updated 2 years ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”. ☆92 Updated last year
- ☆43 Updated 7 months ago
- EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443 ☆81 Updated 2 months ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation ☆38 Updated 2 years ago
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning" ☆34 Updated last year
- On Transferability of Prompt Tuning for Natural Language Processing ☆97 Updated 6 months ago
- Lexically constrained text generation with CBART. ☆47 Updated 2 years ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆37 Updated 11 months ago
- Code for ACL2023 paper: Pre-Training to Learn in Context ☆107 Updated 3 months ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding ☆62 Updated 2 years ago
- ☆80 Updated last year
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers ☆35 Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation ☆61 Updated 4 months ago
- ☆167 Updated 3 months ago
- Do Large Language Models Know What They Don’t Know? ☆85 Updated last week