Spico197 / awesome-lm-evaluation
π©Ί A collection of ChatGPT evaluation reports on various bechmarks.
β48Updated 2 years ago
Alternatives and similar repositories for awesome-lm-evaluation:
Users that are interested in awesome-lm-evaluation are comparing it to the libraries listed below
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Geneβ¦β26Updated last year
- PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogβ¦β27Updated 3 years ago
- β14Updated 2 years ago
- βοΈ ChatGPT as a writing partner.β14Updated 2 years ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Studyβ43Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluationβ61Updated last month
- A toolkit for evaluation of natural language generation (NLG), including BLEU, ROUGE, METEOR, and CIDEr.β31Updated 4 years ago
- Code for our SIGIR 2022 accepted paper : P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Lβ¦β17Updated last year
- [EMNLP 2023] C-STS: Conditional Semantic Textual Similarityβ71Updated 11 months ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedbackβ39Updated last year
- [ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answeringβ45Updated 2 years ago
- Code for NAACL2022 Long Paper "An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling"β27Updated 2 years ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planningβ36Updated last year
- Code for embedding and retrieval research.β16Updated last year
- EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarizationβ35Updated last year
- Source code for ACL 2022 Paper "Prompt-based Data Augmentation for Low-Resource NLU Tasks"β69Updated 2 years ago
- Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling"β34Updated 2 years ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"β57Updated 2 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"β11Updated last year
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021).β28Updated 3 years ago
- Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. Tβ¦β30Updated 2 years ago
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractorsβ36Updated 2 months ago
- β71Updated 2 years ago
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuningβ100Updated last year
- β31Updated last year
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)β17Updated last year
- The repository for ACL 2022 paper: Other Roles Matter! Enhancing Role-Oriented Dialogue Summarization via Role Interactionsβ26Updated 2 years ago
- Official code for "Continual Prompt Tuning for Dialog State Tracking" (ACL 2022).β27Updated 2 years ago
- Repo for paper: Controllable Text Generation with Language Constraintsβ19Updated last year
- πΌ Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Expertsβ38Updated 6 months ago