Spico197 / awesome-lm-evaluationLinks
π©Ί A collection of ChatGPT evaluation reports on various bechmarks.
β50Updated 2 years ago
Alternatives and similar repositories for awesome-lm-evaluation
Users that are interested in awesome-lm-evaluation are comparing it to the libraries listed below
Sorting:
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Studyβ43Updated 2 years ago
- βοΈ ChatGPT as a writing partner.β14Updated 2 years ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planningβ36Updated 2 years ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedbackβ40Updated 2 years ago
- β14Updated 2 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"β11Updated 2 years ago
- Code for NAACL2022 Long Paper "An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling"β28Updated 2 years ago
- β17Updated 7 months ago
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Geneβ¦β27Updated last year
- This project maintains a reading list for general text generation tasksβ66Updated 3 years ago
- First explanation metric (diagnostic report) for text generation evaluationβ62Updated 7 months ago
- reStructured Pre-trainingβ98Updated 2 years ago
- code for Teaching LM to Translate with Comparisonβ39Updated last year
- Code for Aesop: Paraphrase Generation with Adaptive Syntactic Control (EMNLP 2021)β26Updated 3 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"β19Updated 3 years ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"β58Updated 3 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.β22Updated last year
- EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarizationβ36Updated last year
- self-adaptive in-context learningβ45Updated 2 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. πβ12Updated 2 years ago
- β57Updated last year
- Paradigm shift in natural language processingβ42Updated 3 years ago
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)β17Updated last year
- PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogβ¦β27Updated 4 years ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".β41Updated 2 years ago
- Code for embedding and retrieval research.β17Updated 2 years ago
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuningβ100Updated 2 years ago
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractorsβ38Updated 8 months ago
- An (incomplete) overview of information extractionβ41Updated 3 years ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Modelsβ19Updated 2 years ago