coastalcph / zeroshot_lexglue
Zero-shot evaluation on LexGLUE tasks with GPT-3.5
☆28Updated 2 years ago
Alternatives and similar repositories for zeroshot_lexglue
Users who are interested in zeroshot_lexglue compare it with the repositories listed below
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆86Updated 11 months ago
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆69Updated 2 years ago
- The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".☆75Updated 2 years ago
- On Transferability of Prompt Tuning for Natural Language Processing☆99Updated last year
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 3 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆32Updated 2 years ago
- Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)☆111Updated 3 years ago
- TBC☆27Updated 2 years ago
- The LM Contamination Index is a manually created database of contamination evidence for LMs.☆78Updated last year
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 4 months ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆39Updated 2 years ago
- Token-level Reference-free Hallucination Detection☆94Updated last year
- Code for Editing Factual Knowledge in Language Models☆138Updated 3 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Updated 11 months ago
- DEMix Layers for Modular Language Modeling☆53Updated 3 years ago
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)☆78Updated last year
- ☆82Updated 2 years ago
- Efficient Memory-Augmented Transformers☆34Updated 2 years ago
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification☆41Updated 2 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Updated 2 years ago
- Apps built using Inspired Cognition's Critique.☆58Updated 2 years ago
- Official code repository for "Exploring Neural Models for Query-Focused Summarization".☆50Updated 2 years ago
- InstructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Updated last year
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding☆67Updated 2 years ago
- Detect hallucinated tokens for conditional sequence generation.☆64Updated 3 years ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Updated 2 years ago
- The Multitask Long Document Benchmark☆39Updated 2 years ago
- Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering☆30Updated 2 years ago
- Retrieval as Attention☆83Updated 2 years ago