This repository contains the code for "Generating Datasets with Pretrained Language Models".
☆189Aug 17, 2021Updated 4 years ago
Alternatives and similar repositories for dino
Users that are interested in dino are comparing it to the libraries listed below
Sorting:
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,628Jun 12, 2023Updated 2 years ago
- ☆75Jul 2, 2021Updated 4 years ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆152Jun 10, 2022Updated 3 years ago
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".☆89Aug 20, 2021Updated 4 years ago
- ☆32Oct 30, 2023Updated 2 years ago
- SummVis is an interactive visualization tool for text summarization.☆254Jun 17, 2022Updated 3 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.☆11May 7, 2020Updated 5 years ago
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723☆730Aug 29, 2022Updated 3 years ago
- Collection of NLP model explanations and accompanying analysis tools☆144Jun 26, 2023Updated 2 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆175Jun 6, 2021Updated 4 years ago
- Uncertainty-aware Self-training☆124Dec 20, 2023Updated 2 years ago
- ☆16May 14, 2024Updated last year
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28May 2, 2022Updated 3 years ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆786May 19, 2024Updated last year
- Few-shot Learning of GPT-3☆357Sep 18, 2023Updated 2 years ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆286Oct 20, 2022Updated 3 years ago
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,155Feb 20, 2024Updated 2 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆138Sep 6, 2023Updated 2 years ago
- Code for Editing Factual Knowledge in Language Models☆142Jan 28, 2022Updated 4 years ago
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆26May 1, 2022Updated 3 years ago
- Attribute statements generated by LLMs to preceding tokens using attention weights.☆22Apr 22, 2025Updated 10 months ago
- This repository contains the FewGLUE dataset for few-shot natural language understanding.☆160Sep 16, 2020Updated 5 years ago
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.☆1,413Aug 30, 2023Updated 2 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆445May 9, 2022Updated 3 years ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆24Jun 19, 2023Updated 2 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆209Aug 31, 2021Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processing☆286Aug 8, 2022Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆137Aug 2, 2023Updated 2 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆18Apr 25, 2021Updated 4 years ago
- ☆18Apr 16, 2021Updated 4 years ago
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- Library for Knowledge Intensive Language Tasks☆967Mar 31, 2022Updated 3 years ago
- ☆344Aug 3, 2021Updated 4 years ago