princeton-nlp / semsup
Semantic Supervision: Enabling Generalization over Output Spaces
☆16Updated last year
Related projects: ⓘ
- ☆63Updated 2 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆55Updated 3 years ago
- Code and Experiments for ACL-IJCNLP 2021 Paper "Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for …☆56Updated 2 years ago
- [EMNLP 2021] Code and data for our paper "Visually Grounded Reasoning across Languages and Cultures"☆28Updated 2 years ago
- Hard-Coded Gaussian Attention for Neural Machine Translation☆36Updated last year
- ☆13Updated this week
- DEMix Layers for Modular Language Modeling☆51Updated 3 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Updated 2 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆32Updated 2 years ago
- N/A☆18Updated 2 years ago
- Code for EMNLP 2020 paper CoDIR☆41Updated last year
- ☆44Updated 2 years ago
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)☆36Updated 3 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Updated last year
- ☆22Updated 3 years ago
- ☆21Updated 3 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Updated 2 years ago
- Rationales for Sequential Predictions☆40Updated 2 years ago
- ☆49Updated last year
- Exploring Few-Shot Adaptation of Language Models with Tables☆23Updated 2 years ago
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Updated last year
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"☆32Updated 3 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆31Updated 2 years ago
- Implementation for Variational Information Bottleneck for Effective Low-resource Fine-tuning, ICLR 2021☆36Updated 3 years ago
- Code for "Open Vocabulary Extreme Classification Using Generative Models"☆24Updated 2 years ago
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Updated 2 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 2 years ago
- ☆11Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…☆20Updated 2 years ago