nng555 / ssmba
☆63Updated 3 years ago
Alternatives and similar repositories for ssmba:
Users that are interested in ssmba are comparing it to the libraries listed below
- Official Pytorch implementation of (Roles and Utilization of Attention Heads in Transformer-based Neural Language Models), ACL 2020☆16Updated last month
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆55Updated 4 years ago
- Official code for the ICLR 2020 paper 'ARE PPE-TRAINED LANGUAGE MODELS AWARE OF PHRASES? SIMPLE BUT STRONG BASELINES FOR GRAMMAR INDCUTIO…☆30Updated last year
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Updated 3 years ago
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)☆36Updated 3 years ago
- PyTorch implementation of A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text (EMNLP 2019)☆48Updated 5 years ago
- Code for EMNLP 2020 paper CoDIR☆41Updated 2 years ago
- ☆46Updated 5 years ago
- OOD Generalization and Detection (ACL 2020)☆60Updated 5 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆31Updated 3 years ago
- Implementation of Mixout with PyTorch☆75Updated 2 years ago
- Language Model Baselines for PyTorch☆42Updated 4 years ago
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers"☆59Updated last year
- This code repository presents the pytorch implementation of the paper “Implicit Deep Latent Variable Models for Text Generation”(EMNLP 20…☆54Updated 3 years ago
- ☆47Updated 4 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Updated 4 years ago
- ☆45Updated 3 years ago
- ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation☆25Updated 4 years ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆43Updated 2 years ago
- Pytorch implementation of DiffMask☆55Updated last year
- ☆42Updated 4 years ago
- This is a repository with the code for the EMNLP 2020 paper "Information-Theoretic Probing with Minimum Description Length"☆71Updated 8 months ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆30Updated 3 years ago
- Source Code for paper "Learning from Explanations with Neural Execution Tree", ICLR 2020☆18Updated 4 years ago
- ☆22Updated 4 years ago
- Encoder-Agnostic Adaptation for Conditional Language Generation☆79Updated 9 months ago
- Evaluating Machines by their Real-World Language Use☆33Updated last year
- Implementation of the paper 'Plug and Play Autoencoders for Conditional Text Generation'☆43Updated 4 years ago
- Code Release for "On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies"☆15Updated 4 years ago