nng555 / ssmbaLinks
☆62Updated 3 years ago
Alternatives and similar repositories for ssmba
Users that are interested in ssmba are comparing it to the libraries listed below
Sorting:
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆55Updated 4 years ago
- OOD Generalization and Detection (ACL 2020)☆60Updated 5 years ago
- Official Pytorch implementation of (Roles and Utilization of Attention Heads in Transformer-based Neural Language Models), ACL 2020☆16Updated 6 months ago
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)☆36Updated 4 years ago
- Code accompanying our papers on the "Generative Distributional Control" framework☆118Updated 2 years ago
- This is a repository with the code for the EMNLP 2020 paper "Information-Theoretic Probing with Minimum Description Length"☆71Updated last year
- Implementation of Mixout with PyTorch☆75Updated 2 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Updated 4 years ago
- ☆48Updated 5 years ago
- ☆45Updated 3 years ago
- Code for Paper: Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data☆36Updated 4 years ago
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers"☆61Updated 2 years ago
- Implementation of the paper 'Plug and Play Autoencoders for Conditional Text Generation'☆43Updated 4 years ago
- Official code for the ICLR 2020 paper 'ARE PPE-TRAINED LANGUAGE MODELS AWARE OF PHRASES? SIMPLE BUT STRONG BASELINES FOR GRAMMAR INDCUTIO…☆30Updated 2 years ago
- Language Model Baselines for PyTorch☆41Updated 5 years ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆40Updated 4 years ago
- Code for EMNLP 2020 paper CoDIR☆41Updated 2 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆102Updated 4 years ago
- ☆22Updated 4 years ago
- PyTorch implementation of A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text (EMNLP 2019)☆48Updated 5 years ago
- ☆44Updated 5 years ago
- ☆44Updated 6 years ago
- Code for papers "A Surprisingly Robust Trick for Winograd Schema Challenge" and "WikiCREM: A Large Unsupervised Corpus for Coreference Re…☆71Updated 2 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆34Updated 3 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"☆66Updated 4 years ago
- Transfer Learning in Dialogue Benchmarking Toolkit☆14Updated 2 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Updated 5 years ago
- ☆42Updated 4 years ago
- [NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering☆36Updated 4 years ago
- Cascaded Text Generation with Markov Transformers