LaraQianYang / Ouroboros
Ouroboros: On Accelerating Training of Transformer-Based Language Models
☆10Updated 5 years ago
Alternatives and similar repositories for Ouroboros:
Users that are interested in Ouroboros are comparing it to the libraries listed below
- The Importance of Being Recurrent for Modeling Hierarchical Structure☆25Updated 6 years ago
- ☆31Updated 5 years ago
- Tensorflow Source code for "Recurrently Controlled Recurrent Networks" (NIPS 2018)☆23Updated 6 years ago
- ☆12Updated 6 years ago
- Implementation of "Effective Adversarial Regularization for Neural Machine Translation", ACL 2019☆21Updated 5 years ago
- Code for the paper "Inoculation by Fine-Tuning: A Method for Analyzing Challenge Datasets", to be presented at NAACL 2019.☆19Updated 5 years ago
- Code for the article "Automatic Temperature Control for Neural Machine Translation" (EMNLP 2018)☆13Updated 5 years ago
- Code for paper "Interactive Machine Comprehension with Information Seeking Agents" -- public version☆23Updated 5 years ago
- Code for NAACL19 Paper "How Large a Vocabulary Does Text Classification Need? A Variational Approach to Vocabulary Selection"☆42Updated 5 years ago
- An example of DyNet autobatching for the NIPS "how to code a paper" workshop☆12Updated 7 years ago
- PhD thesis (updating) of Jiatao Gu from HKU☆19Updated 6 years ago
- [EMNLP 2018] On Tree-Based Neural Sentence Modeling.☆65Updated 5 years ago
- Code for "Variational Sequential Labelers for Semi-Supervised Learning" (EMNLP 2018)☆34Updated 6 years ago
- Code for "A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence Representations" (NAACL 2019)☆67Updated 3 years ago
- Fine-grained Gating for Reading Comprehension☆19Updated 7 years ago
- Text Content Manipulation☆44Updated 4 years ago
- Enhancing Sentence Embedding with Generalized Pooling☆11Updated 6 years ago
- Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D…☆11Updated 2 years ago
- Learn models that are robust to spurious correlations in the dataset.☆26Updated 5 years ago
- Code to reproduce results in our ACL 2018 paper "Did the Model Understand the Question?"☆33Updated 6 years ago
- Maximal Mutual Information (MMI) Tagger☆24Updated 5 years ago
- This repository contains the code used for Ordered Memory paper☆28Updated 5 years ago
- PyTorch implementation of A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text (EMNLP 2019)☆47Updated 5 years ago
- Implementation of "Modeling Past and Future for Neural Machine Translation"☆15Updated 6 years ago
- Question Answering with Interactive Text (QAit), code for EMNLP 2019 paper "Interactive Language Learning by Question Answering"☆44Updated 5 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆63Updated 2 years ago
- PyTorch implementation of ACL paper https://arxiv.org/abs/1906.02656☆25Updated last year
- ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation☆25Updated 4 years ago
- ☆61Updated 6 years ago
- Tensorflow Implementation of Improving Variational Encoder-Decoders in Dialogue Generation☆27Updated 6 years ago