Smerity / pytorch-lamb
Implementation of https://arxiv.org/abs/1904.00962
☆15Updated 5 years ago
Related projects: ⓘ
- Code repo for "Transformer on a Diet" paper☆31Updated 4 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆55Updated 3 years ago
- ☆14Updated 5 years ago
- A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.☆47Updated 4 years ago
- PyTorch code for meta seq2seq learning☆43Updated 4 years ago
- ☆36Updated this week
- ☆64Updated 4 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Updated 3 years ago
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTa☆18Updated 5 years ago
- Improving Neural Text Generation with Reinforcement Learning☆21Updated 3 years ago
- Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ)☆49Updated 6 years ago
- ☆47Updated 4 years ago
- Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for nat…☆27Updated 4 years ago
- This is a repository with the code for the EMNLP 2020 paper "Information-Theoretic Probing with Minimum Description Length"☆68Updated last month
- ☆63Updated 2 years ago
- a Pytorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB)☆53Updated last year
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Updated last year
- ☆42Updated 3 years ago
- PhD thesis (updating) of Jiatao Gu from HKU☆19Updated 6 years ago
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Updated 2 years ago
- Code for EMNLP 2019 paper "Attention is not not Explanation"☆57Updated 3 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"☆32Updated 3 years ago
- PyTorch code for the EMNLP 2020 paper "Embedding Words in Non-Vector Space with Unsupervised Graph Learning"☆40Updated 3 years ago
- Implementation of experiments in paper "Learning from Rules Generalizing Labeled Exemplars" to appear in ICLR2020 (https://openreview.net…☆49Updated last year
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Updated 3 years ago
- [ACL 2018] Conditional Generators of Words Definitions☆33Updated 6 years ago
- Language Model Baselines for PyTorch☆42Updated 4 years ago
- ☆21Updated 3 years ago
- Code for the publication Learning to Reason with Third-Order Tensor Products.☆38Updated 5 years ago
- Tools for training pytorch language models☆27Updated 3 years ago