google-deepmind / lamb
LAnguage Modelling Benchmarks
☆137Updated 5 years ago
Alternatives and similar repositories for lamb
Users that are interested in lamb are comparing it to the libraries listed below
- Cascaded Text Generation with Markov Transformers☆129Updated 2 years ago
- ☆218Updated 5 years ago
- Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing☆95Updated 5 years ago
- ☆47Updated 6 years ago
- Latent Alignment and Variational Attention☆327Updated 6 years ago
- A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.☆85Updated last year
- An LSTM in PyTorch with best practices (weight dropout, forget bias, etc.) built-in. Fully compatible with PyTorch LSTM.☆133Updated 5 years ago
- ☆176Updated 4 years ago
- Pytorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer.☆230Updated 5 years ago
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs☆107Updated 2 years ago
- Implementation of Universal Transformer in Pytorch☆260Updated 6 years ago
- Text classification models. Used as a submodule for other projects.☆68Updated 6 years ago
- Non-Monotonic Sequential Text Generation (ICML 2019)☆72Updated 6 years ago
- PyTorch implementations of LSTM Variants (Dropout + Layer Norm)☆136Updated 4 years ago
- Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders"☆168Updated 6 years ago
- PyTorch implementation for Interpretable Dialog Generation ACL 2018, It is released by Tiancheng Zhao (Tony) from Dialog Research Center…☆197Updated 6 years ago
- Code for Multi-Head Attention: Collaborate Instead of Concatenate☆152Updated 2 years ago
- PyTorch implementation of latent space reinforcement learning for E2E dialog published at NAACL 2019. It is released by Tiancheng Zhao (…☆144Updated 6 years ago
- ☆121Updated 6 years ago
- Source code for the NAACL 2019 paper "SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sen…☆125Updated 3 years ago
- Bi-Directional Block Self-Attention☆122Updated 7 years ago
- A Pytorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB)☆53Updated 2 years ago
- Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning☆311Updated 4 years ago
- Two-Layer Hierarchical Softmax Implementation for PyTorch☆69Updated 4 years ago
- ☆153Updated 7 years ago
- Assessing syntactic abilities of BERT☆148Updated 6 years ago
- ☆114Updated 6 years ago
- Codes for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"☆148Updated 6 years ago
- Unsupervised Learning of Transferable Relational Graphs☆69Updated 6 years ago
- Source Code for DialogWAE: Multimodal Response Generation with Conditional Wasserstein Autoencoder (https://arxiv.org/abs/1805.12352)☆125Updated 6 years ago