google-deepmind / lambLinks

LAnguage Modelling Benchmarks

☆138

Alternatives and similar repositories for lamb

Users that are interested in lamb are comparing it to the libraries listed below

Sorting:

benkrause / dynamiceval-transformer
☆47Updated 6 years ago
lancopku / Prime
A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.
☆85Updated 2 years ago
laiguokun / Funnel-Transformer
☆218Updated 5 years ago
harvardnlp / urnng
☆178Updated 5 years ago
harvardnlp / cascaded-generation
Cascaded Text Generation with Markov Transformers
☆129Updated 2 years ago
keitakurita / Better_LSTM_PyTorch
An LSTM in PyTorch with best practices (weight dropout, forget bias, etc.) built-in. Fully compatible with PyTorch LSTM.
☆134Updated 5 years ago
andreamad8 / Universal-Transformer-Pytorch
Implementation of Universal Transformer in Pytorch
☆261Updated 6 years ago
leimao / Two-Layer-Hierarchical-Softmax-PyTorch
Two-Layer Hierarchical Softmax Implementation for PyTorch
☆69Updated 4 years ago
tanyuqian / learning-data-manipulation
NeurIPS 2019 - Learning Data Manipulation for Augmentation and Weighting
☆109Updated 4 years ago
DSE-MSU / R-transformer
Pytorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer.
☆230Updated 6 years ago
yunjey / seq2seq-dataloader
PyTorch DataLoader for seq2seq
☆85Updated 6 years ago
jiacheng-xu / vmf_vae_nlp
Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders"
☆169Updated 6 years ago
serrano-s / attn-tests
Checking the interpretability of attention on text classification models
☆49Updated 6 years ago
sarahwie / attention
Code for EMNLP 2019 paper "Attention is not not Explanation"
☆58Updated 4 years ago
cerebroai / reformers
Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing
☆95Updated 5 years ago
wellecks / nonmonotonic_text
Non-Monotonic Sequential Text Generation (ICML 2019)
☆72Updated 6 years ago
bplank / semi-supervised-baselines
Code for "Strong Baselines for Neural Semi-supervised Learning under Domain Shift" (Ruder & Plank, 2018 ACL)
☆61Updated 2 years ago
alex-tifrea / poincare_glove
Implementation of the "Poincare Glove: Hyperbolic word embeddings" paper
☆88Updated 4 years ago
wouterkool / stochastic-beam-search
Implementation of Stochastic Beam Search using Fairseq
☆105Updated 6 years ago
zbloss / reformer_lm
a Pytorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB)
☆53Updated 2 years ago
cambridgeltl / parameter-factorization
Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer
☆39Updated 4 years ago
epfml / collaborative-attention
Code for Multi-Head Attention: Collaborate Instead of Concatenate
☆152Updated 2 years ago
ofirpress / sandwich_transformer
This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …
☆55Updated 4 years ago
pclucas14 / GansFallingShort
Code for "Language GANs Falling Short"
☆59Updated 4 years ago
IKMLab / arct2
Code for reproducing experiments in our ACL 2019 paper "Probing Neural Network Comprehension of Natural Language Arguments"
☆53Updated 3 years ago
seba-1511 / lstms.pth
PyTorch implementations of LSTM Variants (Dropout + Layer Norm)
☆137Updated 4 years ago
yzh119 / BPT
Source code of paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning"
☆128Updated 4 years ago
jihunchoi / unsupervised-treelstm
☆121Updated 6 years ago
yumeng5 / Spherical-Text-Embedding
[NeurIPS 2019] Spherical Text Embedding
☆177Updated last year
rdspring1 / PyTorch_GBW_LM
PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset
☆123Updated 5 years ago