threelittlemonkeys / rnn-encoder-decoder-pytorchLinks
RNN Encoder-Decoder in PyTorch
☆44Updated last year
Alternatives and similar repositories for rnn-encoder-decoder-pytorch
Users that are interested in rnn-encoder-decoder-pytorch are comparing it to the libraries listed below
Sorting:
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.☆82Updated last year
- Code for Multi-Head Attention: Collaborate Instead of Concatenate☆151Updated 2 years ago
- An LSTM in PyTorch with best practices (weight dropout, forget bias, etc.) built-in. Fully compatible with PyTorch LSTM.☆134Updated 5 years ago
- ☆211Updated last year
- Code for "Finetuning Pretrained Transformers into Variational Autoencoders"☆39Updated 3 years ago
- The entmax mapping and its loss, a family of sparse softmax alternatives.☆445Updated last year
- Trains Transformer model variants. Data isn't shuffled between batches.☆143Updated 2 years ago
- Code for the paper "A Stable Variational Autoencoder for Text Modelling"☆26Updated 5 years ago
- Sequence to Sequence Models in PyTorch☆44Updated last year
- LAnguage Modelling Benchmarks☆138Updated 5 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 2 years ago
- PyTorch implementation of "Lagging Inference Networks and Posterior Collapse in Variational Autoencoders" (ICLR 2019)☆185Updated 4 years ago
- A PyTorch implementation of the Transformer model from "Attention Is All You Need".☆59Updated 6 years ago
- Hard-Coded Gaussian Attention for Neural Machine Translation☆36Updated 2 years ago
- Understanding the Difficulty of Training Transformers☆330Updated 3 years ago
- Cascaded Text Generation with Markov Transformers☆129Updated 2 years ago
- Official Code for Towards Transparent and Explainable Attention Models paper (ACL 2020)☆35Updated 3 years ago
- Implementation of the GBST block from the Charformer paper, in Pytorch☆118Updated 4 years ago
- Transformers without Tears: Improving the Normalization of Self-Attention☆133Updated last year
- ☆219Updated 5 years ago
- Minimal RNN classifier with self-attention in Pytorch☆151Updated 3 years ago
- PyTorch implementation of BERT in "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"☆109Updated 6 years ago
- ☆64Updated 5 years ago
- Transformer-based Conditional Variational Autoencoder for Controllable Story Generation☆158Updated 3 years ago
- Code accompanying our papers on the "Generative Distributional Control" framework☆118Updated 2 years ago
- ☆85Updated last year
- Compositional generalization through meta sequence-to-sequence learning☆83Updated 5 years ago
- Code for "Rissanen Data Analysis: Examining Dataset Characteristics via Description Length" by Ethan Perez, Douwe Kiela, and Kyungyhun Ch…☆36Updated 4 years ago
- This is a repository with the code for the ACL 2019 paper "Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, t…☆312Updated 4 years ago
- An implementation of masked language modeling for Pytorch, made as concise and simple as possible☆179Updated 2 years ago