AotY / Pytorch-NMTLinks
PyTorch implementation of "Effective Approaches to Attention-based Neural Machine Translation" using scheduled sampling to improve the parameter estimation process.
☆20Updated 6 years ago
Alternatives and similar repositories for Pytorch-NMT
Users that are interested in Pytorch-NMT are comparing it to the libraries listed below
Sorting:
- [ICML 2021 Oral] We show pure attention suffers rank collapse, and how different mechanisms combat it.☆165Updated 4 years ago
- Pytorch reimplementation for "Gradient Surgery for Multi-Task Learning"☆359Updated 4 years ago
- Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]☆128Updated 3 years ago
- Implementation of BC-IRL and other IRL baselines☆28Updated 2 years ago
- Implementation of Forward Forward Network proposed by Hinton in NIPS 2022.☆170Updated 2 years ago
- The newest reading list for representation learning☆116Updated 4 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆225Updated 3 years ago
- Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch☆70Updated 5 years ago
- ☆23Updated 4 years ago
- Implementation of Memformer, a Memory-augmented Transformer, in Pytorch☆119Updated 4 years ago
- Implementation of Linformer for Pytorch☆295Updated last year
- Implement the paper "Self-Attention with Relative Position Representations"☆136Updated 4 years ago
- This repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample …☆57Updated last year
- Repo has PyTorch implementation "Attention is All you Need - Transformers" paper for Machine Translation from French queries to English.☆70Updated 4 years ago
- Code for "Gradient Surgery for Multi-Task Learning"☆331Updated 5 years ago
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆134Updated last year
- Pytorch implementation of Compressive Transformers, from Deepmind☆163Updated 3 years ago
- An implementation of local windowed attention for language modeling☆471Updated 3 weeks ago
- ☆73Updated 4 years ago
- PyTorch implementation of a Variational Autoencoder with Gumbel-Softmax Distribution☆210Updated 6 years ago
- Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"☆369Updated last year
- Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)☆190Updated 3 years ago
- categorical variational autoencoder using the Gumbel-Softmax estimator☆26Updated 6 years ago
- ☆137Updated last year
- Pytorch solutions for UC Berkeley's cs285 assignments☆140Updated 3 years ago
- Crawl & Visualize ICLR 2023 Data from OpenReview☆84Updated 2 years ago
- A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.☆129Updated 8 months ago
- ☆16Updated 3 years ago
- Code for “Pretrained Language Models as Visual Planners for Human Assistance”☆61Updated 2 years ago
- This is a pytorch implementation of k-means clustering algorithm☆321Updated 5 months ago