AotY / Pytorch-NMTLinks
PyTorch implementation of "Effective Approaches to Attention-based Neural Machine Translation" using scheduled sampling to improve the parameter estimation process.
☆20Updated 6 years ago
Alternatives and similar repositories for Pytorch-NMT
Users that are interested in Pytorch-NMT are comparing it to the libraries listed below
Sorting:
- [ICML 2021 Oral] We show pure attention suffers rank collapse, and how different mechanisms combat it.☆166Updated 4 years ago
- Implementation of Linformer for Pytorch☆297Updated last year
- This repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample …☆57Updated last year
- Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]☆129Updated 3 years ago
- The newest reading list for representation learning☆116Updated 4 years ago
- M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning☆29Updated 4 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆225Updated 3 years ago
- Implementation of BC-IRL and other IRL baselines☆28Updated 2 years ago
- Implement the paper "Self-Attention with Relative Position Representations"☆138Updated 4 years ago
- categorical variational autoencoder using the Gumbel-Softmax estimator☆26Updated 6 years ago
- ☆73Updated 4 years ago
- PyTorch implementation of a Variational Autoencoder with Gumbel-Softmax Distribution☆210Updated 6 years ago
- ☆31Updated 5 years ago
- Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch☆70Updated 5 years ago
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆196Updated 2 years ago
- Simple pytorch implmentation of reinforcement learning algorithms☆26Updated 6 years ago
- Pytorch reimplementation for "Gradient Surgery for Multi-Task Learning"☆363Updated 4 years ago
- Pytorch implementation of Compressive Transformers, from Deepmind☆164Updated 3 years ago
- code for the ddp tutorial☆32Updated 3 years ago
- Crawl & Visualize ICLR 2023 Data from OpenReview☆84Updated 2 years ago
- ☆46Updated 2 years ago
- Pytorch Implementation of "Neural Discrete Representation Learning"☆91Updated 7 years ago
- PyTorch implementation of HyperNetworks (Ha et al., ICLR 2017) for ResNet (Residual Networks)☆269Updated 4 years ago
- Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"☆369Updated last year
- Keras implement of Finite Scalar Quantization☆81Updated last year
- ☆45Updated 4 years ago
- ☆23Updated 4 years ago
- Implementation of Memformer, a Memory-augmented Transformer, in Pytorch☆120Updated 4 years ago
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆134Updated 2 years ago
- [ICLR 2022] "Deep AutoAugment" by Yu Zheng, Zhi Zhang, Shen Yan, Mi Zhang☆65Updated 11 months ago