AotY / Pytorch-NMT
PyTorch implementation of "Effective Approaches to Attention-based Neural Machine Translation" using scheduled sampling to improve the parameter estimation process.
☆20Updated 6 years ago
Alternatives and similar repositories for Pytorch-NMT:
Users that are interested in Pytorch-NMT are comparing it to the libraries listed below
- M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning☆29Updated 4 years ago
- ☆42Updated 3 years ago
- ☆23Updated 4 years ago
- PyTorch implementation of "Effective Approaches to Attention-based Neural Machine Translation" using scheduled sampling to improve the pa…☆38Updated 7 years ago
- Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]☆120Updated 2 years ago
- Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch☆70Updated 4 years ago
- Video descriptions of research papers relating to foundation models and scaling☆30Updated last year
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago
- The newest reading list for representation learning☆114Updated 3 years ago
- Implementation of BC-IRL and other IRL baselines☆25Updated last year
- Implementation of Memformer, a Memory-augmented Transformer, in Pytorch☆111Updated 4 years ago
- Transformer-based Conditional Variational Autoencoder for Controllable Story Generation☆151Updated 2 years ago
- This repository contains code for paper VICTR: Visual Information Captured Text Representation for Text-to-Image Multimodal Tasks☆13Updated 3 years ago
- This repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample …☆49Updated last year
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys"☆26Updated 2 years ago
- A collection of transformer's guides, implementations and variants.☆102Updated 5 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆72Updated 2 years ago
- ICLR2023 statistics☆60Updated last year
- CS231n Assignments Solutions - Spring 2020☆48Updated 3 years ago
- PyTorch implementation of HyperNetworks (Ha et al., ICLR 2017) for ResNet (Residual Networks)☆262Updated 3 years ago
- lstm with layer normalization☆19Updated 3 years ago
- Minimal RLHF implementation built on top of minGPT.☆29Updated 7 months ago
- ☆49Updated 7 months ago
- Keras implement of Finite Scalar Quantization☆69Updated last year
- Sparse Transformer with limited attention span in PyTorch☆12Updated 3 years ago
- Minimal code for A Generalist Agent☆38Updated 2 years ago
- Curriculum Learning related papers and materials☆54Updated 4 years ago
- Simple pytorch implmentation of reinforcement learning algorithms☆25Updated 5 years ago
- Multi-head attention in PyTorch☆150Updated 5 years ago
- My notes and assignment solutions for Stanford CS330 (Fall 2019 & 2020) Deep Multi-Task and Meta Learning☆41Updated last year