danqi / thesis
Danqi Chen's PhD Thesis
☆221Updated 4 years ago
Related projects: ⓘ
- "Target-Guided Open-Domain Conversation" in ACL 2019☆149Updated 5 years ago
- Source code for "Efficient Training of BERT by Progressively Stacking"☆111Updated 5 years ago
- PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"☆269Updated 2 years ago
- A simple yet strong implementation of neural machine translation in pytorch☆89Updated 3 years ago
- A collection of transformer's guides, implementations and variants.☆102Updated 4 years ago
- Implementation of Universal Transformer in Pytorch☆256Updated 5 years ago
- A PyTorch implementation of Attention is all you need☆42Updated 5 years ago
- Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…☆249Updated 2 years ago
- Source code of paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning"☆125Updated 3 years ago
- Codes for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"☆146Updated 5 years ago
- A Toolkit for Training, Tracking, Saving Models and Syncing Results☆60Updated 4 years ago
- Visualization for simple attention and Google's multi-head attention.☆68Updated 6 years ago
- Implementation of Dual Learning NMT on PyTorch☆162Updated 6 years ago
- End-To-End Memory Networks in PyTorch☆38Updated 6 years ago
- A neural machine translation model in PyTorch☆117Updated 5 years ago
- A dual learning toolkit developed by Microsoft Research☆71Updated last year
- Reinforcement Learning for Neural Machine Translation☆186Updated last year
- Source Code for DialogWAE: Multimodal Response Generation with Conditional Wasserstein Autoencoder (https://arxiv.org/abs/1805.12352)☆125Updated 6 years ago
- Re-implement "QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension"☆120Updated 5 years ago
- ☆308Updated 2 years ago
- This repo is not maintained. For latest version, please visit https://github.com/ictnlp. A collection of transformer's guides, implementa…☆42Updated 5 years ago
- Paper collection of Neural Text Generation☆51Updated 5 years ago
- Tensorflow Implementation of Knowledge-Guided CVAE for dialog generation ACL 2017. It is released by Tiancheng Zhao (Tony) from Dialog Re…☆309Updated 5 years ago
- Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"☆577Updated 5 years ago
- Bidirectional Attention Flow for Machine Comprehension, Minjoon Seo, Aniruddha Kembhavi, Ali Farhadi, Hannaneh Hajishirzi https://arxiv.o…☆143Updated 2 years ago
- The source code for "An Actor Critic Algorithm for Structured Prediction"☆167Updated 7 years ago
- Neural Text Generation with Unlikelihood Training☆311Updated 3 years ago
- Improving the Transformer translation model with document-level context☆172Updated 4 years ago
- (Beta Version!) Experiment Code for Paper ``CoT: Cooperative Training for Generative Modeling of Discrete Data''☆74Updated 5 years ago
- Code for the RecAdam paper: Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting.☆114Updated 3 years ago