danqi / thesis
Danqi Chen's PhD Thesis
☆221 Updated 4 years ago
Related projects
Alternatives and complementary repositories for thesis
- Codes for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View" ☆145 Updated 5 years ago
- "Target-Guided Open-Domain Conversation" in ACL 2019 ☆149 Updated 5 years ago
- Source code for "Efficient Training of BERT by Progressively Stacking" ☆112 Updated 5 years ago
- PyTorch Implementation of "Non-Autoregressive Neural Machine Translation" ☆269 Updated 2 years ago
- Implementation of Dual Learning NMT on PyTorch ☆164 Updated 6 years ago
- A PyTorch implementation of Attention is all you need ☆42 Updated 6 years ago
- This repo is not maintained. For latest version, please visit https://github.com/ictnlp. A collection of transformer's guides, implementa… ☆42 Updated 5 years ago
- Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis… ☆250 Updated 3 years ago
- Source code of paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning" ☆125 Updated 3 years ago
- Source Code for DialogWAE: Multimodal Response Generation with Conditional Wasserstein Autoencoder (https://arxiv.org/abs/1805.12352) ☆125 Updated 6 years ago
- Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks" ☆578 Updated 5 years ago
- PyTorch codebase for zero-shot dialog generation SIGDIAL 2018, It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, … ☆133 Updated 5 years ago
- Improving the Transformer translation model with document-level context ☆172 Updated 4 years ago
- LAMB Optimizer for Large Batch Training (TensorFlow version) ☆120 Updated 4 years ago
- A neural machine translation model in PyTorch ☆118 Updated 5 years ago
- Generative Flow based Sequence-to-Sequence Toolkit written in Python. ☆244 Updated 4 years ago
- ☆75 Updated 7 years ago
- A dual learning toolkit developed by Microsoft Research ☆70 Updated last year
- ☆310 Updated 2 years ago
- Code for the RecAdam paper: Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting. ☆115 Updated 4 years ago
- ☆120 Updated 5 years ago
- Implementation of Universal Transformer in Pytorch ☆258 Updated 6 years ago
- ☆94 Updated 3 years ago
- ☆74 Updated 2 years ago
- LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts (AAAI 2019) ☆122 Updated 5 years ago
- Visualization for simple attention and Google's multi-head attention. ☆68 Updated 6 years ago
- Reinforcement Learning for Neural Machine Translation ☆187 Updated 2 years ago
- ☆121 Updated 8 years ago
- A simple yet strong implementation of neural machine translation in pytorch ☆92 Updated 3 years ago