zphang / adaptive-computation-time-pytorch
Alex Graves' Adaptive Computation Time in PyTorch
☆15 Updated 7 years ago
Alternatives and similar repositories for adaptive-computation-time-pytorch:
Users who are interested in adaptive-computation-time-pytorch are comparing it to the libraries listed below.
- Source code for "Efficient Training of BERT by Progressively Stacking" ☆112 Updated 5 years ago
- [EMNLP 2018] On Tree-Based Neural Sentence Modeling. ☆65 Updated 5 years ago
- Code for SegTree Transformer (ICLR-RLGM 2019). ☆27 Updated 5 years ago
- Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro… ☆59 Updated last year
- ☆11 Updated 4 years ago
- meProp: Sparsified Back Propagation for Accelerated Deep Learning (ICML 2017) ☆110 Updated 2 years ago
- Non-autoregressive Neural Machine Translation (not a full version) ☆71 Updated 2 years ago
- ☆22 Updated 3 years ago
- ☆53 Updated 8 years ago
- ☆13 Updated 5 years ago
- Non-Monotonic Sequential Text Generation (ICML 2019) ☆72 Updated 5 years ago
- ☆119 Updated 6 years ago
- Code for "Language GANs Falling Short" ☆59 Updated 3 years ago
- Code for paper "Continual and Multi-Task Architecture Search (ACL 2019)" ☆41 Updated 5 years ago
- Ouroboros: On Accelerating Training of Transformer-Based Language Models ☆10 Updated 5 years ago
- Code to reproduce results in our ACL 2018 paper "Did the Model Understand the Question?" ☆33 Updated 6 years ago
- This repo is not maintained. For latest version, please visit https://github.com/ictnlp. A collection of transformer's guides, implementa… ☆43 Updated 6 years ago
- [ACL 2019] Visually Grounded Neural Syntax Acquisition ☆89 Updated 11 months ago
- The implementation of multi-branch attentive Transformer (MAT). ☆33 Updated 4 years ago
- This repository contains the code used for Ordered Memory paper ☆28 Updated 5 years ago
- PyTorch implementation of Transformer-based Neural Machine Translation ☆77 Updated 2 years ago
- Source code for the paper "Multilingual Neural Machine Translation with Soft Decoupled Encoding" ☆29 Updated 3 years ago
- End-To-End Memory Networks in PyTorch ☆38 Updated 7 years ago
- Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)" ☆19 Updated 6 years ago
- Implementation of Stochastic Beam Search using Fairseq ☆99 Updated 5 years ago
- Adaptive Softmax implementation for PyTorch ☆80 Updated 5 years ago
- This is a PyTorch implementation of the ICLR 2017 paper "HIERARCHICAL MULTISCALE RECURRENT NEURAL NETWORKS" (https://openreview.net/pdf?i… ☆51 Updated 7 years ago
- ☆74 Updated 7 years ago
- PhD thesis (updating) of Jiatao Gu from HKU ☆19 Updated 6 years ago
- Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation" ☆46 Updated 2 years ago