cgraywang / gluon-nlp-1
Code repo for "Language Models with Transformers" paper
☆21 Updated 4 years ago
Alternatives and similar repositories for gluon-nlp-1:
Users who are interested in gluon-nlp-1 are comparing it to the libraries listed below
- ☆17 Updated 2 years ago
- ☆22 Updated 6 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation ☆70 Updated 4 years ago
- Code accompanying EMNLP 2018 paper Language Modeling with Sparse Product of Sememe Experts ☆25 Updated 6 years ago
- Code for the paper "Cross-Lingual BERT Transformation for Zero-Shot Dependency Parsing" ☆35 Updated 5 years ago
- Document-Level Neural Machine Translation with Hierarchical Attention Networks ☆68 Updated 2 years ago
- ☆24 Updated 4 years ago
- Code for EMNLP 2018 paper https://arxiv.org/pdf/1808.09075.pdf ☆38 Updated 6 years ago
- PyTorch implementation of models described in "Grounded compositional outputs for adaptive language modeling", EMNLP 2020. ☆18 Updated 3 years ago
- Code for ACL2020 "Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation" ☆39 Updated 4 years ago
- semi-autoregressive neural machine translation ☆23 Updated 6 years ago
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021. ☆51 Updated 2 years ago
- dstc7-noesis ☆46 Updated 5 years ago
- Code for ACL2021 paper: "GLGE: A New General Language Generation Evaluation Benchmark" ☆58 Updated 2 years ago
- Weakly Supervised Topic Segmentation and Labeling ☆33 Updated 3 years ago
- Source code for "Straight to the Tree: Constituency Parsing with Neural Syntactic Distance" published at ACL 2018 ☆63 Updated 6 years ago
- ☆46 Updated 4 months ago
- ☆41 Updated 7 years ago
- Codes for our paper at EMNLP2019 ☆36 Updated 5 years ago
- ☆23 Updated 7 years ago
- Graph-based and Transition-based dependency parsers based on BiLSTMs ☆18 Updated 3 years ago
- My Ph.D. thesis paper "Tackling Graphical NLP problems with Graph Recurrent Networks" and my defense slides ☆9 Updated 4 years ago
- ☆23 Updated 5 years ago
- Code for NeurIPS2020 "Incorporating BERT into Parallel Sequence Decoding with Adapters" ☆32 Updated 2 years ago
- Soft Contextual Data Augmentation ☆39 Updated 5 months ago
- Source code for "Accelerating Neural Transformer via an Average Attention Network" ☆78 Updated 5 years ago
- Dynamic data selection for neural machine translation ☆20 Updated 6 years ago
- ☆14 Updated 2 years ago
- Simple LSTM-based word-level language model in PyTorch ☆46 Updated 5 years ago