Edy-Barraza / Transformer_Distillation
Knowledge Distillation For Transformer Language Models
☆52Updated last year
Alternatives and similar repositories for Transformer_Distillation
Users who are interested in Transformer_Distillation are comparing it to the libraries listed below.
- MAsked Sequence to Sequence (MASS) pre-training for language generation☆21Updated 6 years ago
- Implemented transformer NN block for machine translation, text classification, natural language inference, as well as machine reading compr…☆11Updated last year
- Record papers for some NLP related area☆24Updated 3 years ago
- Code for EMNLP 2018 paper https://arxiv.org/pdf/1808.09075.pdf☆38Updated 6 years ago
- ☆46Updated 10 months ago
- Distilling BERT using natural language generation.☆38Updated last year
- Classic deep neural network models for text matching, and implementation with tensorflow.☆12Updated 6 years ago
- modification of official bert for downstream task☆31Updated 2 years ago
- ☆59Updated 5 years ago
- Knowledge Distillation from BERT☆52Updated 6 years ago
- ☆85Updated 5 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 4 years ago
- TensorFlow implementation of Bi-directional RNN Language Model☆38Updated 6 years ago
- BERT Extension in TensorFlow☆30Updated 5 years ago
- Deep Unknown Intent Detection with Margin Loss (ACL2019)☆34Updated 2 years ago
- a simple yet complete implementation of the popular BERT model☆127Updated 5 years ago
- ☆24Updated 5 years ago
- Ordered Neurons LSTM☆30Updated 3 years ago
- AI Challenge 2018's final code.☆16Updated 6 years ago
- machine reading comprehension with deep learning☆20Updated 7 years ago
- dstc7-noesis☆46Updated 5 years ago
- code for Question Condensing Networks for Answer Selection in Community Question Answering☆14Updated 6 years ago
- An Implementation of Bidirectional Attention Flow☆40Updated 7 years ago
- PyTorch implementation of Attention-over-Attention Neural Networks for Reading Comprehension☆60Updated 7 years ago
- R-Net with PyTorch☆24Updated 7 years ago
- This is the code in <Selection Bias Explorations and Debias Methods for Natural Language Sentence Matching Datasets> which has been accep…☆34Updated last year
- This is the code for "Learning Sentiment Memories for Sentiment Modification without Parallel Data".☆54Updated 6 years ago
- ☆23Updated 2 years ago
- An "end-to-end trainable task-oriented dialogue model" implementation.☆37Updated 2 years ago
- Code for reproducing the results from the paper Few Shot Text Classification with a Human in the Loop☆90Updated 7 years ago