Edy-Barraza / Transformer_Distillation
Knowledge Distillation For Transformer Language Models
☆52 · Updated 10 months ago
Related projects
Alternatives and complementary repositories for Transformer_Distillation
- PyTorch port of BERT ML model ☆16 · Updated 5 years ago
- MAsked Sequence to Sequence (MASS) pre-training for language generation ☆21 · Updated 5 years ago
- ☆46 · Updated 3 months ago
- ☆81 · Updated 4 years ago
- Distilling BERT using natural language generation. ☆35 · Updated last year
- Ordered Neurons LSTM ☆30 · Updated 2 years ago
- This is the PyTorch implementation of the ACL 2019 paper RankQA: Neural Question Answering with Answer Re-Ranking. ☆84 · Updated 2 years ago
- A large-scale cleaned Chinese chitchat corpus and Chinese dialogpt models ☆34 · Updated 4 years ago
- NAACL'19: "Jointly Optimizing Diversity and Relevance in Neural Response Generation" ☆74 · Updated 4 years ago
- ☆17 · Updated 2 years ago
- ☆34 · Updated 5 years ago
- modification of official bert for downstream task ☆31 · Updated last year
- Implementation of pQRNN in PyTorch ☆46 · Updated 3 years ago
- ☆48 · Updated 3 years ago
- An Implementation of Bidirectional Attention Flow ☆41 · Updated 7 years ago
- Code for EMNLP 2018 paper https://arxiv.org/pdf/1808.09075.pdf ☆38 · Updated 6 years ago
- Record papers for some NLP related area ☆24 · Updated 2 years ago
- Natural Language Generation by Hierarchical Decoding with Linguistic Patterns (NAACL-HLT 2018), Investigating Linguistic Pattern Ordering… ☆33 · Updated 6 years ago
- ☆29 · Updated 5 years ago
- This is the code in <Selection Bias Explorations and Debias Methods for Natural Language Sentence Matching Datasets> which has been accep… ☆34 · Updated last year
- PyTorch implementation of Transformer-based Neural Machine Translation ☆77 · Updated last year
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation ☆70 · Updated 4 years ago
- PyTorch implementation of Attention-over-Attention Neural Networks for Reading Comprehension ☆61 · Updated 7 years ago
- Official implementation of the models proposed in paper "Improving Neural Response Diversity with Frequency-Aware Cross-Entropy Loss" ☆19 · Updated 5 years ago
- Knowledge Graph based Question Answering benchmark. ☆10 · Updated 4 years ago
- In this project we develop new deep learning models for bootstrapping language understanding models for languages with no labeled data us… ☆77 · Updated 2 years ago