Edy-Barraza / Transformer_Distillation
Knowledge Distillation For Transformer Language Models
☆51Updated 8 months ago
Related projects: ⓘ
- Distilling BERT using natural language generation.☆35Updated last year
- ☆19Updated this week
- MAsked Sequence to Sequence (MASS) pre-training for language generation☆21Updated 5 years ago
- ☆29Updated this week
- ☆38Updated this week
- Source code for the paper "Multilingual Neural Machine Translation with Soft Decoupled Encoding"☆29Updated 3 years ago
- PyTorch implementation of Attention-over-Attention Neural Networks for Reading Comprehension☆61Updated 7 years ago
- ☆46Updated last month
- ☆47Updated 5 years ago
- NAACL'19: "Jointly Optimizing Diversity and Relevance in Neural Response Generation"☆74Updated 3 years ago
- Ordered Neurons LSTM☆30Updated 2 years ago
- Natural Language Generation by Hierarchical Decoding with Linguistic Patterns (NAACL-HLT 2018), Investigating Linguistic Pattern Ordering…☆33Updated 5 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 3 years ago
- Tensorflow implementation of Bi-directional RNN Langauge Model☆39Updated 6 years ago
- ☆24Updated 4 years ago
- Multiple Different Natural Language Processing Tasks in a Single Deep Model☆49Updated 5 years ago
- modification of official bert for downstream task☆31Updated last year
- ☆29Updated 5 years ago
- Record papers for some NLP related area☆24Updated 2 years ago
- ☆17Updated last year
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTa☆18Updated 5 years ago
- ☆27Updated this week
- This is the PyTorch implementation of the ACL 2019 paper RankQA: Neural Question Answering with Answer Re-Ranking.☆84Updated 2 years ago
- PyTorch port of BERT ML model☆16Updated 5 years ago
- ☆47Updated 3 years ago
- Implementation of pQRNN in PyTorch☆46Updated 2 years ago
- NoiseMix - data generation for natural language☆41Updated 6 years ago
- An Implementation of Bidirectional Attention Flow☆41Updated 7 years ago
- Enhancing Sentence Embedding with Generalized Pooling☆20Updated last year
- ☆34Updated 5 years ago