tacchinotacchi / distil-bilstm
Scripts to train a bidirectional LSTM with knowledge distillation from BERT
☆157Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for distil-bilstm
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs☆106Updated 2 years ago
- XLNet: fine tuning on RTX 2080 GPU - 8 GB☆154Updated 5 years ago
- XLNet for generating language.☆165Updated 3 years ago
- Comparing Text Classification results using BERT embedding and ULMFIT embedding☆65Updated 5 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"☆76Updated 5 years ago
- Variational Methods for Pretraining in Resource-limited Environments☆174Updated 4 years ago
- LM Pretraining with PyTorch/TPU☆132Updated 5 years ago
- Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation"☆195Updated 5 years ago
- ☆58Updated 5 years ago
- Pytorch Implementation of ALBERT(A Lite BERT for Self-supervised Learning of Language Representations)☆225Updated 3 years ago
- Neural Text Generation with Unlikelihood Training☆310Updated 3 years ago
- Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2☆64Updated last year
- Transformer-XL with checkpoint loader☆68Updated 2 years ago
- A collection of resources on using BERT (https://arxiv.org/abs/1810.04805 ) and related Language Models in production environments.☆94Updated 3 years ago
- Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index (DenSPI)☆200Updated last year
- Easy to use NLP library built on PyTorch and TorchText☆254Updated 4 years ago
- Pytorch and Torchtext implementation of Sequence to sequence☆60Updated 6 years ago
- A set of tutorials for torchtext☆186Updated 5 years ago
- Re-implementation of ELMo on Keras☆135Updated last year
- DeepThought's solution☆80Updated last year
- A short tutorial on Elmo training (Pre trained, Training on new data, Incremental training)☆155Updated 4 years ago
- ☆122Updated last year
- LM, ULMFit et al.☆47Updated 4 years ago
- Preprocessing Library for Natural Language Processing☆161Updated last year
- Inference with state-of-the-art models (pre-trained by LD-Net / AutoNER / VanillaNER / ...)☆116Updated 5 years ago
- An Attention Layer in Keras☆43Updated 5 years ago
- ☆97Updated 4 years ago