tacchinotacchi / distil-bilstm
Scripts to train a bidirectional LSTM with knowledge distillation from BERT
☆158Updated 5 years ago
Alternatives and similar repositories for distil-bilstm:
Users that are interested in distil-bilstm are comparing it to the libraries listed below
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs☆106Updated 2 years ago
- XLNet: fine tuning on RTX 2080 GPU - 8 GB☆154Updated 5 years ago
- XLNet for generating language.☆165Updated 4 years ago
- Variational Methods for Pretraining in Resource-limited Environments☆174Updated 4 years ago
- LM, ULMFit et al.☆46Updated 5 years ago
- Implementation of ULMFit algorithm for text classification via transfer learning☆94Updated 6 years ago
- The Annotated Encoder Decoder with Attention☆166Updated 3 years ago
- ☆47Updated 5 years ago
- A set of tutorials for torchtext☆186Updated 6 years ago
- Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch☆61Updated 6 years ago
- Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing☆93Updated 5 years ago
- Pytorch Implementation of ALBERT(A Lite BERT for Self-supervised Learning of Language Representations)☆226Updated 3 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"☆75Updated 5 years ago
- Comparing Text Classification results using BERT embedding and ULMFIT embedding☆65Updated 6 years ago
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago
- ☆58Updated 5 years ago
- Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation"☆195Updated 5 years ago
- Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2☆64Updated 2 years ago
- Pytorch and Torchtext implementation of Sequence to sequence☆59Updated 7 years ago
- Easy to use NLP library built on PyTorch and TorchText☆254Updated 5 years ago
- ☆319Updated 2 years ago
- Re-implementation of ELMo on Keras☆134Updated last year
- PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset☆122Updated 5 years ago
- Multilingual hierarchical attention networks toolkit☆77Updated 5 years ago
- A PyTorch implementation of a Bi-LSTM CRF with character-level features☆63Updated 6 years ago
- NLP research experiments, built on PyTorch within the AllenNLP framework.☆91Updated 11 months ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆361Updated 2 years ago
- Sequence to Sequence Models in PyTorch☆44Updated 6 months ago
- BertQA - Attention on Steroids☆115Updated 2 years ago
- Transformer-XL with checkpoint loader☆68Updated 3 years ago