tacchinotacchi / distil-bilstm
Scripts to train a bidirectional LSTM with knowledge distillation from BERT
☆158 · Updated 5 years ago
Alternatives and similar repositories for distil-bilstm
Users who are interested in distil-bilstm are comparing it to the libraries listed below.
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs ☆107 · Updated 2 years ago
- XLNet: fine tuning on RTX 2080 GPU - 8 GB ☆154 · Updated 6 years ago
- ☆58 · Updated 5 years ago
- Implementation of ULMFit algorithm for text classification via transfer learning ☆94 · Updated 6 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes" ☆75 · Updated 6 years ago
- XLNet for generating language. ☆165 · Updated 4 years ago
- LM, ULMFit et al. ☆46 · Updated 5 years ago
- DeepThought's solution ☆80 · Updated last year
- Pytorch Implementation of ALBERT (A Lite BERT for Self-supervised Learning of Language Representations) ☆226 · Updated 4 years ago
- Load GPT-2 checkpoint and generate texts ☆127 · Updated 3 years ago
- LM Pretraining with PyTorch/TPU ☆134 · Updated 5 years ago
- Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation" ☆195 · Updated 5 years ago
- The Annotated Encoder Decoder with Attention ☆166 · Updated 4 years ago
- State of the Art results in Intent Classification using Semantic Hashing for three datasets: AskUbuntu, Chatbot and WebApplication. ☆134 · Updated 5 years ago
- Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch ☆61 · Updated 6 years ago
- ☆323 · Updated 2 years ago
- Pre-training of Language Models for Language Understanding ☆83 · Updated 5 years ago
- Pytorch and Torchtext implementation of Sequence to sequence ☆59 · Updated 7 years ago
- Comparing Text Classification results using BERT embedding and ULMFIT embedding ☆65 · Updated 6 years ago
- Datasets I have created for scientific summarization, and a trained BertSum model ☆114 · Updated 5 years ago
- Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2 ☆64 · Updated 2 years ago
- NLP library designed for reproducible experimentation management ☆293 · Updated 11 months ago
- Explains nlp building blocks in a simple manner. ☆251 · Updated 5 years ago
- ☆47 · Updated 6 years ago
- Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing ☆95 · Updated 5 years ago
- QnA bot powered by CoQA + BERT ☆39 · Updated 2 years ago
- ☆96 · Updated 5 years ago
- Variational Methods for Pretraining in Resource-limited Environments ☆174 · Updated 4 years ago
- Official code and data repository for our ACL 2019 long paper "Generating Question-Answer Hierarchies" (https://arxiv.org/abs/1906.02622)… ☆93 · Updated 11 months ago
- Transformer-XL with checkpoint loader ☆68 · Updated 3 years ago