tacchinotacchi / distil-bilstmLinks
Scripts to train a bidirectional LSTM with knowledge distillation from BERT
☆158Updated 5 years ago
Alternatives and similar repositories for distil-bilstm
Users that are interested in distil-bilstm are comparing it to the libraries listed below
Sorting:
- LM, ULMFit et al.☆46Updated 5 years ago
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs☆106Updated 2 years ago
- Implementation of ULMFit algorithm for text classification via transfer learning☆94Updated 6 years ago
- XLNet: fine tuning on RTX 2080 GPU - 8 GB☆154Updated 6 years ago
- XLNet for generating language.☆166Updated 4 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"☆75Updated 6 years ago
- ☆58Updated 6 years ago
- LM Pretraining with PyTorch/TPU☆136Updated 6 years ago
- Gendered Ambiguous Pronouns Shared Task☆31Updated 2 years ago
- Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2☆64Updated 2 years ago
- ☆47Updated 6 years ago
- Python package for understanding the difficulty of text classification datasets. (in CoNNL 2018)☆64Updated 4 years ago
- DeepThought's solution☆80Updated 2 years ago
- Comparing Text Classification results using BERT embedding and ULMFIT embedding☆65Updated 6 years ago
- A collection of resources on using BERT (https://arxiv.org/abs/1810.04805 ) and related Language Models in production environments.☆96Updated 4 years ago
- Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing☆95Updated 5 years ago
- Transformer-XL with checkpoint loader☆68Updated 3 years ago
- Pre-training of Language Models for Language Understanding☆83Updated 6 years ago
- The Annotated Encoder Decoder with Attention☆166Updated 4 years ago
- State of the Art results in Intent Classification using Sematic Hashing for three datasets: AskUbuntu, Chatbot and WebApplication.☆134Updated 5 years ago
- Knowledge Distillation For Transformer Language Models☆52Updated last year
- Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation"☆195Updated 5 years ago
- a Pytorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB)☆53Updated 2 years ago
- Datasets I have created for scientific summarization, and a trained BertSum model☆115Updated 6 years ago
- Let's put all materials into this repository☆49Updated 5 years ago
- Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch☆62Updated 6 years ago
- A tutorial on how to implement models for natural language inference using PyTorch and TorchText. [IN PROGRESS]☆26Updated 5 years ago
- Neural Abstractive Text Summarization with Sequence-to-Sequence Models☆158Updated 6 years ago
- We summarize the summarization papers presented at major conferences (starting with ACL 2019)☆85Updated 5 years ago
- Text Generation Using A Variational Autoencoder☆110Updated 8 years ago