tacchinotacchi / distil-bilstm
Scripts to train a bidirectional LSTM with knowledge distillation from BERT
☆159 · Updated 6 years ago
Alternatives and similar repositories for distil-bilstm
Users who are interested in distil-bilstm are comparing it to the libraries listed below.
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs ☆106 · Updated 3 years ago
- LM, ULMFit et al. ☆46 · Updated 5 years ago
- ☆58 · Updated 6 years ago
- Implementation of ULMFit algorithm for text classification via transfer learning ☆94 · Updated 6 years ago
- XLNet for generating language. ☆166 · Updated 4 years ago
- ☆47 · Updated 6 years ago
- Pre-training of Language Models for Language Understanding ☆83 · Updated 6 years ago
- XLNet: fine tuning on RTX 2080 GPU - 8 GB ☆155 · Updated 6 years ago
- Gendered Ambiguous Pronouns Shared Task ☆31 · Updated 3 years ago
- Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch ☆62 · Updated 6 years ago
- Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation" ☆195 · Updated 6 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes" ☆75 · Updated 6 years ago
- Python package for understanding the difficulty of text classification datasets. (in CoNLL 2018) ☆64 · Updated 4 years ago
- The Annotated Encoder Decoder with Attention ☆167 · Updated 4 years ago
- PyTorch and Torchtext implementation of Sequence to sequence ☆59 · Updated 7 years ago
- A collection of resources on using BERT (https://arxiv.org/abs/1810.04805) and related Language Models in production environments. ☆96 · Updated 4 years ago
- Text Generation Using A Variational Autoencoder ☆110 · Updated 8 years ago
- LM Pretraining with PyTorch/TPU ☆136 · Updated 6 years ago
- Comparing Text Classification results using BERT embedding and ULMFIT embedding ☆65 · Updated 6 years ago
- State of the Art results in Intent Classification using Semantic Hashing for three datasets: AskUbuntu, Chatbot and WebApplication. ☆134 · Updated 5 years ago
- Transformer-XL with checkpoint loader ☆68 · Updated 3 years ago
- Knowledge Distillation For Transformer Language Models ☆53 · Updated last year
- PyTorch implementation of OpenAI-GPT for ROC stories ☆51 · Updated 6 years ago
- A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and T… ☆211 · Updated 4 years ago
- Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2 ☆64 · Updated 3 years ago
- Preprocessing Library for Natural Language Processing ☆166 · Updated 2 years ago
- PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset ☆123 · Updated 6 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines ☆137 · Updated 2 years ago
- NLP library designed for reproducible experimentation management ☆294 · Updated last year
- Encoding position with the word embeddings. ☆84 · Updated 7 years ago