tacchinotacchi / distil-bilstm
Scripts to train a bidirectional LSTM with knowledge distillation from BERT
☆159 Updated 6 years ago
Alternatives and similar repositories for distil-bilstm
Users who are interested in distil-bilstm are comparing it to the repositories listed below.
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs ☆106 Updated 3 years ago
- XLNet for generating language. ☆166 Updated 4 years ago
- Pre-training of Language Models for Language Understanding ☆83 Updated 6 years ago
- LM, ULMFit et al. ☆46 Updated 6 years ago
- The Annotated Encoder Decoder with Attention ☆167 Updated 4 years ago
- Implementation of ULMFit algorithm for text classification via transfer learning ☆94 Updated 6 years ago
- ☆58 Updated 6 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes" ☆75 Updated 6 years ago
- XLNet: fine tuning on RTX 2080 GPU - 8 GB ☆155 Updated 6 years ago
- Gendered Ambiguous Pronouns Shared Task ☆31 Updated 3 years ago
- Python package for understanding the difficulty of text classification datasets. (in CoNLL 2018) ☆64 Updated 4 years ago
- LM Pretraining with PyTorch/TPU ☆137 Updated 6 years ago
- Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch ☆62 Updated 7 years ago
- Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2 ☆64 Updated 3 years ago
- ☆47 Updated 6 years ago
- BERT, which stands for Bidirectional Encoder Representations from Transformers, is the SOTA in Transfer Learning in NLP. ☆56 Updated 5 years ago
- Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation" ☆195 Updated 6 years ago
- DeepThought's solution ☆80 Updated 2 years ago
- A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and T… ☆211 Updated 4 years ago
- Neural Abstractive Text Summarization with Sequence-to-Sequence Models ☆158 Updated 6 years ago
- Comparing Text Classification results using BERT embedding and ULMFIT embedding ☆65 Updated 6 years ago
- State of the Art results in Intent Classification using Semantic Hashing for three datasets: AskUbuntu, Chatbot and WebApplication. ☆134 Updated 5 years ago
- ULMFiT + Siamese Network for Sentence Vectors ☆33 Updated 7 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines ☆137 Updated 2 years ago
- Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Process… ☆250 Updated 7 years ago
- Datasets I have created for scientific summarization, and a trained BertSum model ☆116 Updated 6 years ago
- NLP library designed for reproducible experimentation management ☆294 Updated last year
- ☆324 Updated 3 years ago
- We summarize the summarization papers presented at major conferences (starting with ACL 2019) ☆84 Updated 6 years ago
- Neural Text Generation with Unlikelihood Training ☆310 Updated 4 years ago