zbloss / TransformerModel
☆58 Updated 5 years ago
Alternatives and similar repositories for TransformerModel:
Users who are interested in TransformerModel are comparing it to the libraries listed below:
- Implementation of ULMFit algorithm for text classification via transfer learning ☆94 Updated 6 years ago
- Scripts to train a bidirectional LSTM with knowledge distillation from BERT ☆158 Updated 5 years ago
- Code for my blog post ☆49 Updated 6 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes" ☆75 Updated 6 years ago
- LM, ULMFit et al. ☆46 Updated 5 years ago
- ULMFiT + Siamese Network for Sentence Vectors ☆34 Updated 6 years ago
- MT Tutorial for the JSALT 2019 Summer School ☆48 Updated 5 years ago
- Exploring Random Encoders for Sentence Classification ☆183 Updated 5 years ago
- ☆46 Updated 2 years ago
- Fine Tuning Language Models for Multilabel Prediction ☆61 Updated 2 years ago
- PyTorch and Torchtext implementation of Sequence to sequence ☆59 Updated 7 years ago
- Machine-generated summaries and highlights of every accepted paper at Thirty-second Conference on Neural Information Processing Syste… ☆71 Updated 6 years ago
- Pre-training of Language Models for Language Understanding ☆83 Updated 5 years ago
- Code for "Learning to Generate Reviews and Discovering Sentiment" ☆16 Updated 7 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods ☆95 Updated 4 years ago
- The notes for Math, Machine Learning, Deep Learning and Research papers. ☆52 Updated 5 years ago
- XLNet: fine tuning on RTX 2080 GPU - 8 GB ☆154 Updated 5 years ago
- Some frequently used NLP blocks I implemented ☆226 Updated 6 years ago
- Language Model Fine-tuning for Moby Dick ☆42 Updated 6 years ago
- Code for the Eager Translation Model from the paper "You May Not Need Attention" ☆294 Updated 6 years ago
- Comparing Text Classification results using BERT embedding and ULMFIT embedding ☆65 Updated 6 years ago
- Beam search for neural network sequence to sequence (encoder-decoder) models. ☆34 Updated 6 years ago
- Multilingual hierarchical attention networks toolkit ☆77 Updated 5 years ago
- Example showing generalisation ☆69 Updated 4 years ago
- Quasi-RNN for language modeling ☆57 Updated 8 years ago
- ☆47 Updated 6 years ago
- A spell checker built from GloVe word vectors ☆81 Updated 6 years ago
- A PyTorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB) ☆53 Updated 2 years ago
- Interactive explorer for language models ☆133 Updated 3 years ago
- ☆42 Updated 6 years ago