hsajjad / transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
★68 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for transformers
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ★145 · Updated 3 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention ★70 · Updated 4 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and … ★49 · Updated 4 years ago
- Easy-to-use text representations extraction library based on the Transformers library. ★32 · Updated 2 years ago
- Pre-training of Language Models for Language Understanding ★83 · Updated 5 years ago
- ★58 · Updated 5 years ago
- LM Pretraining with PyTorch/TPU ★132 · Updated 5 years ago
- ELECTRA MODEL NLP ★13 · Updated 4 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models ★57 · Updated last year
- Language Model Fine-tuning for Moby Dick ★42 · Updated 5 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime. ★126 · Updated 3 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines ★131 · Updated last year
- Tools for training PyTorch language models ★27 · Updated 3 years ago
- A package for fine-tuning Transformers with TPUs, written in TensorFlow 2.0+ ★37 · Updated 3 years ago
- Introduction to the recently released T5 model from the paper - Exploring the Limits of Transfer Learning with a Unified Text-to-Text Tra… ★35 · Updated 4 years ago
- ULMFiT + Siamese Network for Sentence Vectors ★35 · Updated 6 years ago
- ★46 · Updated 4 years ago
- Subword Language Model for Query Auto-Completion ★67 · Updated 5 years ago
- The accompanying code for "Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understandin… ★21 · Updated 5 years ago
- Boolean Question Answering with multi-task learning, using large LM embeddings such as BERT and RoBERTa ★18 · Updated 5 years ago
- ★64 · Updated 4 years ago
- shabeelkandi / Handling-Out-of-Vocabulary-Words-in-Natural-Language-Processing-using-Language-Modelling ★68 · Updated 5 years ago
- Scripts to train a bidirectional LSTM with knowledge distillation from BERT ★157 · Updated 4 years ago
- Preprocessing Library for Natural Language Processing ★160 · Updated last year
- Create interactive textual heat maps for Jupyter notebooks ★196 · Updated 5 months ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer ★39 · Updated 4 years ago
- On Generating Extended Summaries of Long Documents ★77 · Updated 3 years ago
- Transformer-based Trigram Blocking implementation in TensorFlow ★11 · Updated 4 years ago
- Minimal Interactive Attention Visualization ★138 · Updated 4 years ago