giannisdaras / spaCyIRL_slides
Slides from my talk on spaCy IRL, regarding sparse attention.
β12Updated 5 years ago
Alternatives and similar repositories for spaCyIRL_slides
Users that are interested in spaCyIRL_slides are comparing it to the libraries listed below
Sorting:
- A π€-style implementation of BERT using lambda layers instead of self-attentionβ69Updated 4 years ago
- PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models"β96Updated last year
- Easy-to-use text representations extraction library based on the Transformers library.β32Updated 2 years ago
- Auxiliary GAN for WE post-specialisationβ23Updated 6 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".β63Updated 4 years ago
- β31Updated 5 years ago
- Assessing syntactic abilities of BERTβ39Updated 5 years ago
- A tutorial on how to implement models for natural language inference using PyTorch and TorchText. [IN PROGRESS]β26Updated 5 years ago
- A small repo showing how to easily use BERT (or other transformers) for inferenceβ99Updated 5 years ago
- Utils and modules for Speech Language and Multimodal processing using pytorch and pytorch lightningβ22Updated 2 years ago
- Introduction to the recently released T5 model from the paper - Exploring the Limits of Transfer Learning with a Unified Text-to-Text Traβ¦β35Updated 4 years ago
- Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs"β77Updated 3 years ago
- β64Updated 5 years ago
- ULMFiT + Siamese Network for Sentence Vectorsβ34Updated 6 years ago
- β21Updated 6 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselinesβ136Updated last year
- On Generating Extended Summaries of Long Documentsβ78Updated 4 years ago
- LM Pretraining with PyTorch/TPUβ134Updated 5 years ago
- The accompanying code for "Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understandinβ¦β21Updated 5 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.β147Updated 3 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and β¦β51Updated 5 months ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transferβ39Updated 4 years ago
- numeric fused-head identification and resolutionβ33Updated 5 years ago
- Variational Methods for Pretraining in Resource-limited Environmentsβ174Updated 4 years ago
- http://nlp.seas.harvard.edu/2018/04/03/attention.htmlβ62Updated 3 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer β¦β55Updated 4 years ago
- Implementation of unsupervised smoothed inverse frequency (Best Paper, Repl4NLP @ ACL 2018)β77Updated 6 years ago
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTaβ18Updated 5 years ago
- Training Transformer-XL on 128 GPUsβ140Updated 4 years ago
- Fine-tune transformers with pytorch-lightningβ44Updated 3 years ago