giannisdaras / spaCyIRL_slidesLinks
Slides from my talk on spaCy IRL, regarding sparse attention.
☆12Updated 6 years ago
Alternatives and similar repositories for spaCyIRL_slides
Users that are interested in spaCyIRL_slides are comparing it to the libraries listed below
Sorting:
- ULMFiT + Siamese Network for Sentence Vectors☆33Updated 7 years ago
- PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models"☆96Updated 2 years ago
- Auxiliary GAN for WE post-specialisation☆24Updated 6 years ago
- Create interactive textual heat maps for Jupiter notebooks☆196Updated last year
- Exploring Random Encoders for Sentence Classification☆183Updated 5 years ago
- Training Transformer-XL on 128 GPUs☆141Updated 5 years ago
- numeric fused-head identification and resolution☆33Updated 6 years ago
- NLP library designed for reproducible experimentation management☆294Updated last year
- Easy-to-use text representations extraction library based on the Transformers library.☆32Updated 3 years ago
- A tutorial on how to implement models for natural language inference using PyTorch and TorchText. [IN PROGRESS]☆26Updated 5 years ago
- The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761☆282Updated 5 years ago
- Datasets I have created for scientific summarization, and a trained BertSum model☆115Updated 6 years ago
- Let's put all materials into this repository☆49Updated 5 years ago
- shabeelkandi / Handling-Out-of-Vocabulary-Words-in-Natural-Language-Processing-using-Language-Modelling☆69Updated 6 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 5 years ago
- Variational Methods for Pretraining in Resource-limited Environments☆174Updated 5 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆137Updated 2 years ago
- Utility scripts in Python☆37Updated 5 months ago
- Repository for the ACL 2020 virtual conference website (work in progress)☆39Updated 3 years ago
- Misspelling Oblivious Word Embeddings☆201Updated 6 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Updated 5 years ago
- Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Process…☆250Updated 7 years ago
- Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"☆30Updated 6 years ago
- LM Pretraining with PyTorch/TPU☆136Updated 6 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆64Updated 5 years ago
- Source code for the ACL workshop paper and Kaggle competition by Google AI team☆41Updated 3 years ago
- Scripts to train a bidirectional LSTM with knowledge distillation from BERT☆159Updated 6 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"☆75Updated 6 years ago
- An Interactive Tool for Scalable and Reproducible Error Analysis.☆109Updated 4 years ago
- The accompanying code for "Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understandin…☆21Updated 6 years ago