giannisdaras / spaCyIRL_slides
Slides from my talk at spaCy IRL on sparse attention.
☆12Updated 6 years ago
Alternatives and similar repositories for spaCyIRL_slides
Users who are interested in spaCyIRL_slides are comparing it to the repositories listed below.
- PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models"☆96Updated last year
- ULMFiT + Siamese Network for Sentence Vectors☆33Updated 6 years ago
- numeric fused-head identification and resolution☆33Updated 5 years ago
- Easy-to-use text representations extraction library based on the Transformers library.☆32Updated 2 years ago
- ☆31Updated 5 years ago
- Training Transformer-XL on 128 GPUs☆140Updated 5 years ago
- Exploring Random Encoders for Sentence Classification☆183Updated 5 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 4 years ago
- Create interactive textual heat maps for Jupyter notebooks☆196Updated last year
- Let's put all materials into this repository☆49Updated 5 years ago
- The Annotated Encoder Decoder with Attention☆166Updated 4 years ago
- ☆21Updated 6 years ago
- Auxiliary GAN for WE post-specialisation☆24Updated 6 years ago
- Scripts to train a bidirectional LSTM with knowledge distillation from BERT☆158Updated 5 years ago
- Variational Methods for Pretraining in Resource-limited Environments☆174Updated 5 years ago
- NLP library designed for reproducible experimentation management☆294Updated last year
- A tutorial on how to implement models for natural language inference using PyTorch and TorchText. [IN PROGRESS]☆26Updated 5 years ago
- LM Pretraining with PyTorch/TPU☆135Updated 5 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆64Updated 4 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Updated 5 years ago
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation☆63Updated 6 years ago
- Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Process…☆251Updated 7 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆137Updated last year
- 25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural languag…☆84Updated 6 years ago
- Misspelling Oblivious Word Embeddings☆201Updated 6 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Updated 4 years ago
- The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761☆282Updated 5 years ago
- Gendered Ambiguous Pronouns Shared Task☆31Updated 2 years ago
- Datasets I have created for scientific summarization, and a trained BertSum model☆114Updated 5 years ago
- Code for EMNLP 2018 paper "Auto-Encoding Dictionary Definitions into Consistent Word Embeddings"☆36Updated 6 years ago