s-nlp / annotated-transformer
http://nlp.seas.harvard.edu/2018/04/03/attention.html
☆62Updated 3 years ago
Alternatives and similar repositories for annotated-transformer:
Users that are interested in annotated-transformer are comparing it to the libraries listed below
- Visualising the Transformer encoder☆111Updated 4 years ago
- Distillation of BERT model with catalyst framework☆76Updated last year
- Create interactive textual heat maps for Jupiter notebooks☆196Updated 10 months ago
- A small library with distillation, quantization and pruning pipelines☆26Updated 3 years ago
- A tiny Catalyst-like experiment runner framework on top of micrograd.☆51Updated 4 years ago
- Theoretical Deep Learning: generalization ability☆46Updated 5 years ago
- Pytorch library for end-to-end transformer models training, inference and serving☆70Updated 2 years ago
- Russian RoBERTa☆29Updated 5 years ago
- DEREK (Domain Entities and Relations Extraction Kit)☆10Updated last year
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Updated 8 months ago
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.☆81Updated last year
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆146Updated 3 years ago
- ☆21Updated 6 years ago
- XAI Tutorial for the Explainable AI track in the ALPS winter school 2021☆58Updated 4 years ago
- My implementation of DeepMind's Perceiver☆63Updated 3 years ago
- ☆108Updated 2 years ago
- nlp workshop at datafest siberia 2019☆22Updated 2 years ago
- Exploratory search engine based on hierarchical topic models from BigARTM☆13Updated 3 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆136Updated last year
- Unofficial implementation of Perceiver IO☆120Updated 2 years ago
- ☆153Updated 4 years ago
- Lightweight knowledge distillation pipeline☆28Updated 3 years ago
- ☆102Updated 4 years ago
- Our solution of the Kaggle Abstraction and Reasoning Challenge☆22Updated 4 years ago
- Repository with all material for SMILES, the Summer School of Machine Learning at Skoltech, taking place from the 16th to the 21st of Aug…☆55Updated 4 years ago
- Language Modeling Example with Transformers and PyTorch Lighting☆65Updated 4 years ago
- ☆64Updated 5 years ago
- Jupyter active learning annotator widget.☆22Updated 4 years ago