s-nlp / annotated-transformer
http://nlp.seas.harvard.edu/2018/04/03/attention.html
☆62Updated 3 years ago
Alternatives and similar repositories for annotated-transformer:
Users that are interested in annotated-transformer are comparing it to the libraries listed below
- Visualising the Transformer encoder☆111Updated 4 years ago
- A small library with distillation, quantization and pruning pipelines☆26Updated 4 years ago
- Pytorch library for end-to-end transformer models training, inference and serving☆70Updated 2 weeks ago
- Distillation of BERT model with catalyst framework☆78Updated last year
- Russian RoBERTa☆29Updated 5 years ago
- Theoretical Deep Learning: generalization ability☆46Updated 5 years ago
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Updated 9 months ago
- A tiny Catalyst-like experiment runner framework on top of micrograd.☆51Updated 4 years ago
- DEREK (Domain Entities and Relations Extraction Kit)☆10Updated last year
- RuREBus shared task repo☆30Updated 4 years ago
- XAI Tutorial for the Explainable AI track in the ALPS winter school 2021☆58Updated 4 years ago
- Create interactive textual heat maps for Jupiter notebooks☆196Updated 11 months ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 4 years ago
- Interface for easier topic modelling.☆139Updated 9 months ago
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.☆81Updated last year
- What are the best Systems? New Perspectives on NLP Benchmarking☆13Updated 2 years ago
- Probing suite for evaluation of Russian embedding and language models☆33Updated 7 months ago
- ☆103Updated 4 years ago
- Code for scaling Transformers☆26Updated 4 years ago
- Training Transformer-XL on 128 GPUs☆140Updated 4 years ago
- nlp workshop at datafest siberia 2019☆22Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Updated 3 years ago
- Source code for the ACL workshop paper and Kaggle competition by Google AI team☆40Updated 3 years ago
- ☆21Updated 6 years ago
- ☆76Updated 3 years ago
- Course "Theories of Deep Learning"☆196Updated 5 years ago
- Code for BERT classifier finetuning for multiclass text classification☆70Updated 3 years ago
- Implementation of the GBST block from the Charformer paper, in Pytorch☆116Updated 3 years ago
- Active learning☆78Updated 2 years ago
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago