s-nlp / annotated-transformer
http://nlp.seas.harvard.edu/2018/04/03/attention.html
☆63Updated 3 years ago
Alternatives and similar repositories for annotated-transformer:
Users that are interested in annotated-transformer are comparing it to the libraries listed below
- Visualising the Transformer encoder☆111Updated 4 years ago
- Distillation of BERT model with catalyst framework☆75Updated last year
- A small library with distillation, quantization and pruning pipelines☆26Updated 3 years ago
- Theoretical Deep Learning: generalization ability☆46Updated 5 years ago
- Pytorch library for end-to-end transformer models training, inference and serving☆70Updated 2 years ago
- A tiny Catalyst-like experiment runner framework on top of micrograd.☆51Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆145Updated 3 years ago
- XAI Tutorial for the Explainable AI track in the ALPS winter school 2021☆58Updated 3 years ago
- Russian RoBERTa☆29Updated 5 years ago
- What are the best Systems? New Perspectives on NLP Benchmarking☆13Updated last year
- Create interactive textual heat maps for Jupiter notebooks☆196Updated 7 months ago
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Updated 5 months ago
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.☆81Updated last year
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 4 years ago
- Interface for easier topic modelling.☆138Updated 5 months ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆133Updated last year
- ☆21Updated 6 years ago
- Learning to Initialize Neural Networks for Stable and Efficient Training☆138Updated 2 years ago
- Check if you have training samples in your test set☆64Updated 2 years ago
- DEREK (Domain Entities and Relations Extraction Kit)☆10Updated last year
- Code for scaling Transformers☆26Updated 4 years ago
- This is an official repository for "Artificial Text Detection via Examining the Topology of Attention Maps" presented at EMNLP 2021 confe…☆22Updated last year
- ☆74Updated 3 years ago
- RuREBus shared task repo☆30Updated 4 years ago
- Lightweight knowledge distillation pipeline☆28Updated 3 years ago
- Training Transformer-XL on 128 GPUs☆140Updated 4 years ago
- A deep learning library based on Pytorch focussed on low resource language research and robustness☆69Updated 3 years ago
- ML project workflow☆7Updated 4 years ago
- Source code for the ACL workshop paper and Kaggle competition by Google AI team☆40Updated 3 years ago