harvardnlp / annotated-transformer
An annotated implementation of the Transformer paper.
☆5,733Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for annotated-transformer
- A PyTorch implementation of the Transformer model in "Attention is All You Need".☆8,878Updated 7 months ago
- Google AI 2018 BERT pytorch implementation☆6,223Updated last year
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)☆6,952Updated last year
- Transformer: PyTorch Implementation of "Attention Is All You Need"☆3,039Updated 3 months ago
- Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.☆5,381Updated 10 months ago
- ☆10,467Updated 6 months ago
- Ongoing research training transformer models at scale☆10,595Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,415Updated 2 weeks ago
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…☆7,958Updated this week
- Open Source Neural Machine Translation and (Large) Language Models in PyTorch☆6,773Updated 4 months ago
- Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.☆15,560Updated last year
- ☆3,612Updated 2 years ago
- Unsupervised text tokenizer for Neural Network-based text generation.☆10,295Updated 2 weeks ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,178Updated 2 months ago
- A concise but complete full-attention transformer with a set of promising experimental features from various papers☆4,793Updated this week
- Model interpretability and understanding for PyTorch☆4,935Updated this week
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,426Updated last month
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,182Updated last year
- Models, data loaders and abstractions for language processing, powered by PyTorch☆3,517Updated this week
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆20,652Updated last week
- A TensorFlow Implementation of the Transformer: Attention Is All You Need☆4,288Updated last year
- Fast and memory-efficient exact attention☆14,279Updated this week
- A Unified Library for Parameter-Efficient and Modular Transfer Learning☆2,581Updated 2 weeks ago
- tensorboard for pytorch (and chainer, mxnet, numpy, ...)☆7,871Updated this week
- Model summary in PyTorch similar to `model.summary()` in Keras☆4,020Updated 8 months ago
- Data augmentation for NLP☆4,454Updated 4 months ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆8,524Updated this week
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,892Updated last year
- End-to-End Object Detection with Transformers☆13,626Updated 8 months ago
- An open-source NLP research library, built on PyTorch.☆11,759Updated last year