harvardnlp / annotated-transformer
An annotated implementation of the Transformer paper.
☆6,585 · Updated last year
Alternatives and similar repositories for annotated-transformer
Users who are interested in annotated-transformer compare it to the libraries listed below.
- A PyTorch implementation of the Transformer model in "Attention is All You Need". ☆9,408 · Updated last year
- Google AI 2018 BERT pytorch implementation ☆6,470 · Updated 2 years ago
- Transformer: PyTorch Implementation of "Attention Is All You Need" ☆4,103 · Updated 2 months ago
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.) ☆7,698 · Updated 4 months ago
- A TensorFlow Implementation of the Transformer: Attention Is All You Need ☆4,417 · Updated 2 years ago
- Must-read Papers on pre-trained language models. ☆3,361 · Updated 2 years ago
- Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText. ☆5,626 · Updated last year
- A concise but complete full-attention transformer with a set of promising experimental features from various papers ☆5,607 · Updated 2 weeks ago
- State-of-the-Art Text Embeddings ☆17,649 · Updated this week
- ☆3,674 · Updated 3 years ago
- Transformer seq2seq model, program that can build a language translator from parallel corpus ☆1,409 · Updated 2 years ago
- Ongoing research training transformer models at scale ☆13,755 · Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch ☆8,813 · Updated last week
- Unsupervised text tokenizer for Neural Network-based text generation. ☆11,331 · Updated last week
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer" ☆6,435 · Updated 5 months ago
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py… ☆24,113 · Updated 2 weeks ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821 ☆3,602 · Updated 11 months ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) ☆9,215 · Updated 2 months ago
- Fast and memory-efficient exact attention ☆19,864 · Updated this week
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆9,199 · Updated this week
- Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research. ☆16,541 · Updated 2 years ago
- Train transformer language models with reinforcement learning. ☆15,818 · Updated this week
- Open Source Neural Machine Translation and (Large) Language Models in PyTorch ☆6,951 · Updated 7 months ago
- Code and model for the paper "Improving Language Understanding by Generative Pre-Training" ☆2,245 · Updated 6 years ago
- Longformer: The Long-Document Transformer ☆2,169 · Updated 2 years ago
- ☆11,848 · Updated 7 months ago
- Must-read papers on prompt-based tuning for pre-trained language models. ☆4,282 · Updated 2 years ago
- Natural Language Processing Tutorial for Deep Learning Researchers ☆14,756 · Updated last year
- Transformer implementation in PyTorch. ☆491 · Updated 6 years ago
- The official GitHub page for the survey paper "A Survey of Large Language Models". ☆11,861 · Updated 7 months ago