harvardnlp / annotated-transformer

An annotated implementation of the Transformer paper.

☆6,585

Alternatives and similar repositories for annotated-transformer

Users who are interested in annotated-transformer are comparing it to the libraries listed below.

jadore801120 / attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
☆9,408 Updated last year
codertimo / BERT-pytorch
Google AI 2018 BERT pytorch implementation
☆6,470 Updated 2 years ago
hyunwoongko / transformer
Transformer: PyTorch Implementation of "Attention Is All You Need"
☆4,103 Updated 2 months ago
jessevig / bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
☆7,698 Updated 4 months ago
Kyubyong / transformer
A TensorFlow Implementation of the Transformer: Attention Is All You Need
☆4,417 Updated 2 years ago
thunlp / PLMpapers
Must-read Papers on pre-trained language models.
☆3,361 Updated 2 years ago
bentrevett / pytorch-seq2seq
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
☆5,626 Updated last year
lucidrains / x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
☆5,607 Updated 2 weeks ago
UKPLab / sentence-transformers
State-of-the-Art Text Embeddings
☆17,649 Updated this week
kimiyoung / transformer-xl
☆3,674 Updated 3 years ago
SamLynnEvans / Transformer
Transformer seq2seq model, program that can build a language translator from parallel corpus
☆1,409 Updated 2 years ago
NVIDIA / Megatron-LM
Ongoing research training transformer models at scale
☆13,755 Updated this week
NVIDIA / apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
☆8,813 Updated last week
google / sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
☆11,331 Updated last week
google-research / text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,435 Updated 5 months ago
lucidrains / vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆24,113 Updated 2 weeks ago
princeton-nlp / SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
☆3,602 Updated 11 months ago
arogozhnikov / einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
☆9,215 Updated 2 months ago
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆19,864 Updated this week
huggingface / accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…
☆9,199 Updated this week
tensorflow / tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
☆16,541 Updated 2 years ago
huggingface / trl
Train transformer language models with reinforcement learning.
☆15,818 Updated this week
OpenNMT / OpenNMT-py
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
☆6,951 Updated 7 months ago
openai / finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
☆2,245 Updated 6 years ago
allenai / longformer
Longformer: The Long-Document Transformer
☆2,169 Updated 2 years ago
google-research / vision_transformer
☆11,848 Updated 7 months ago
thunlp / PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
☆4,282 Updated 2 years ago
graykode / nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
☆14,756 Updated last year
tunz / transformer-pytorch
Transformer implementation in PyTorch.
☆491 Updated 6 years ago
RUCAIBox / LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
☆11,861 Updated 7 months ago