lyeoni / gpt-pytorch

PyTorch Implementation of OpenAI GPT

☆127

Alternatives and similar repositories for gpt-pytorch

Users who are interested in gpt-pytorch are comparing it to the libraries listed below.


lucidrains / electra-pytorch
A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch
☆235 Updated 2 years ago
lucidrains / mlm-pytorch
An implementation of masked language modeling for Pytorch, made as concise and simple as possible
☆179 Updated 2 years ago
facebookresearch / transformer-sequential
Trains Transformer model variants. Data isn't shuffled between batches.
☆143 Updated 3 years ago
richarddwang / electra_pytorch
Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated!)
☆330 Updated last year
Andras7 / word2vec-pytorch
Extremely simple and fast word2vec implementation with Negative Sampling + Sub-sampling
☆189 Updated 4 years ago
LiyuanLucasLiu / Transformer-Clinic
Understanding the Difficulty of Training Transformers
☆332 Updated 3 years ago
czyssrs / Few-Shot-NLG
Code and Data for ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model"
☆190 Updated 5 months ago
affjljoo3581 / GPT2
PyTorch Implementation of OpenAI GPT-2
☆348 Updated last year
rish-16 / aft-pytorch
Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.
☆243 Updated 3 years ago
graykode / ALBERT-Pytorch
Pytorch Implementation of ALBERT (A Lite BERT for Self-supervised Learning of Language Representations)
☆228 Updated 4 years ago
yang-zhang / lightning-language-modeling
Language Modeling Example with Transformers and PyTorch Lightning
☆65 Updated 5 years ago
clarkkev / attention-analysis
☆470 Updated 4 years ago
lucidrains / compressive-transformer-pytorch
Pytorch implementation of Compressive Transformers, from Deepmind
☆162 Updated 4 years ago
vinsis / math-and-ml-notes
Books, papers and links to latest research in ML/AI
☆91 Updated last year
laiguokun / Funnel-Transformer
☆219 Updated 5 years ago
Rick-McCoy / Reformer-pytorch
Implements Reformer: The Efficient Transformer in pytorch.
☆86 Updated 5 years ago
facebookresearch / GraphLog
API for accessing the GraphLog dataset
☆90 Updated last year
bentrevett / a-tour-of-pytorch-optimizers
A tour of different optimization algorithms in PyTorch.
☆99 Updated 3 years ago
lucidrains / routing-transformer
Fully featured implementation of Routing Transformer
☆296 Updated 4 years ago
yk / huggingface-nlp-demo
☆46 Updated 5 years ago
dreamgonfly / BERT-pytorch
PyTorch implementation of BERT in "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
☆110 Updated 7 years ago
sailab-code / gnn
Graph Neural Network
☆44 Updated 4 years ago
lucidrains / feedback-transformer-pytorch
Implementation of Feedback Transformer in Pytorch
☆108 Updated 4 years ago
williamFalcon / minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆27 Updated 3 years ago
shentianxiao / text-autoencoders
☆212 Updated last year
kzl / universal-computation
Official codebase for Pretrained Transformers as Universal Computation Engines.
☆247 Updated 3 years ago
lena-voita / the-story-of-heads
This is a repository with the code for the ACL 2019 paper "Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, t…
☆315 Updated 4 years ago
epfml / collaborative-attention
Code for Multi-Head Attention: Collaborate Instead of Concatenate
☆151 Updated 2 years ago
SeanNaren / minGPT
A minimal PyTorch Lightning OpenAI GPT with DeepSpeed training!
☆113 Updated 2 years ago
tatp22 / linformer-pytorch
My take on a practical implementation of Linformer for Pytorch.
☆421 Updated 3 years ago