ConnorJL / GPT2Links

An implementation of training for GPT2, supports TPUs

☆1,424

Alternatives and similar repositories for GPT2

Users that are interested in GPT2 are comparing it to the libraries listed below

Sorting:

salesforce / ctrl
Conditional Transformer Language Model for Controllable Generation
☆1,885Updated 3 months ago
graykode / gpt-2-Pytorch
Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation
☆1,005Updated 6 years ago
rish-16 / gpt2client
✍🏻 gpt2-client: Easy-to-use TensorFlow Wrapper for GPT-2 117M, 345M, 774M, and 1.5B Transformer Models 🤖 📝
☆372Updated 4 years ago
imcaspar / gpt2-ml
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
☆1,712Updated 2 years ago
nshepperd / gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
☆1,148Updated 2 years ago
openai / finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
☆2,223Updated 6 years ago
openai / gpt-2-output-dataset
Dataset of GPT-2 outputs for research in detection, biases, and more
☆1,991Updated last year
Smerity / sha-rnn
Single Headed Attention RNN - "Stop thinking with your head"
☆1,182Updated 3 years ago
google-research / albert
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
☆3,274Updated 2 years ago
asyml / texar
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…
☆2,388Updated 3 years ago
huggingface / transfer-learning-conv-ai
🦄 State-of-the-Art Conversational AI with Transfer Learning
☆1,754Updated 2 years ago
microsoft / NeuronBlocks
NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
☆1,456Updated 2 years ago
huggingface / pytorch-openai-transformer-lm
🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI
☆1,512Updated 3 years ago
salesforce / decaNLP
The Natural Language Decathlon: A Multitask Challenge for NLP
☆2,349Updated 3 months ago
minimaxir / gpt-2-simple
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
☆3,408Updated 2 years ago
zihangdai / xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
☆6,182Updated 2 years ago
kimiyoung / transformer-xl
☆3,665Updated 2 years ago
google-research / electra
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆2,358Updated last year
TsinghuaAI / CPM-1-Generate
Chinese Pre-Trained Language Models (CPM-LM) Version-I
☆1,583Updated 2 years ago
microsoft / DialoGPT
Large-scale pretraining for dialogue
☆2,400Updated 2 years ago
openai / sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
☆1,584Updated 4 years ago
facebookresearch / XLM
PyTorch original implementation of Cross-lingual Language Model Pretraining.
☆2,916Updated 2 years ago
uber-research / PPLM
Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.
☆1,149Updated last year
akanyaani / gpt-2-tensorflow2.0
OpenAI GPT2 pre-training and sequence prediction implementation in Tensorflow 2.0
☆262Updated 2 years ago
facebookresearch / UnsupervisedMT
Phrase-Based & Neural Unsupervised Machine Translation
☆1,504Updated 3 years ago
rowanz / grover
Code for Defending Against Neural Fake News, https://rowanzellers.com/grover/
☆921Updated 2 years ago
Jiakui / awesome-bert
bert nlp papers, applications and github resources, including the newst xlnet ， BERT、XLNet 相关论文和 github 项目
☆1,850Updated 4 years ago
OpenNMT / OpenNMT-tf
Neural machine translation and sequence learning using TensorFlow
☆1,473Updated last year
tensorflow / lingvo
Lingvo
☆2,848Updated last month
graykode / xlnet-Pytorch
Simple XLNet implementation with Pytorch Wrapper
☆580Updated 6 years ago