openai/finetune-transformer-lm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/openai/finetune-transformer-lm)

openai / finetune-transformer-lm

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"

☆2,309

Alternatives and similar repositories for finetune-transformer-lm

Users that are interested in finetune-transformer-lm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

huggingface / pytorch-openai-transformer-lm
View on GitHub
🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI
☆1,521Aug 9, 2021Updated 4 years ago
allenai / bilm-tf
View on GitHub
Tensorflow implementation of contextualized word representations from bi-directional language models
☆1,612Mar 4, 2023Updated 3 years ago
zihangdai / xlnet
View on GitHub
XLNet: Generalized Autoregressive Pretraining for Language Understanding
☆6,185May 28, 2023Updated 3 years ago
allenai / allennlp
View on GitHub
An open-source NLP research library, built on PyTorch.
☆11,889Nov 22, 2022Updated 3 years ago
salesforce / decaNLP
View on GitHub
The Natural Language Decathlon: A Multitask Challenge for NLP
☆2,338May 1, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
openai / gpt-2
View on GitHub
Code for the paper "Language Models are Unsupervised Multitask Learners"
☆25,026Aug 14, 2024Updated last year
facebookresearch / XLM
View on GitHub
PyTorch original implementation of Cross-lingual Language Model Pretraining.
☆2,923Feb 14, 2023Updated 3 years ago
google-research / bert
View on GitHub
TensorFlow code and pre-trained models for BERT
☆40,061Jul 23, 2024Updated 2 years ago
namisan / mt-dnn
View on GitHub
Multi-Task Deep Neural Networks for Natural Language Understanding
☆2,259Mar 7, 2024Updated 2 years ago
IndicoDataSolutions / finetune
View on GitHub
Scikit-learn style model finetuning for NLP
☆721May 5, 2026Updated 2 months ago
tensorflow / tensor2tensor
View on GitHub
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
☆17,464Jun 2, 2023Updated 3 years ago
salesforce / awd-lstm-lm
View on GitHub
LSTM and QRNN Language Model Toolkit for PyTorch
☆1,989Feb 12, 2022Updated 4 years ago
localminimum / QANet
View on GitHub
A Tensorflow implementation of QANet for machine reading comprehension
☆984May 30, 2018Updated 8 years ago
facebookresearch / SentEval
View on GitHub
A python tool for evaluating the quality of sentence embeddings.
☆2,110Mar 19, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / InferSent
View on GitHub
InferSent sentence embeddings
☆2,280Aug 30, 2021Updated 4 years ago
Kyubyong / transformer
View on GitHub
A TensorFlow Implementation of the Transformer: Attention Is All You Need
☆4,472May 21, 2023Updated 3 years ago
kimiyoung / transformer-xl
View on GitHub
☆3,711Sep 21, 2022Updated 3 years ago
facebookresearch / UnsupervisedMT
View on GitHub
Phrase-Based & Neural Unsupervised Machine Translation
☆1,499Sep 15, 2021Updated 4 years ago
sebastianruder / NLP-progress
View on GitHub
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…
☆22,955Jul 28, 2024Updated 2 years ago
google / sentencepiece
View on GitHub
Unsupervised text tokenizer for Neural Network-based text generation.
☆11,991Updated this week
allenai / bi-att-flow
View on GitHub
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul…
☆1,546May 31, 2023Updated 3 years ago
google-research / text-to-text-transfer-transformer
View on GitHub
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,540Jul 8, 2026Updated 2 weeks ago
facebookresearch / pytext
View on GitHub
A natural language modeling framework based on PyTorch
☆6,295Oct 17, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
rsennrich / subword-nmt
View on GitHub
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
☆2,272Aug 7, 2024Updated last year
codertimo / BERT-pytorch
View on GitHub
Google AI 2018 BERT pytorch implementation
☆6,528Sep 15, 2023Updated 2 years ago
YichenGong / Densely-Interactive-Inference-Network
View on GitHub
Cleaned code for paper "Natural Language Inference over Interaction Space"
☆248Mar 24, 2023Updated 3 years ago
abisee / pointer-generator
View on GitHub
Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"
☆2,194Jun 16, 2022Updated 4 years ago
salesforce / cove
View on GitHub
☆471Feb 12, 2022Updated 4 years ago
facebookresearch / fairseq
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,250Sep 30, 2025Updated 9 months ago
OpenNMT / OpenNMT-py
View on GitHub
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
☆7,012Oct 14, 2025Updated 9 months ago
thunlp / ERNIE
View on GitHub
Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"
☆1,419Jan 10, 2024Updated 2 years ago
asyml / texar
View on GitHub
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…
☆2,390Jul 21, 2026Updated last week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
jina-ai / clip-as-service
View on GitHub
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
☆12,833Jan 23, 2024Updated 2 years ago
openai / sparse_attention
View on GitHub
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
☆1,614Aug 12, 2020Updated 5 years ago
facebookresearch / DrQA
View on GitHub
Reading Wikipedia to Answer Open-Domain Questions
☆4,471Oct 1, 2023Updated 2 years ago
facebookresearch / MUSE
View on GitHub
A library for Multilingual Unsupervised or Supervised word Embeddings
☆3,248Aug 31, 2022Updated 3 years ago
google / seq2seq
View on GitHub
A general-purpose encoder-decoder framework for Tensorflow
☆5,620Oct 15, 2020Updated 5 years ago
ryankiros / skip-thoughts
View on GitHub
Sent2Vec encoder and training code from the paper "Skip-Thought Vectors"
☆2,048Jun 9, 2020Updated 6 years ago
openai / generating-reviews-discovering-sentiment
View on GitHub
Code for "Learning to Generate Reviews and Discovering Sentiment"
☆1,520Jun 28, 2023Updated 3 years ago