openai / gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
β22,625Updated 3 months ago
Alternatives and similar repositories for gpt-2:
Users that are interested in gpt-2 are comparing it to the libraries listed below
- π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.β136,044Updated this week
- GPT-3: Language Models are Few-Shot Learnersβ15,695Updated 4 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β30,650Updated last month
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) trainingβ20,348Updated 3 months ago
- TensorFlow code and pre-trained models for BERTβ38,326Updated 4 months ago
- Unsupervised text tokenizer for Neural Network-based text generation.β10,351Updated last week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ20,333Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β35,794Updated this week
- Ongoing research training transformer models at scaleβ10,758Updated this week
- State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterβ¦β13,647Updated 3 months ago
- Dataset of GPT-2 outputs for research in detection, biases, and moreβ1,946Updated 11 months ago
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β34,448Updated this week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Autoβ¦β12,364Updated this week
- An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.β8,242Updated 2 years ago
- State-of-the-Art Text Embeddingsβ15,521Updated this week
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an imageβ26,369Updated 4 months ago
- XLNet: Generalized Autoregressive Pretraining for Language Understandingβ6,183Updated last year
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"β6,204Updated 2 months ago
- A library for efficient similarity search and clustering of dense vectors.β31,824Updated this week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β16,644Updated this week
- StyleGAN2 - Official TensorFlow Implementationβ11,006Updated 6 months ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.β10,498Updated last year
- A very simple framework for state-of-the-art Natural Language Processing (NLP)β13,981Updated this week
- Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.β15,630Updated last year
- This repository contains implementations and illustrative code to accompany DeepMind publicationsβ13,316Updated 3 weeks ago
- The fastai deep learning libraryβ26,374Updated 3 weeks ago
- Inference code for Llama modelsβ56,693Updated 3 months ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.β12,607Updated 2 months ago
- Open Source Neural Machine Translation and (Large) Language Models in PyTorchβ6,788Updated 5 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β37,177Updated last week