openai / gpt-2-output-dataset
Dataset of GPT-2 outputs for research in detection, biases, and more
☆1,965 Updated last year
Alternatives and similar repositories for gpt-2-output-dataset
Users who are interested in gpt-2-output-dataset are comparing it to the repositories listed below.
- Code for the paper "Language Models are Unsupervised Multitask Learners" ☆1,147 Updated 2 years ago
- Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation ☆987 Updated 5 years ago
- Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts ☆3,402 Updated 2 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer" ☆6,289 Updated last week
- Code and model for the paper "Improving Language Understanding by Generative Pre-Training" ☆2,193 Updated 6 years ago
- Conditional Transformer Language Model for Controllable Generation ☆1,877 Updated 3 years ago
- Code for Defending Against Neural Fake News, https://rowanzellers.com/grover/ ☆919 Updated last year
- An open clone of the GPT-2 WebText dataset by OpenAI. Still WIP. ☆388 Updated 11 months ago
- Code for the paper "Language Models are Unsupervised Multitask Learners" ☆23,154 Updated 6 months ago
- Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed. ☆729 Updated 2 years ago
- 🦄 State-of-the-Art Conversational AI with Transfer Learning ☆1,750 Updated last year
- ☆1,540 Updated last year
- Large-scale pretraining for dialogue ☆2,379 Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,348 Updated 11 months ago
- Crawl BookCorpus ☆825 Updated last year
- The implementation of DeBERTa ☆2,047 Updated last year
- A robust Python tool for text-based AI training and generation using GPT-2. ☆1,843 Updated last year
- 🐥 A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI ☆1,508 Updated 3 years ago
- ☆2,763 Updated this week
- Language-Agnostic SEntence Representations ☆3,619 Updated 10 months ago
- ALBERT: A Lite BERT for Self-supervised Learning of Language Representations ☆3,267 Updated last year
- An implementation of training for GPT2, supports TPUs ☆1,425 Updated 2 years ago
- Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models. ☆1,141 Updated last year
- Code for "Learning to summarize from human feedback" ☆1,013 Updated last year
- ☆1,618 Updated last year
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference" ☆1,628 Updated last year
- ☆1,268 Updated 2 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining. ☆2,900 Updated 2 years ago
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is design… ☆978 Updated 3 years ago
- 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production ☆9,452 Updated 3 weeks ago