jcpeterson / openwebtextLinks

Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed.
731Updated 2 years ago

Alternatives and similar repositories for openwebtext

Users that are interested in openwebtext are comparing it to the libraries listed below

Sorting: