jcpeterson / openwebtextView on GitHub
Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed.
760Dec 8, 2022Updated 3 years ago

Alternatives and similar repositories for openwebtext

Users that are interested in openwebtext are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?