EleutherAI / stackexchange-datasetLinks

Python tools for processing the stackexchange data dumps into a text dataset for Language Models
80Updated last year

Alternatives and similar repositories for stackexchange-dataset

Users that are interested in stackexchange-dataset are comparing it to the libraries listed below

Sorting: