EleutherAI / stackexchange-dataset

Python tools for processing the stackexchange data dumps into a text dataset for Language Models
81Updated last year

Alternatives and similar repositories for stackexchange-dataset:

Users that are interested in stackexchange-dataset are comparing it to the libraries listed below