EleutherAI / stackexchange-dataset

Python tools for processing the stackexchange data dumps into a text dataset for Language Models
82Updated last year

Alternatives and similar repositories for stackexchange-dataset

Users that are interested in stackexchange-dataset are comparing it to the libraries listed below

Sorting: