bigscience-workshop / data-preparation

Code used for sourcing and cleaning the BigScience ROOTS corpus
305 · Updated last year

Related projects

Alternatives and complementary repositories for data-preparation