EleutherAI / pile-pubmedcentral

A script for collecting the PubMed Central dataset in a language modelling friendly format.
23Updated 3 years ago

Related projects

Alternatives and complementary repositories for pile-pubmedcentral