rspeer / wiki2text

Extract a plain text corpus from MediaWiki XML dumps, such as Wikipedia.
132 · Updated 6 years ago

Related projects

Alternatives and complementary repositories for wiki2text