bwbaugh / wikipedia-extractor
This is a mirror of the script by Giuseppe Attardi, and it contains history from before the official repo started: https://github.com/attardi/wikiextractor. The script extracts and cleans text from a Wikipedia database dump and stores the output in a number of files of similar size in a given directory.
259 · Aug 17, 2016 · Updated 9 years ago
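
The description mentions storing output in a number of files of similar size in a given directory. As a rough illustration of that idea only, and not the script's actual implementation, here is a minimal Python sketch. The function name `write_in_chunks`, the `max_bytes` parameter, and the `part_0000.txt` naming scheme are all hypothetical and chosen for this example.

```python
import os


def write_in_chunks(docs, out_dir, max_bytes):
    """Write documents into out_dir, starting a new file whenever the
    current one would grow past max_bytes. Returns the created paths.

    Hypothetical helper: the name, signature, and file naming are
    assumptions for illustration, not taken from the mirrored script.
    """
    os.makedirs(out_dir, exist_ok=True)
    paths = []
    current = None
    written = 0
    try:
        for doc in docs:
            data = (doc.rstrip("\n") + "\n").encode("utf-8")
            # Rotate when this document would push the file past the limit.
            # A non-empty file is required, so an oversized document still
            # gets written (alone) instead of looping forever.
            if current is None or (written > 0 and written + len(data) > max_bytes):
                if current is not None:
                    current.close()
                path = os.path.join(out_dir, "part_%04d.txt" % len(paths))
                current = open(path, "wb")
                paths.append(path)
                written = 0
            current.write(data)
            written += len(data)
    finally:
        if current is not None:
            current.close()
    return paths
```

Documents are never split across files in this sketch, so file sizes are similar rather than identical; a single document larger than `max_bytes` ends up in a file of its own.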

Alternatives and similar repositories for wikipedia-extractor

Users that are interested in wikipedia-extractor are comparing it to the libraries listed below

