tbrianjones / website_extractorView on GitHub
This is an In-Memory Web Crawler & Scraper built to extract data in small runs from public websites. The current implementation takes a .csv of urls and crawls the sites, extracting basic info about the site like emails, phone numbers, addresses, & specified terms.
11Apr 19, 2015Updated 10 years ago

Alternatives and similar repositories for website_extractor

Users that are interested in website_extractor are comparing it to the libraries listed below

Sorting:

Are these results useful?