tbrianjones / website_extractor

This is an In-Memory Web Crawler & Scraper built to extract data in small runs from public websites. The current implementation takes a .csv of urls and crawls the sites, extracting basic info about the site like emails, phone numbers, addresses, & specified terms.
10Updated 9 years ago

Alternatives and similar repositories for website_extractor:

Users that are interested in website_extractor are comparing it to the libraries listed below