18F / scrapebox

A simple, system independent infrastructure for performing web scraping. Utilizes Vagrant virtualbox interface and puppet provisioning to create and execute scraping of web content to structured data quickly and easily without modifying your core system.
24Updated 10 years ago

Alternatives and similar repositories for scrapebox:

Users that are interested in scrapebox are comparing it to the libraries listed below