18F / scrapeboxLinks
A simple, system independent infrastructure for performing web scraping. Utilizes Vagrant virtualbox interface and puppet provisioning to create and execute scraping of web content to structured data quickly and easily without modifying your core system.
☆24Updated 11 years ago
Alternatives and similar repositories for scrapebox
Users that are interested in scrapebox are comparing it to the libraries listed below
Sorting:
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆47Updated 7 years ago
- Bringing sanity to world of messed-up data☆66Updated 10 years ago
- ☆36Updated last year
- A component based data flow framework with a drag-n-drop Web 2.0 interface. Based on Stackless Python and inspired by Yahoo! Pipes.☆150Updated 13 years ago
- Specialised bot for periodical grabs and video/audio/etc. webpage scrapes.☆11Updated 7 years ago
- Twerp is the telephone hackers toolkit. It's also a command-line app for Twilio, written in Python☆26Updated 4 years ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆11Updated 10 years ago
- Junk drawer of old scripts.☆18Updated 9 years ago
- A tool to graph who has sent you the most emails☆17Updated 8 years ago
- Python library with common functionality for writing web scrapers☆102Updated 10 years ago
- The Social Harvest server that exposes an API and harvests data from the web to be analyzed.☆115Updated 10 years ago
- scraper related helper functions☆27Updated 11 years ago
- Open Source Social Media Monitoring And Engagement System Core/API☆36Updated 11 years ago
- The Python Achievements Framework!☆118Updated 3 years ago
- Packet Sniffing in the Cloud☆36Updated 6 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40Updated last year
- Keep an eye on specific keywords being posted on Twitter☆46Updated 9 years ago
- A modern web based communication service on top IRC.☆152Updated 8 years ago
- Python code examples for working with the Slack API. 2.x and 3.x compatible code.☆13Updated 9 years ago
- Network mapping tool☆16Updated 7 years ago
- Write you a home page with bookmarks well-organized.☆16Updated 8 years ago
- Python scripts for scraping bus ticket data from the websites of BoltBus, Greyhound, Megabus, GoBus, Amtrak, Peterpan, and EasternTravel.☆38Updated 4 years ago
- A portable, lightweight, locally-hosted IPv4 and IPv6 geolocation API/server☆40Updated 7 years ago
- small web parser that gets all the top jobs and visualizes the various salaries for each position☆21Updated 9 years ago
- ☆223Updated 10 years ago
- Python module to watch Twitter user pages or search-results.☆63Updated 11 years ago
- Secure random passwords in javascript☆18Updated 5 years ago
- sync a website or local spreadsheet with a google sheet☆35Updated 2 years ago
- Gamification Platform☆102Updated 10 years ago
- Main repo for pinitto.me open source corkboard☆63Updated 5 years ago