18F / scrapebox
A simple, system-independent infrastructure for web scraping. It uses Vagrant's VirtualBox interface and Puppet provisioning to set up and run scrapers that turn web content into structured data quickly and easily, without modifying your core system.
☆24Updated 11 years ago
Alternatives and similar repositories for scrapebox
Users who are interested in scrapebox are comparing it to the libraries listed below.
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆47Updated 7 years ago
- A tool to graph who has sent you the most emails☆17Updated 8 years ago
- ☆36Updated 2 years ago
- A pastebin for tables.☆34Updated 12 years ago
- Python module to watch Twitter user pages or search-results.☆64Updated 11 years ago
- framework for scraping legislative/government data☆89Updated 3 weeks ago
- Twerp is the telephone hacker's toolkit. It's also a command-line app for Twilio, written in Python.☆27Updated 5 years ago
- Bringing sanity to the world of messed-up data☆66Updated 11 years ago
- Python library with common functionality for writing web scrapers☆102Updated 10 years ago
- Specialised bot for periodical grabs and video/audio/etc. webpage scrapes.☆11Updated 8 years ago
- Junk drawer of old scripts.☆18Updated 9 years ago
- Open Source Social Media Monitoring And Engagement System Core/API☆37Updated 11 years ago
- Keep an eye on specific keywords being posted on Twitter☆46Updated 10 years ago
- a simple server that connects calls between citizens and their congress person using the Twilio API☆67Updated 4 years ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆11Updated 10 years ago
- A component based data flow framework with a drag-n-drop Web 2.0 interface. Based on Stackless Python and inspired by Yahoo! Pipes.☆150Updated 13 years ago
- Write you a home page with bookmarks well-organized.☆16Updated 8 years ago
- Main repo for pinitto.me open source corkboard☆63Updated 5 years ago
- A native web-based client for Slack.☆23Updated 8 years ago
- Python interface to Digital Ocean☆24Updated 10 years ago
- video indexing site☆214Updated 9 years ago
- We use Tock to track and report our time at 18F☆125Updated last month
- A photobooth script that automatically snaps a photo, applies a watermark, uploads to a remote server, generates a QRCode, shortens the U…☆70Updated 10 years ago
- A collaborative list of open-source alternatives to typical government and enterprise software needs☆47Updated 9 years ago
- The fastest way to start using Twilio with Python.☆99Updated 6 years ago
- Feedbuffer buffers RSS and Atom syndication feeds, that is to say it caches new feed entries until the news aggregator requests them and …☆19Updated 9 years ago
- Create local study groups and learn together.☆15Updated last week
- "Hacker-CMS" Sandstorm App mashing up Jekyll, Ace Editor, and jsTree☆67Updated 9 years ago
- ScraperWiki Python library for scraping and saving data☆158Updated 2 years ago
- Canadian legislative scrapers☆35Updated this week