18F / scrapebox
A simple, system-independent infrastructure for web scraping. It uses Vagrant's VirtualBox interface and Puppet provisioning to scrape web content into structured data quickly and easily, without modifying your core system.
☆24 Updated 11 years ago
Alternatives and similar repositories for scrapebox
Users who are interested in scrapebox compare it to the libraries listed below.
- Twerp is the telephone hacker's toolkit. It's also a command-line app for Twilio, written in Python. ☆26 Updated 4 years ago
- ☆36 Updated last year
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy. ☆47 Updated 7 years ago
- A pastebin for tables. ☆34 Updated 11 years ago
- A tool to manage servers through a central configuration. Plugins allow provisioning, configuration and other management tasks. ☆78 Updated 10 months ago
- Django feeds provides an extensive database model for RSS feeds and a fault tolerant parser. ☆31 Updated 13 years ago
- AES encrypted password manager ☆186 Updated 10 years ago
- Sample applications that cover common use cases in a variety of languages. ☆18 Updated 13 years ago
- A native web-based client for Slack. ☆23 Updated 8 years ago
- Bringing sanity to the world of messed-up data ☆66 Updated 10 years ago
- A tool to graph who has sent you the most emails ☆17 Updated 8 years ago
- Write yourself a home page with well-organized bookmarks. ☆16 Updated 8 years ago
- Python scripts for scraping bus ticket data from the websites of BoltBus, Greyhound, Megabus, GoBus, Amtrak, Peterpan, and EasternTravel. ☆38 Updated 4 years ago
- Python library with common functionality for writing web scrapers ☆102 Updated 10 years ago
- Python module to watch Twitter user pages or search-results. ☆63 Updated 11 years ago
- Scraper-related helper functions ☆27 Updated 11 years ago
- A simple server that connects calls between citizens and their congressperson using the Twilio API ☆66 Updated 3 years ago
- Taws - A personal and private web search engine ☆24 Updated 10 years ago
- Python interface to DigitalOcean ☆24 Updated 10 years ago
- "Hacker-CMS" Sandstorm App mashing up Jekyll, Ace Editor, and jsTree ☆67 Updated 9 years ago
- The Python Achievements Framework! ☆118 Updated 3 years ago
- Tiny Python web crawler ☆169 Updated 9 years ago
- Junk drawer of old scripts. ☆18 Updated 9 years ago
- This is a Heroku buildpack for Pelican. ☆23 Updated 3 years ago
- A Python SDK for Human + AI Conversational Experiences ☆10 Updated 8 years ago
- A basic Django template skeleton built on HTML5 Boilerplate and Twitter Bootstrap. ☆33 Updated 11 years ago
- Small web parser that gets all the top jobs and visualizes the various salaries for each position ☆21 Updated 9 years ago
- Export a graph of the links between items crawled by Scrapy in DOT file format. ☆26 Updated 13 years ago
- Django template ready to use on PaaS platforms like Heroku, OpenShift, etc. ☆21 Updated 11 years ago
- Video indexing site ☆216 Updated 9 years ago