18F / scrapeboxLinks
A simple, system independent infrastructure for performing web scraping. Utilizes Vagrant virtualbox interface and puppet provisioning to create and execute scraping of web content to structured data quickly and easily without modifying your core system.
☆24Updated 10 years ago
Alternatives and similar repositories for scrapebox
Users that are interested in scrapebox are comparing it to the libraries listed below
Sorting:
- This is a heroku buildpack for Pelican.☆23Updated 3 years ago
- Python command line tools, for increased fu.☆46Updated 9 years ago
- 3bot is a software platform to build, configure and perform.☆11Updated 7 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆46Updated 7 years ago
- bookmark management for the Django web framework☆17Updated 9 years ago
- Write you a home page with bookmarks well-organized.☆16Updated 7 years ago
- Junk drawer of old scripts.☆18Updated 9 years ago
- Django feeds provides an extensive database model for RSS feeds and a fault tolerant parser.☆31Updated 13 years ago
- Webhooks for Django *experimental*☆62Updated 15 years ago
- A native web-based client for Slack.☆23Updated 7 years ago
- This project is no longer maintained. Check out https://timesheet.gregbrown.co - the time tracking application which grew out of this cod…☆20Updated 5 years ago
- An autoscaling python script for Heroku☆27Updated 13 years ago
- Django template ready to use in PAAS platforms like Heroku, OpenShift, etc...☆21Updated 11 years ago
- An example REST API with Django, Tastypie, xAuth and Heroku☆72Updated 5 years ago
- django buddy, a chat bot use django as server, python aiml as backend.☆20Updated 11 years ago
- Specialised bot for periodical grabs and video/audio/etc. webpage scrapes.☆11Updated 7 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆24Updated 8 years ago
- Twerp is the telephone hackers toolkit. It's also a command-line app for Twilio, written in Python☆26Updated 4 years ago
- Python RethinkDB Object Mapper Interface Inspired by Appengine NDB☆13Updated 9 years ago
- Very simple Netflix API client☆24Updated 14 years ago
- SSH into all local Docker containers by name.☆83Updated 11 years ago
- Open Source Social Media Monitoring And Engagement System Core/API☆36Updated 10 years ago
- video indexing site☆216Updated 9 years ago
- An eBook tool to extract ISBN or Metadata form eBook and rename them by using ISBN database and Metadata☆30Updated 9 years ago
- A weather monitoring Dashboard built upon Python and Yahoo API☆14Updated 10 years ago
- Reddit Bots and Scripts☆10Updated 10 years ago
- Constituent Relationship Management and Content Management system solution, built for non-profit and governmental groups.☆17Updated 10 years ago
- Bombolone is a tasty Content Management System for Python based on Flask, MongoDB, AngularJS, Sass and Bootstrap. It's designed to be a s…☆74Updated 9 years ago
- A Grooveshark song downloader in Python☆120Updated 8 years ago
- Carrot -- A simple task Queue for Django.☆14Updated 3 years ago