ralacher / phpBB_crawler
Scrapy spider to crawl phpBB forums and extract information, allows for authentication
☆8Updated 9 years ago
Alternatives and similar repositories for phpBB_crawler:
Users that are interested in phpBB_crawler are comparing it to the libraries listed below
- A distrubuted crawler ues celery.☆17Updated 10 years ago
- Search engine base (crawler, indexer and parser) using Python, Celery, RabbitMQ, CouchDB and Whoosh.☆11Updated last year
- Jabba's headless webkit browser for scraping AJAX-powered webpages.☆91Updated 10 years ago
- Scrapy project to fetch movies data from IMDB and put on MySQL DB. It download movie covers also and more scrapers will be done on future…☆15Updated 12 years ago
- Turn your IPython console into a cross-database SQL client☆31Updated 8 years ago
- Python SMTP client and Email for Humans™☆82Updated 6 years ago
- A reddit/hackernews clone created in python with flask and app engine☆11Updated 12 years ago
- Scraping bhinneka.com, just for fun☆14Updated 12 years ago
- Bringing sanity to world of messed-up data☆65Updated 10 years ago
- Mosaics generation from movie frames☆44Updated 9 years ago
- Python command line tools, for increased fu.☆46Updated 9 years ago
- Open Source Social Media Monitoring And Engagement System Core/API☆36Updated 10 years ago
- This is a scrapy project in which I have implemented several crawlers for different torrent and direct link websites.☆59Updated 4 years ago
- This is a bot to download all your instagram gallery pictures in a single folder☆58Updated 8 years ago
- An eBook tool to extract ISBN or Metadata form eBook and rename them by using ISBN database and Metadata☆30Updated 9 years ago
- Web scraping engines with Python and Scrapy☆33Updated 4 years ago
- A component that tries to avoid downloading duplicate content☆27Updated 6 years ago
- Intelligent RSS news aggregator.☆33Updated last year
- Turn your laptop into a CCTV!☆72Updated 5 years ago
- Python library that uses selenium and phantomjs to automate Facebook Group administration☆27Updated 6 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- A scrapy spider to extract post, thread, and user information from a vBulletin forum to a MongoDB database.☆31Updated 8 years ago
- A native web-based client for Slack.☆23Updated 7 years ago
- A Python library for interacting with WordPress REST API.☆40Updated 2 years ago
- Some scrapy and web.py exmaples☆78Updated 7 years ago
- Short script for removing watermarks from PDF files. Requires pdftk.☆58Updated 5 years ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 9 years ago
- Passive network observation tool☆30Updated 5 years ago
- Python web-scraping library that wraps urllib2 and BeautifulSoup☆39Updated 5 years ago
- Scrapy middleware which allows to crawl only new content☆80Updated 2 years ago