ralacher / phpBB_crawler
Scrapy spider that crawls phpBB forums and extracts information, with support for authentication
☆8 Updated 10 years ago
Alternatives and similar repositories for phpBB_crawler
Users that are interested in phpBB_crawler are comparing it to the libraries listed below
- An Extensible Image Crawler ☆160 Updated 8 years ago
- Bringing sanity to world of messed-up data ☆66 Updated 10 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy. ☆47 Updated 7 years ago
- Python module to watch Twitter user pages or search-results. ☆62 Updated 10 years ago
- Short script for removing watermarks from PDF files. Requires pdftk. ☆59 Updated 6 years ago
- Search engine base (crawler, indexer and parser) using Python, Celery, RabbitMQ, CouchDB and Whoosh. ☆11 Updated last month
- Grabbing all news. ☆62 Updated 5 years ago
- Turn your IPython console into a cross-database SQL client ☆31 Updated 9 years ago
- A python script to download books from libgen.io ☆75 Updated 6 years ago
- Jabba's headless webkit browser for scraping AJAX-powered webpages. ☆91 Updated 10 years ago
- PyQuery-based scraping micro-framework. ☆117 Updated 3 years ago
- Open Source Social Media Monitoring And Engagement System Core/API ☆36 Updated 10 years ago
- ☕🗄CAching Proxy in Python – Simple file based python http proxy ☆15 Updated 3 years ago
- Fetch novels from internet ☆13 Updated 4 years ago
- A simple, system independent infrastructure for performing web scraping. Utilizes Vagrant virtualbox interface and puppet provisioning to… ☆24 Updated 10 years ago
- A component based data flow framework with a drag-n-drop Web 2.0 interface. Based on Stackless Python and inspired by Yahoo! Pipes. ☆150 Updated 12 years ago
- Python scripts to convert Google Chrome’s bookmarks and history to the standard HTML-ish bookmarks file format. ☆205 Updated 3 years ago
- Python SMTP client and Email for Humans™ ☆82 Updated 6 years ago
- scraper related helper functions ☆27 Updated 11 years ago
- Python script for searching through your digital books and cataloguing them in an easy-to-share list of files. ☆31 Updated 5 years ago
- A general purpose Python automatization library with nifty real-time web UI ☆30 Updated last month
- 📻 Play your favorite radio station from the terminal ☆76 Updated 5 years ago
- Collection of python scripts I have created to crawl various websites, mostly for lead generation projects to match keywords and collect … ☆131 Updated last year
- Python library for extracting text from various file formats (for indexing). ☆113 Updated 3 years ago
- A Python library for interacting with WordPress REST API. ☆40 Updated 3 years ago
- Chrome Debugging client for Python ☆33 Updated 5 years ago
- A Music playlist app ☆13 Updated 7 years ago
- Phantompy is a headless WebKit engine with powerful pythonic api build on top of Qt5 Webkit ☆613 Updated 8 years ago
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc. ☆56 Updated 3 years ago
- Secure random passwords in javascript ☆18 Updated 5 years ago