woxcab / scrapy_rss
Tools to easily generate an RSS feed that contains each scraped item using the Scrapy framework.
☆33Updated 2 months ago
Alternatives and similar repositories for scrapy_rss:
Users who are interested in scrapy_rss are comparing it to the libraries listed below.
- Simple Web UI for Scrapy spider management via Scrapyd☆51Updated 6 years ago
- A Scrapy crawler for http://books.toscrape.com☆27Updated 7 years ago
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.☆56Updated 2 years ago
- A collection of pipelines for Scrapy☆16Updated 3 months ago
- A fork of http://pydispatcher.sourceforge.net/ with PyPy support☆16Updated 7 years ago
- A Ruia plugin for loading JavaScript - pyppeteer☆18Updated 2 years ago
- A list of awesome projects for Ruia☆13Updated 2 years ago
- Python API wrapper for https://mercury.postlight.com/web-parser/☆23Updated last year
- Scrapy downloader middleware that stores response HTMLs to disk.☆18Updated 9 months ago
- PyQuery-based scraping micro-framework.☆116Updated 3 years ago
- Combine XPath, CSS Selectors and JSONPath for web data extraction.☆27Updated last month
- A simple, Qt-Webengine powered web browser with built-in functionality for basic Scrapy web scraping support.☆109Updated 8 months ago
- Scrapy spider middleware to split an item into multiple items using a multi-valued key☆20Updated 8 years ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML.☆147Updated 4 years ago
- Scrapy middleware that allows crawling only new content☆80Updated 2 years ago
- Python library for finding phone numbers in random user input text.☆9Updated 7 years ago
- A command-line utility to read and write data in CSV, TSV, XLS, XLSX and ODS formats.☆30Updated 5 years ago
- Lightweight library that converts an HTML webpage to JSON data using a template defined in JSON.☆21Updated 4 years ago
- This is the HeadQuarters of my digital info. HPI library got me inspired and I'm trying to play with the idea on a smaller scale for myse…☆20Updated last year
- A request-based package for scraping Twitter data. No API key required. Supports proxies and private Twitter accounts☆27Updated last year
- A Python script to help you add user attributions to your Twitter bots☆11Updated 4 years ago
- A crawler for http://books.toscrape.com☆40Updated last year
- Some Scrapy and web.py examples☆79Updated 7 years ago
- A Scrapy extension to log items coverage when the spider shuts down☆19Updated 4 years ago
- Generate random date(time) in Python.☆10Updated 11 months ago
- Use pyppeteer from a Scrapy spider☆60Updated 5 years ago
- Scrapy with Headless Selenium, for scraping interactive web pages☆10Updated 2 years ago
- Exports a list of all your starred GitHub repos to a JSON file☆27Updated 4 years ago
- RSS feed reader for Python 3☆85Updated 2 years ago