Girbons / mercury-parserpy
python api wrapper for https://mercury.postlight.com/web-parser/
☆24Updated last year
Alternatives and similar repositories for mercury-parserpy
Users that are interested in mercury-parserpy are comparing it to the libraries listed below
Sorting:
- RSS feed reader for Python 3☆87Updated 2 years ago
- Tools to easy generate RSS feed that contains each scraped item using Scrapy framework.☆32Updated this week
- Web scraping Page Objects core library☆99Updated 3 months ago
- A Scrapy extension to log items coverage when the spider shuts down☆19Updated 5 years ago
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.☆56Updated 3 years ago
- This is the HeadQuarters of my digital info. HPI library got me inspired and I'm trying to play with the idea on a smaller scale for myse…☆21Updated last year
- URL normalization for Python☆94Updated 2 weeks ago
- Atom, RSS and JSON feed parser for Python 3☆117Updated 2 years ago
- Django QuerySet like interface to query simple Python collections☆68Updated last year
- Utilize your personal data like Google!☆163Updated last year
- Python Shell Toolkit☆25Updated 2 years ago
- A collection of pipelines for Scrapy☆16Updated last month
- Analyze scraped data☆46Updated 5 years ago
- The Temboz RSS/Atom feed reader☆83Updated last year
- yael (Yet Another EPUB Library) is a Python library for reading, manipulating, and writing EPUB 2/3 files☆18Updated 9 years ago
- a tool to snapshot sqlite databases you don't own☆21Updated 6 months ago
- Restrict crawl and scraping scope using matchers.☆25Updated 8 years ago
- A Python library for finding feed links on websites.☆52Updated 2 years ago
- Asyncio web crawling framework. Work in progress.☆18Updated 9 months ago
- Parse numbers written in natural language☆114Updated 6 months ago
- Save an RSS or ATOM feed to a SQLite database☆51Updated 2 years ago
- A Python 3, asyncio-based library to interact with the Pinboard API☆10Updated last week
- CLI based diff viewer☆23Updated 3 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆22Updated 2 weeks ago
- Scrapy middleware which allows to crawl only new content☆80Updated 2 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 7 months ago
- A scrapy extension to store requests and responses information in storage service☆26Updated 3 years ago
- Tool for running transformations on columns in a SQLite database☆31Updated 3 years ago
- Functional interface for concurrent futures, including async coroutines.☆11Updated 5 months ago
- linkbak is a web page archiver : it reads a list of links and dumps the corresponding pages in HTML and PDF.☆14Updated 2 years ago