Girbons / mercury-parserpyLinks
python api wrapper for https://mercury.postlight.com/web-parser/
☆25Updated 2 years ago
Alternatives and similar repositories for mercury-parserpy
Users that are interested in mercury-parserpy are comparing it to the libraries listed below
Sorting:
- A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.☆110Updated last year
- This repository provides usage examples for the Python module Newspaper3k.☆148Updated last year
- RSS feed reader for Python 3☆88Updated 2 years ago
- Zyte Automatic Extraction integration for Scrapy☆56Updated 3 years ago
- A python based HTML to text conversion library, command line client and Web service.☆320Updated last month
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆266Updated last year
- Ultimate Website Sitemap Parser☆225Updated last week
- Extract text from HTML☆134Updated 5 years ago
- Parsing JavaScript objects into Python data structures☆212Updated last month
- Scraping assistant tool. Editing and maintaining CSS/XPath selectors across webpages.☆105Updated 7 years ago
- Software stack with latest Scrapy and updated deps☆65Updated last month
- python library for getting metadata☆147Updated last week
- Web scraping Page Objects core library☆101Updated 2 weeks ago
- Extract price amount and currency symbol from a raw text string☆337Updated 6 months ago
- Utilize your personal data like Google!☆160Updated last year
- A helper library full of URL-related heuristics.☆70Updated this week
- A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them☆68Updated 2 years ago
- Atom, RSS and JSON feed parser for Python 3☆117Updated 2 years ago
- Scrapy middleware which allows to crawl only new content☆79Updated 2 years ago
- Parse government documents into well formed JSON☆72Updated 3 weeks ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆138Updated last month
- Generate Python Requests code from your browser activity 🤖☆120Updated 3 months ago
- A Scrapy middleware to bypass the CloudFlare's anti-bot protection☆110Updated 4 years ago
- Simple, robust email validation☆132Updated 2 years ago
- This is the HeadQuarters of my digital info. HPI library got me inspired and I'm trying to play with the idea on a smaller scale for myse…☆21Updated last year
- Serve click scripts over the web☆271Updated 11 months ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆23Updated 3 months ago
- admin ui for scrapy/open source scrapinghub☆58Updated 4 years ago
- Common interface for data container classes☆68Updated this week
- Python Wrapper for the USPS API☆59Updated 2 years ago