fhightower / html-to-json
Convert HTML to JSON. It can also convert HTML tables to JSON, using the table headers (if available) as keys in the resulting JSON.
☆52 Updated 2 years ago
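The table conversion described above can be illustrated with a minimal sketch. It uses only the Python standard library and is not the html-to-json implementation or its API; the names `TableParser` and `table_to_records` are hypothetical, and the sketch assumes a single, non-nested table whose header cells are marked up as `<th>`.

```python
from html.parser import HTMLParser
import json


class TableParser(HTMLParser):
    """Collect header and cell text from one simple, non-nested table."""

    def __init__(self):
        super().__init__()
        self.headers = []        # text of every <th> cell
        self.rows = []           # one list of <td> texts per data row
        self._row = None         # cells of the row currently being read
        self._cell = None        # text fragments of the cell being read
        self._is_header = False  # True while inside a <th>

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("th", "td"):
            self._cell = []
            self._is_header = tag == "th"

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("th", "td") and self._cell is not None:
            text = "".join(self._cell).strip()
            if self._is_header:
                self.headers.append(text)
            elif self._row is not None:
                self._row.append(text)
            self._cell = None
        elif tag == "tr" and self._row is not None:
            if self._row:  # skip rows that held only header cells
                self.rows.append(self._row)
            self._row = None


def table_to_records(html):
    """Return a list of dicts keyed by header, or a list of lists if no headers exist."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    if parser.headers:
        return [dict(zip(parser.headers, row)) for row in parser.rows]
    return parser.rows


if __name__ == "__main__":
    sample = (
        "<table>"
        "<tr><th>Name</th><th>Stars</th></tr>"
        "<tr><td>html-to-json</td><td>52</td></tr>"
        "</table>"
    )
    print(json.dumps(table_to_records(sample), indent=2))
```

When the table has no header cells, the sketch falls back to returning each row as a plain list, which mirrors the "if available" condition in the description.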
Alternatives and similar repositories for html-to-json
Users who are interested in html-to-json are comparing it to the libraries listed below.
- Parse numbers written in natural language ☆124 Updated last year
- Common interface for data container classes ☆68 Updated 2 weeks ago
- ☆85 Updated 8 months ago
- A Python-based HTML-to-text conversion library, command-line client, and web service. ☆332 Updated 2 months ago
- Web scraping Page Objects core library ☆104 Updated this week
- Parsing JavaScript objects into Python data structures ☆217 Updated 5 months ago
- A modern CSS selector implementation for BeautifulSoup ☆263 Updated last week
- Python client for Typesense: https://github.com/typesense/typesense ☆231 Updated 2 weeks ago
- Pandoc (Python Library) ☆178 Updated 3 months ago
- Universal character encoding detector ☆63 Updated last year
- URLExtract is a Python class for collecting (extracting) URLs from a given text based on locating TLDs. ☆273 Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆123 Updated 2 months ago
- Ultimate Website Sitemap Parser ☆242 Updated this week
- URL normalization for Python ☆99 Updated 9 months ago
- Caching for HTTPX ☆73 Updated 3 months ago
- Python Markdown extension to include local or remote files ☆62 Updated last year
- Automatically close issues that have a label, after a custom delay, if no one replies back. ☆72 Updated 2 months ago
- Library to populate items using XPath and CSS with a convenient API ☆47 Updated this week
- ☆127 Updated this week
- Productivity tools for popular open source repos, used by pydantic ☆63 Updated 2 years ago
- Schema.org classes in pydantic ☆73 Updated 3 years ago
- A pure-Python robots.txt parser with support for modern conventions. ☆76 Updated last month
- Lightweight browser hot reload for Python ASGI web apps ☆164 Updated 4 months ago
- Spider templates for automatic crawlers. ☆34 Updated 2 weeks ago
- A Python implementation of Lunr.js 🌖 ☆203 Updated 10 months ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text. ☆57 Updated last year
- A library to read a YML file with XPath or CSS selectors and extract data from HTML pages using them ☆72 Updated 2 years ago
- Python binding to Poppler-cpp pdf library ☆114 Updated last year
- Extract price amount and currency symbol from a raw text string ☆347 Updated 3 months ago
- Asynchronous versions of functions from the shutil module. ☆47 Updated 3 months ago