fhightower / html-to-json
Convert HTML to JSON. Can also (intelligently) convert HTML tables to JSON (using table headers (if available) as keys in the resulting JSON).
☆50Updated last year
Alternatives and similar repositories for html-to-json:
Users that are interested in html-to-json are comparing it to the libraries listed below
- Schema.org classes in pydantic☆65Updated 2 years ago
- Common interface for data container classes☆66Updated last week
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆52Updated last month
- Parse numbers written in natural language☆109Updated 3 months ago
- ☆84Updated last month
- Library to populate items using XPath and CSS with a convenient API☆46Updated 2 weeks ago
- A pure-Python robots.txt parser with support for modern conventions.☆58Updated 2 weeks ago
- Extract text from HTML☆133Updated 4 years ago
- Python clients for Zyte AutoExtract API☆40Updated 3 years ago
- Web scraping Page Objects core library☆96Updated last week
- Transport adapter for fetching file:// URLs with the requests python library☆86Updated 7 months ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆253Updated 11 months ago
- Bounded Process&Thread Pool Executor☆63Updated 11 months ago
- universal character encoding detector☆58Updated 5 months ago
- Asynchronous version of functions of shutil module.☆37Updated 7 months ago
- A Python implementation of Lunr.js 🌖☆195Updated last month
- A modern CSS selector implementation for BeautifulSoup☆229Updated this week
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆67Updated last year
- ⬅️ Dedent and format source code strings into their intended human-readable strings☆19Updated last year
- ☆18Updated last month
- Caching for HTTPX☆70Updated 6 months ago
- Automatic generation of documentation for mkdocs☆43Updated 4 years ago
- A fast and flexible reimplementation of data classes☆84Updated 2 years ago
- Define your JSON schema as Python dataclasses☆63Updated last year
- Python tool to support lazy imports.☆28Updated 3 months ago
- Simple tool for getting geolocation information on given IP address from various geolocation databases.☆62Updated 3 years ago
- ☆61Updated 10 months ago
- Parsing JavaScript objects into Python data structures☆202Updated last month
- Fast and robust date extraction from web pages, with Python or on the command-line☆122Updated last month
- ☆55Updated this week