fhightower / html-to-jsonLinks
Convert HTML to JSON. Can also (intelligently) convert HTML tables to JSON (using table headers (if available) as keys in the resulting JSON).
☆52Updated 2 years ago
Alternatives and similar repositories for html-to-json
Users that are interested in html-to-json are comparing it to the libraries listed below
Sorting:
- Parse numbers written in natural language☆126Updated last year
- Common interface for data container classes☆68Updated last month
- URL normalization for Python☆99Updated 9 months ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆273Updated last year
- Parsing JavaScript objects into Python data structures☆217Updated 6 months ago
- universal character encoding detector☆63Updated last year
- Web scraping Page Objects core library☆104Updated 2 weeks ago
- ☆86Updated 8 months ago
- A modern CSS selector implementation for BeautifulSoup☆263Updated last week
- Extract price amount and currency symbol from a raw text string☆347Updated 4 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆123Updated 3 months ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆57Updated last year
- A python based HTML to text conversion library, command line client and Web service.☆334Updated 2 months ago
- Simple tool for getting geolocation information on given IP address from various geolocation databases.☆65Updated 4 years ago
- Automatically close issues that have a label, after a custom delay, if no one replies back.☆71Updated 2 months ago
- A pure-Python robots.txt parser with support for modern conventions.☆79Updated last week
- Python client for Typesense: https://github.com/typesense/typesense☆233Updated this week
- GitHub Action for Python Coveralls.io☆48Updated last year
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy data…☆99Updated 11 months ago
- A Python implementation of Lunr.js 🌖☆204Updated 11 months ago
- Ultimate Website Sitemap Parser☆242Updated 2 weeks ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆145Updated 3 months ago
- chromedriver self updated binaries for all platforms☆57Updated 2 months ago
- Library to populate items using XPath and CSS with a convenient API☆47Updated last week
- Simple python wrapper to convert HTML to PDF with headless Chrome via selenium☆73Updated last month
- A Requests-compatible interface for PycURL.☆71Updated 4 months ago
- Caching for HTTPX☆73Updated 4 months ago
- A python module to split file into multiple chunks based on the given size.☆69Updated last year
- FrozenList is a list-like structure that implements collections.abc.MutableSequence and can be made immutable.☆119Updated this week
- A plugin for poetry that allows you to execute scripts defined in your pyproject.toml, just like you can in npm or pipenv☆59Updated 10 months ago