fhightower / html-to-json
Convert HTML to JSON. Can also convert HTML tables to JSON, using the table headers (if available) as keys in the resulting JSON.
☆52 Updated 2 years ago
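The table behavior described above (header cells becoming keys) can be illustrated with a minimal sketch built only on Python's standard library. This is not html-to-json's own code or API: the class `TableToRecords` and the function `table_to_records` are hypothetical names chosen for illustration, and the sketch assumes a single, non-nested table whose header row consists only of `<th>` cells.

```python
import json
from html.parser import HTMLParser


class TableToRecords(HTMLParser):
    """Hypothetical helper: collect the cells of a simple, non-nested table."""

    def __init__(self):
        super().__init__()
        self.header = []    # texts of the first all-<th> row
        self.rows = []      # one list of cell texts per data row
        self._row = None    # (tag, text) pairs of the row being read
        self._cell = None   # text fragments of the cell being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("th", "td") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("th", "td") and self._cell is not None:
            self._row.append((tag, "".join(self._cell).strip()))
            self._cell = None
        elif tag == "tr" and self._row:
            texts = [text for _, text in self._row]
            is_header = all(cell_tag == "th" for cell_tag, _ in self._row)
            if is_header and not self.header:
                self.header = texts
            else:
                self.rows.append(texts)
            self._row = None


def table_to_records(html):
    """Return a list of dicts keyed by header text, or a list of lists if no header."""
    parser = TableToRecords()
    parser.feed(html)
    if parser.header:
        return [dict(zip(parser.header, row)) for row in parser.rows]
    return parser.rows


sample = (
    "<table><tr><th>Name</th><th>Age</th></tr>"
    "<tr><td>Ada</td><td>36</td></tr>"
    "<tr><td>Alan</td><td>41</td></tr></table>"
)
print(json.dumps(table_to_records(sample)))
```

When the table has no header row, the sketch falls back to returning each row as a plain list of cell texts, which mirrors the "if available" condition in the description.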
Alternatives and similar repositories for html-to-json
Users who are interested in html-to-json compare it to the libraries listed below.
- Parse numbers written in natural language ☆124 Updated last year
- Common interface for data container classes ☆68 Updated 2 weeks ago
- Universal character encoding detector ☆63 Updated last year
- Simple Python wrapper to convert HTML to PDF with headless Chrome via Selenium ☆74 Updated last week
- URL normalization for Python ☆99 Updated 8 months ago
- Python client for Typesense: https://github.com/typesense/typesense ☆231 Updated last month
- Parsing JavaScript objects into Python data structures ☆217 Updated 4 months ago
- URLExtract is a Python class for collecting (extracting) URLs from a given text based on locating TLDs. ☆272 Updated last year
- Self-updating chromedriver binaries for all platforms ☆57 Updated 3 weeks ago
- Web scraping Page Objects core library ☆104 Updated 2 weeks ago
- Extract price amount and currency symbol from a raw text string ☆346 Updated 2 months ago
- A library for working with HTML/CSS color formats in Python. ☆172 Updated 2 months ago
- A Python module for returning data about countries, ISO info and states/provinces within them. ☆158 Updated last month
- Library to populate items using XPath and CSS with a convenient API ☆47 Updated 3 weeks ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆122 Updated last month
- A Python-based HTML to text conversion library, command line client and Web service. ☆331 Updated last month
- ☆85 Updated 7 months ago
- Caching for HTTPX ☆73 Updated 2 months ago
- Run a Scrapy spider programmatically from a script or a Celery task, no project required. ☆121 Updated last year
- Automatically close issues that have a label, after a custom delay, if no one replies back. ☆71 Updated last month
- An async persistent cache for aiohttp requests ☆148 Updated 3 weeks ago
- A Python library for working with and comparing language codes. ☆353 Updated 7 months ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text. ☆56 Updated 11 months ago
- A pure-Python robots.txt parser with support for modern conventions. ☆75 Updated 3 weeks ago
- Python API for PDF documents ☆124 Updated last year
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity ☆76 Updated last year
- A modern CSS selector implementation for BeautifulSoup ☆260 Updated last week
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml ☆86 Updated 2 months ago
- Python wrapper for the Meilisearch API ☆571 Updated 2 weeks ago
- Library that helps use Puppeteer in Scrapy. ☆52 Updated 4 months ago