fhightower / html-to-json
Convert HTML to JSON. Can also (intelligently) convert HTML tables to JSON (using table headers (if available) as keys in the resulting JSON).
☆50Updated last year
Alternatives and similar repositories for html-to-json:
Users that are interested in html-to-json are comparing it to the libraries listed below
- Parse numbers written in natural language☆110Updated 5 months ago
- A compression AGSI middleware using brotli.☆71Updated last year
- A pure-Python robots.txt parser with support for modern conventions.☆61Updated 2 weeks ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆52Updated 2 months ago
- Web scraping Page Objects core library☆97Updated last month
- Common interface for data container classes☆67Updated last month
- Fast and robust date extraction from web pages, with Python or on the command-line☆123Updated 2 months ago
- ☆84Updated 3 weeks ago
- python json-rpc client/server without boilerplate☆36Updated last month
- Python client for Typesense: https://github.com/typesense/typesense☆188Updated this week
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆112Updated 2 weeks ago
- universal character encoding detector☆58Updated 6 months ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆65Updated 2 years ago
- Browser reload with uvicorn!☆68Updated 2 years ago
- We don't like positional args, we like keyword only args! 🎉☆89Updated 2 years ago
- A Python library for working with and comparing language codes.☆17Updated 2 months ago
- Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modificati…☆102Updated last year
- Zyte Automatic Extraction integration for Scrapy☆56Updated 3 years ago
- Library to populate items using XPath and CSS with a convenient API☆47Updated this week
- Productivity tools for popular open source repos, used by pydantic☆59Updated last year
- An Elasticsearch Python ORM based on Pydantic.☆125Updated last year
- FastAPI without reliance on CDNs for docs☆47Updated 4 months ago
- Fast and memory-efficient Python PDF Parser based on xpdf sources☆41Updated last year
- chromedriver self updated binaries for all platforms☆53Updated last month
- Python Markdown extension to include local or remote files☆58Updated last year
- FrozenList is a list-like structure that implements collections.abc.MutableSequence and can be made immutable.☆99Updated this week
- Python CLI using type hints and docstrings.☆20Updated 8 months ago
- Python component for Avataaars - port of https://github.com/fangpenlin/avataaars☆74Updated 11 months ago
- A plugin for poetry that allows you to execute scripts defined in your pyproject.toml, just like you can in npm or pipenv☆58Updated last week
- Asynchronous logging for Python and asyncio☆148Updated last year