chuanconggao / html2jsonLinks
Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.
☆23Updated 7 months ago
Alternatives and similar repositories for html2json
Users that are interested in html2json are comparing it to the libraries listed below
Sorting:
- RSS feed reader for Python 3☆88Updated 3 years ago
- URL normalization for Python☆99Updated 9 months ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆193Updated 3 years ago
- Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group.☆84Updated 5 years ago
- Python library for extracting text from various file formats (for indexing).☆114Updated 3 years ago
- Atom, RSS and JSON feed parser for Python 3☆117Updated 3 years ago
- Analyze scraped data☆46Updated 6 years ago
- Python module for Named Entity Recognition (NER) using natural language processing.☆13Updated 4 years ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML.☆151Updated 5 years ago