zytedata/clear-html

zytedata / clear-html

Remove DIVs, style stuff and normalize HTML preserving structure information

☆14

Alternatives and similar repositories for clear-html

Users that are interested in clear-html are comparing it to the libraries listed below.

scrapy / xtractmime
https://mimesniff.spec.whatwg.org/ implementation for Python
☆13 · Jul 9, 2026 · Updated last week
scrapinghub / shub-workflow
☆14 · Updated this week
zytedata / zyte-spider-templates-project
☆23 · Mar 18, 2026 · Updated 4 months ago
zytedata / python-zyte-api
Python client for Zyte API
☆30 · Jul 16, 2026 · Updated last week
rock64-android / RKDocs
☆10 · Jun 17, 2017 · Updated 9 years ago
scrapy-plugins / scrapy-zyte-api
Zyte API integration for Scrapy
☆43 · Jun 26, 2026 · Updated 3 weeks ago
zytedata / zyte-spider-templates
Spider templates for automatic crawlers.
☆35 · Mar 26, 2026 · Updated 3 months ago
Zhouyi-AIPU / Model_zoo
Zhouyi model zoo (Maintained at https://github.com/Arm-China/Model_zoo)
☆14 · Dec 30, 2024 · Updated last year
ne3x7 / pysymspell
Python port of SymSpell
☆17 · Feb 22, 2019 · Updated 7 years ago
openculinary / knowledge-graph
Migrated to: https://codeberg.org/openculinary/knowledge-graph
☆11 · Aug 21, 2025 · Updated 11 months ago
scrapy / scrapy-lint
A linter for Scrapy projects.
☆22 · Jul 7, 2026 · Updated 2 weeks ago
scrapinghub / web-poet
Web scraping Page Objects core library
☆107 · Jul 10, 2026 · Updated last week
rytilahti / homeassistant-mpris-bridge
Control your Home Assistant media players from your desktop using MPRIS
☆32 · Aug 23, 2024 · Updated last year
onatcipli / quick_interact
A flutter package for showing quick interactions for any widget
☆15 · Sep 25, 2023 · Updated 2 years ago
novitalabs / litelama
lightweight LAMA inference wrapper
☆26 · Sep 28, 2023 · Updated 2 years ago
voltachan / esp8266killer
A good-looking, simple, easy-to-use ESP8266 firmware that is easy to build on! Star, Fork, and Follow, all three!!!
☆15 · Feb 10, 2019 · Updated 7 years ago
scrapinghub / scrapy-autounit
Automatic unit test generation for Scrapy.
☆58 · Jul 12, 2021 · Updated 5 years ago
JibbyJames / data-studio-lighthouse-gauge
A community visualisation for Google Data Studio in the style of the site speed auditing tool Lighthouse gauges.
☆21 · Feb 5, 2023 · Updated 3 years ago
interstellarninja / MarketAgents
Agent based market simulation
☆15 · Aug 10, 2024 · Updated last year
Maverick0351a / Oscillink
Oscillink — Self‑Optimizing Coherent Memory for Embedding Workflows
☆15 · Nov 24, 2025 · Updated 7 months ago
Naramsim / awesome-lego-mindstorms
A list of delightful MINDSTORMS software and resources
☆14 · Mar 10, 2025 · Updated last year
scrapy / protego
A pure-Python robots.txt parser with support for modern conventions.
☆90 · Updated this week
LAiSER-Software / extract-module
LAiSER is a tool that helps learners, educators and employers share trusted and mutually intelligible information about skills.
☆11 · Updated this week
LAION-AI / Desktop-BUD-E_V1.0
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆23 · Oct 10, 2024 · Updated last year
dimitryzub / hotels-scraper-js
Scrape Airbnb, Booking, Hotels.com from a single JavaScript module. ❗No longer maintained.
☆18 · Apr 18, 2023 · Updated 3 years ago
jeanluc243 / humanize
A Dart & Flutter package for translating numbers and dates into a human readable format.
☆18 · Sep 24, 2025 · Updated 9 months ago
scrapy / itemloaders
Library to populate items using XPath and CSS with a convenient API
☆49 · Jul 16, 2026 · Updated last week
graphrag / ms-graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
☆16 · Updated this week
fmoo / twisted-connect-proxy
Default Twisted does not ship with a CONNECT-enabled HTTP(s) proxy. This code provides one.
☆51 · Feb 21, 2017 · Updated 9 years ago
bigdata-pw / florence-tool
The Florence Tool CLI provides a command-line interface for processing images using the Florence-2 model. This tool allows users to apply…
☆16 · Jan 21, 2025 · Updated last year
navalnica / whisper-finetuning-be
Finetuning Whisper ASR model for Belarusian language
☆17 · Feb 16, 2025 · Updated last year
DevAlone / google_foobar_invite_maker
Simple program to get A LOT OF invites to https://foobar.withgoogle.com/
☆31 · Jun 12, 2019 · Updated 7 years ago
frappe / preview_generator
Generates preview image for HTML
☆21 · Jun 12, 2026 · Updated last month
salihagic / rest_api_client
Abstraction for communicating with REST API in flutter projects.
☆12 · May 25, 2026 · Updated last month
activescott / agentmarkdown
An accurate, extensible, and fast HTML-to-markdown converter.
☆23 · Feb 7, 2026 · Updated 5 months ago
Voker57 / qmpdclient
QMPDClient official repository
☆38 · Nov 18, 2015 · Updated 10 years ago
scrapy-plugins / scrapy-jsonschema
Scrapy schema validation pipeline and Item builder using JSON Schema
☆45 · Mar 26, 2021 · Updated 5 years ago
ZhaoyuDeng / eyetracker-raspberrypi
Eye-tracker based on Raspberry Pi
☆21 · May 17, 2020 · Updated 6 years ago
xojoc / cleanurl
Remove clutter from URLs and return a canonicalized version
☆21 · Jun 3, 2024 · Updated 2 years ago