zytedata / clear-html
Remove DIVs, style stuff and normalize HTML preserving structure information
☆10Updated 2 months ago
Alternatives and similar repositories for clear-html:
Users that are interested in clear-html are comparing it to the libraries listed below
- https://mimesniff.spec.whatwg.org/ implementation for Python☆13Updated last year
- Remote web browser automation.☆21Updated 10 months ago
- Fetch all GitHub issues for a repository☆13Updated 8 months ago
- The official Python library for Formulaic☆16Updated last year
- ☆20Updated 3 weeks ago
- Dockerized FastAPI wrapper around the recognize-anything image recognition models☆25Updated last year
- Loadable spellfix1 extension for sqlite as python package☆26Updated last year
- Python client for txtai☆14Updated this week
- A fast TUI application (with optional webui) to visually navigate and inspect JSON and JSONL data. Easily localize parse errors in large …☆13Updated 6 months ago
- pyppeteer stealth plugin, attempts to look like a normal browser☆22Updated 6 months ago
- convert natural language into technical diagrams☆14Updated 4 months ago
- Functional composable pipelines allowing clean separation of the business logic and its implementation☆11Updated 10 months ago
- MCP server for Nile Database - Manage and query databases, tenants, users, auth using LLMs☆14Updated last month
- Hybrid Search (BM25 & Vector) with SQLite☆15Updated 8 months ago
- LLM plugin for asking questions of LLM's own documentation, and related packages☆14Updated last week
- Multi-agent workflows and complex Agent interactions, both via YAML manifest and programmatic usage. Pydantic-AI and LiteLLM backends. Hu…☆16Updated this week
- ☆14Updated this week
- Define and implement any functions on the fly with LLMs☆11Updated 11 months ago
- Automatically pass your funcions defined in Python to ChatGPT have it call them back seemlessly.☆13Updated last year
- Neural search engine for discovering semantically similar Python repositories on GitHub☆28Updated last year
- Check if timestamp falls within specific boundaries☆11Updated last year
- An LLM playground similar to the OpenAI API playground☆21Updated last year
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆35Updated last year
- ATUI - Assistant Textual User Interface☆22Updated last year
- Scripts and ideas to manage tons and tons of images and movies☆17Updated last month
- Search a JSON path and get the value fast☆22Updated 2 months ago
- Datasette plugin for searching all searchable tables at once☆24Updated 7 months ago
- AI-native SaaS framework that builds full-stack apps using autonomous AI agents☆14Updated this week
- 360M model running in the browser on WebGPU☆21Updated 8 months ago
- A Voice Assistant in your Browser.☆21Updated last month