Extract structured data from HTML and XML documents like a boss.
☆51Dec 6, 2024Updated last year
Alternatives and similar repositories for python-xextract
Users that are interested in python-xextract are comparing it to the libraries listed below
Sorting:
- command line python scripts for humans☆13Feb 20, 2026Updated last week
- 对dbpedia和百科采集而来的语料进行清洗,得到合适的三元组☆15Jun 24, 2017Updated 8 years ago
- An open-source framework that allows you to easily monitor your web applications using end-end browser tests.☆15Apr 17, 2021Updated 4 years ago
- Python3 & Flask connector for Rich Filemanager☆16Apr 30, 2018Updated 7 years ago
- A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks.☆15Feb 9, 2014Updated 12 years ago
- Powerful Python dict subclass(es) providing aliasing & attribute access☆33Oct 20, 2025Updated 4 months ago
- This is the repo for the Giotto-tda use-cases challenge 2020.☆23May 3, 2021Updated 4 years ago
- Diazo applies a static HTML theme to a dynamic website☆42Updated this week
- Scrapy Eagle is a tool that allow us to run any Scrapy based project in a distributed fashion and monitor how it is going on and how many…☆24Sep 4, 2020Updated 5 years ago
- Utility for asserting the structure and content of HTML in python.☆24May 4, 2020Updated 5 years ago
- extract difference between two html pages☆32Feb 10, 2026Updated 2 weeks ago
- Dental Practice Management Software.☆32Oct 18, 2022Updated 3 years ago
- Scrapy Pyppeteer Demo☆24Jul 13, 2018Updated 7 years ago
- Easy to use pattern matching and information extraction for Python☆41Nov 16, 2023Updated 2 years ago
- Field types for allowing file and image uploads to Amazon S3 (as well as default local storage) in Flask-Admin.☆27Jul 14, 2023Updated 2 years ago
- Automatically convert hardcoded links to assets in your project, to dynamic links for your web framework☆35Feb 7, 2021Updated 5 years ago
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- Preparing DMOZ dataset for my n-Gram LM-based URL classification research☆31Aug 30, 2014Updated 11 years ago
- 💻 CLI for reporting events to Faros platform☆14Jan 30, 2026Updated last month
- Apache Spark based framework for analysis A/B experiments☆15Nov 3, 2024Updated last year
- This library facilitates creating OpenAPI (Swagger) document for Python projects.☆12Jan 4, 2021Updated 5 years ago
- A simple version of the MAX Object Detector Web App rewritten in python for use in the MAX tutorial☆10Mar 31, 2021Updated 4 years ago
- Use ipywidget in your Flask webserver☆37May 31, 2023Updated 2 years ago
- Output scrapy statistics to graphite/carbon☆54Mar 9, 2013Updated 12 years ago
- A Simple Web Crawler from Scratch.☆11Dec 2, 2017Updated 8 years ago
- Network tools API - geoip, dns, nmap, whois, ipwhois, ipcalc☆11Mar 19, 2021Updated 4 years ago
- FileReader及文件流处理方案 ·基于Promise管理文件异步上传 ·文件上传的两种方案 ·大文件切片上传 ·实现断点续传和文件秒传 ·基于Node/Express的服务器端处理☆11Mar 26, 2023Updated 2 years ago
- A starting Python-Flask web app template with accompanying guide☆12Jan 18, 2025Updated last year
- Extract (DOM tree) repetitions from a webpage☆12Jan 13, 2014Updated 12 years ago
- A Django App for HTML GUI applications, with easy Python/JS interoperation. It is a porting version of Eel.☆22Jul 28, 2018Updated 7 years ago
- ODK Sample Forms☆12Mar 23, 2019Updated 6 years ago
- Causal Impact of an intervention integrated with control group selection☆10Sep 11, 2022Updated 3 years ago
- Rossmann Store Sales: https://www.kaggle.com/c/rossmann-store-sales☆10May 13, 2018Updated 7 years ago
- Remote TestNG☆12Feb 22, 2025Updated last year
- BlockCAT token sale smart contracts.☆11Oct 19, 2017Updated 8 years ago
- A client-server chat app in Python☆40May 3, 2023Updated 2 years ago
- Tools for convenient interface creation over various types of data in a declarative way.☆13Sep 1, 2020Updated 5 years ago
- ☆12Aug 7, 2025Updated 6 months ago
- Repository for my Hypothesis training course☆11Sep 30, 2016Updated 9 years ago