Tools for web page segmentation. In development.
☆17 · Nov 7, 2018 · Updated 7 years ago
Alternatives and similar repositories for segmentations
Users who are interested in segmentations are comparing it to the libraries listed below.
- Web page segmentation and noise removal ☆55 · Feb 4, 2024 · Updated 2 years ago
- Tools for web page segmentation evaluation ☆13 · Nov 6, 2019 · Updated 6 years ago
- A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks. ☆15 · Feb 9, 2014 · Updated 12 years ago
- Suite of tools for detecting changes in web pages and their rendering ☆55 · Dec 17, 2023 · Updated 2 years ago
- Failover AWS Spot Instances ☆11 · Dec 8, 2017 · Updated 8 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data. ☆13 · Feb 12, 2016 · Updated 10 years ago
- Age classification from text using PAN16, blogs, Fisher Callhome, and Cancer Forum ☆18 · Jul 1, 2022 · Updated 3 years ago
- Python3 & Flask connector for Rich Filemanager ☆16 · Apr 30, 2018 · Updated 7 years ago
- Code for "Web Page Segmentation Revisited: Evaluation Framework and Dataset", accepted as a resources paper at CIKM 2020 ☆14 · Jan 13, 2023 · Updated 3 years ago
- Scalable pattern search optimization with dask ☆22 · Apr 12, 2017 · Updated 8 years ago
- Extract structured data from HTML and XML documents like a boss. ☆51 · Dec 6, 2024 · Updated last year
- A Python library to detect and extract listing data from HTML pages. ☆108 · May 5, 2017 · Updated 8 years ago
- Kaggle competition results ☆20 · Jan 4, 2019 · Updated 7 years ago
- Scrapy Eagle is a tool that allows us to run any Scrapy-based project in a distributed fashion and monitor how it is going and how many… ☆24 · Sep 4, 2020 · Updated 5 years ago
- A series of trie testing things ☆21 · Apr 9, 2017 · Updated 8 years ago
- A simple CRUD wrapper around Amazon DynamoDB ☆24 · Sep 24, 2019 · Updated 6 years ago
- Extract the difference between two HTML pages ☆32 · Feb 10, 2026 · Updated 2 weeks ago
- The Clever Algorithms project is an effort to describe a large number of algorithmic techniques from the field of Artificial Intelligence… ☆29 · Oct 28, 2018 · Updated 7 years ago
- Field types for allowing file and image uploads to Amazon S3 (as well as default local storage) in Flask-Admin. ☆27 · Jul 14, 2023 · Updated 2 years ago
- Easy to use pattern matching and information extraction for Python ☆41 · Nov 16, 2023 · Updated 2 years ago
- Automatically convert hardcoded links to assets in your project into dynamic links for your web framework ☆35 · Feb 7, 2021 · Updated 5 years ago
- This library facilitates creating OpenAPI (Swagger) documents for Python projects. ☆12 · Jan 4, 2021 · Updated 5 years ago
- Apache Spark-based framework for analyzing A/B experiments ☆15 · Nov 3, 2024 · Updated last year
- 🌩️ The Deep Learning framework based on Lightning ☆11 · Dec 11, 2025 · Updated 2 months ago
- Structured Data Extractor. An application to extract structured data from web pages. It uses Data Extraction Based on Partial Tree Alignm… ☆49 · Jun 9, 2012 · Updated 13 years ago
- An attempt at creating a gold standard dataset for backtesting yesterday & today's content-extractors ☆35 · Mar 19, 2015 · Updated 10 years ago
- A Django App for HTML GUI applications, with easy Python/JS interoperation. It is a port of Eel. ☆22 · Jul 28, 2018 · Updated 7 years ago
- Extract (DOM tree) repetitions from a webpage ☆12 · Jan 13, 2014 · Updated 12 years ago
- Application for checking the performance of an elevator group system in a building using simulation. ☆12 · Nov 9, 2017 · Updated 8 years ago
- Remote TestNG ☆12 · Feb 22, 2025 · Updated last year
- Configuration system geared towards Python ML projects ☆11 · Apr 30, 2023 · Updated 2 years ago
- A starting Python-Flask web app template with accompanying guide ☆12 · Jan 18, 2025 · Updated last year
- Unix shell written in Rust ☆13 · May 4, 2017 · Updated 8 years ago
- Classifies webpages into categories defined in DMOZ dataset ☆40 · Dec 14, 2015 · Updated 10 years ago
- Faster replacement for Python's urlparse module ☆45 · Sep 30, 2018 · Updated 7 years ago
- Growl notification script for WeeChat. ☆29 · Dec 10, 2014 · Updated 11 years ago
- A blockchain simulator based on SimPy in Python. ☆14 · Dec 18, 2018 · Updated 7 years ago
- The movie website design using HTML, pure CSS, and Vanilla JS ☆12 · Mar 23, 2023 · Updated 2 years ago
- Enables one Django project to authenticate via a second Django project ***SEEKING CONTRIBUTORS*** ☆11 · May 25, 2022 · Updated 3 years ago