A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks.
☆15Feb 9, 2014Updated 12 years ago
Alternatives and similar repositories for dataset-popular
Users that are interested in dataset-popular are comparing it to the libraries listed below
Sorting:
- Tools for web page segmentation. In development☆17Nov 7, 2018Updated 7 years ago
- Failover AWS Spot Instances☆11Dec 8, 2017Updated 8 years ago
- Web page segmentation and noise removal☆55Feb 4, 2024Updated 2 years ago
- Data science tools from Moz☆23Jan 11, 2017Updated 9 years ago
- Age classification from text using PAN16, blogs, Fisher Callhome, and Cancer Forum☆18Jul 1, 2022Updated 3 years ago
- Classifies webpages into categories defined in DMOZ dataset☆40Dec 14, 2015Updated 10 years ago
- Agent fixing SWE bench issues☆19May 21, 2024Updated last year
- Scalable pattern search optimization with dask☆22Apr 12, 2017Updated 8 years ago
- a series of trie testing things☆21Apr 9, 2017Updated 8 years ago
- A simple CRUD wrapper around Amazon DynamoDB☆24Sep 24, 2019Updated 6 years ago
- The Clever Algorithms project is an effort to describe a large number of algorithmic techniques from the field of Artificial Intelligence…☆29Oct 28, 2018Updated 7 years ago
- extract difference between two html pages☆32Feb 10, 2026Updated 3 weeks ago
- Participate in the 4th U.S. National Action Plan for Open Government☆13Jun 8, 2018Updated 7 years ago
- openapi of all third-party☆10Feb 26, 2026Updated last week
- A CLI for benchmarking Scrapy.☆32Jun 28, 2025Updated 8 months ago
- This library facilitates creating OpenAPI (Swagger) document for Python projects.☆12Jan 4, 2021Updated 5 years ago
- Apache Spark based framework for analysis A/B experiments☆15Nov 3, 2024Updated last year
- Causality in Knowledge Graphs☆11Oct 12, 2022Updated 3 years ago
- 🌩️ The Deep Learning framework based on Lightning☆11Dec 11, 2025Updated 2 months ago
- An Intellij Plugin that generates unit test methods with meaningful names based in described behaviours with @should tags in methods ja…☆10Dec 14, 2025Updated 2 months ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Dec 17, 2021Updated 4 years ago
- Python bindings for libsrcml☆17Aug 25, 2025Updated 6 months ago
- ☆11Jul 20, 2021Updated 4 years ago
- A sandbox for opensource demonstrations of GitHub☆14Apr 13, 2016Updated 9 years ago
- audiofile.cc☆16Jun 27, 2011Updated 14 years ago
- Application for checking performance of elevator group system in building using simulation method.☆12Nov 9, 2017Updated 8 years ago
- Configuration system geared towards Python ML projects☆11Apr 30, 2023Updated 2 years ago
- BlockCAT token sale smart contracts.☆11Oct 19, 2017Updated 8 years ago
- A starting Python-Flask web app template with accompanying guide☆12Jan 18, 2025Updated last year
- A Django App for HTML GUI applications, with easy Python/JS interoperation. It is a porting version of Eel.☆22Jul 28, 2018Updated 7 years ago
- Faster replacement for Python's urlparse module☆45Sep 30, 2018Updated 7 years ago
- A blockchain simulator based on SimPy in python.☆14Dec 18, 2018Updated 7 years ago
- Python 2/3 compatible .npz CIFAR-10 dataset☆10Mar 1, 2017Updated 9 years ago
- Stuff related to scraping the Code Review StackExchange☆12Jan 19, 2023Updated 3 years ago
- Simple, dump sprite sheet generator☆23Jan 29, 2016Updated 10 years ago
- OAuth 2.0 provider written in python. Can work without database.☆19Apr 26, 2023Updated 2 years ago
- ☆12Jan 31, 2015Updated 11 years ago
- Web-based IDE for Python, Scheme, and SQL intended for students taking CS 61A.☆11Dec 10, 2022Updated 3 years ago
- E commerce-Database driven web application using Flask☆10Oct 1, 2020Updated 5 years ago