rkrzr/dataset-popular

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rkrzr/dataset-popular)

rkrzr / dataset-popular

A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks.

☆15

Alternatives and similar repositories for dataset-popular

Users that are interested in dataset-popular are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tdt / rdf2html
View on GitHub
a javascript library to visualize an array of RDF triples into an HTML page
☆15Feb 8, 2016Updated 10 years ago
OpenTransport / Stations
View on GitHub
A knowledge center for transport data
☆15Oct 29, 2013Updated 12 years ago
scrapy / pypydispatcher
View on GitHub
A fork of http://pydispatcher.sourceforge.net/ with PyPy support
☆16Jul 3, 2017Updated 9 years ago
blaze / datafabric
View on GitHub
A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.
☆13Feb 12, 2016Updated 10 years ago
nik0spapp / sdalg
View on GitHub
Web page segmentation and noise removal
☆55Feb 4, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
croqaz / Stones
View on GitHub
🗿Stones: Persistent key-value containers, compatible with Python dict
☆17Jul 15, 2024Updated 2 years ago
LinkedSoftwareDependencies / Components-Generator.js
View on GitHub
⚙️ Generate Components.js component files from TypeScript
☆14Updated this week
scrapy / scrapy-bench
View on GitHub
A CLI for benchmarking Scrapy.
☆32Jun 28, 2025Updated last year
oeg-upm / transmodel-ontology
View on GitHub
A repository to work on the transmodel ontology that provides support to the NeTEx model
☆13Feb 17, 2021Updated 5 years ago
nlpub / russe
View on GitHub
RUSSE: Russian Semantic Evaluation.
☆15Mar 1, 2022Updated 4 years ago
SolidBench / SolidBench.js
View on GitHub
A benchmark for Solid to simulate vaults with social network data.
☆11May 14, 2026Updated 2 months ago
scfc / bison-php
View on GitHub
Extension to bison to generate PHP code
☆15Mar 25, 2012Updated 14 years ago
ShinyTrinkets / twofold.ts
View on GitHub
TwoFold (2✂︎f). Text files breathe fire.
☆23Jan 28, 2026Updated 6 months ago
linkedconnections / linked-connections-server
View on GitHub
Express based server that exposes Linked Connections.
☆13Jan 4, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lopuhin / kaggle-jigsaw-2019
View on GitHub
☆14Jun 27, 2019Updated 7 years ago
tomayac / ldf-client
View on GitHub
Polymer Linked Data Fragments client
☆18Jul 31, 2016Updated 9 years ago
mattpage / ternary-search-tree
View on GitHub
A ternary search tree for Node.js
☆11Feb 28, 2026Updated 5 months ago
Datafable / rolling-blackout-belgium
View on GitHub
Rolling blackout plan in Belgium
☆18Mar 16, 2015Updated 11 years ago
hyperrail / hyperrail-for-android
View on GitHub
Hyperrail native android app
☆14Feb 10, 2026Updated 5 months ago
eriknw / dask-patternsearch
View on GitHub
Scalable pattern search optimization with dask
☆22Apr 12, 2017Updated 9 years ago
zytedata / html-text
View on GitHub
☆20Oct 6, 2025Updated 9 months ago
pieterprovoost / wktmap
View on GitHub
☆16Feb 3, 2026Updated 5 months ago
RDFLib / rdflib-leveldb
View on GitHub
A LevelDB-backed RDFLib Store for RDFLib=>6.0
☆19May 23, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tiefling-cat / ru-syntax
View on GitHub
Repository for ru-syntax command line tool.
☆15Mar 8, 2022Updated 4 years ago
awery / nmbs-api
View on GitHub
Unofficial documentation of the NMBS/SNCB API
☆13Apr 7, 2015Updated 11 years ago
public-transport / why-linked-open-transit-data
View on GitHub
Why do we need linked open public transport data?
☆21Jul 23, 2021Updated 5 years ago
openknowledge-archive / dpm-old
View on GitHub
**DEPRECATED** - see https://github.com/frictionlessdata/. [[Data package manager (dpm) is a command line tool and Python library for wor…
☆15Jun 21, 2014Updated 12 years ago
RMLio / RML-Processor
View on GitHub
☆17Apr 26, 2021Updated 5 years ago
TREEcg / event-stream-client
View on GitHub
Deprecated! Use the rdf-connect/ldes-client instead
☆14Mar 5, 2024Updated 2 years ago
ziman / idris-bytes
View on GitHub
FFI-based byte buffers for Idris
☆10Jun 21, 2019Updated 7 years ago
halolimat / SpExtor
View on GitHub
SpExtor: Sparse Entity Extractor
☆11Feb 10, 2020Updated 6 years ago
okfn / data-catalog-spec
View on GitHub
Data Catalog Specification (Schema and Protocol)
☆21May 25, 2018Updated 8 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
csarven / mayktso
View on GitHub
mayktso: encounters at an endpoint
☆20Jul 8, 2024Updated 2 years ago
rafaelcapucho / scrapy-eagle
View on GitHub
Scrapy Eagle is a tool that allow us to run any Scrapy based project in a distributed fashion and monitor how it is going on and how many…
☆24Sep 4, 2020Updated 5 years ago
opengeospatial / ontology-crs
View on GitHub
☆15Jul 21, 2026Updated last week
paulperry / kaggle
View on GitHub
Kaggle competition results
☆20Jan 4, 2019Updated 7 years ago
OpenTransport / vocabulary
View on GitHub
A vocabulary to describe transport systems
☆36Feb 2, 2015Updated 11 years ago
infoculture / mosopendata
View on GitHub
Parser and data from data.mos.ru. / Парсер и данные для портала открытых данных Москвы data.mos.ru
☆18Aug 24, 2014Updated 11 years ago
trickvi / datapackage
View on GitHub
Manage and load dataprotocols.org Data Packages
☆27Sep 17, 2015Updated 10 years ago