openpreserve/pagelyzer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/openpreserve/pagelyzer)

openpreserve / pagelyzer

Suite of tools for detecting changes in web pages and their rendering

☆56

Alternatives and similar repositories for pagelyzer

Users that are interested in pagelyzer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nik0spapp / sdalg
View on GitHub
Web page segmentation and noise removal
☆55Feb 4, 2024Updated 2 years ago
after12am / summary
View on GitHub
A python module provides content extraction and summarization of a web page even if the web page was broken.
☆18Apr 14, 2023Updated 3 years ago
snap-stanford / MetroMaps
View on GitHub
MetroMaps Release
☆16May 8, 2014Updated 12 years ago
askerlee / topiccloud
View on GitHub
Visualization of topics in a document (documents), aimed to replace word cloud
☆18May 10, 2016Updated 10 years ago
MiuLab / Spk-Dialogue
View on GitHub
Speaker Role Contextual Model for Dialogues
☆15Sep 30, 2017Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
rkrzr / dataset-popular
View on GitHub
A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks.
☆15Feb 9, 2014Updated 12 years ago
pydepta / pydepta
View on GitHub
A python implementation of DEPTA
☆84Jan 14, 2017Updated 9 years ago
socialsensor / storm-focused-crawler
View on GitHub
Collects multimedia content shared through social networks.
☆19Feb 18, 2015Updated 11 years ago
shadyabhi / Reddit-New-Comments-Highlighter
View on GitHub
Highlights new comments in a thread on reddit
☆12Feb 17, 2018Updated 8 years ago
shenshen-hungry / Neural-Rule-Engine
View on GitHub
Rules used in Neural Rule Engine.
☆28Aug 31, 2018Updated 7 years ago
ashish01 / CollinsTagger
View on GitHub
Implementation of Collin's perceptron for structured prediction
☆16Mar 10, 2025Updated last year
AnthonyMRios / relation-extraction-rnn
View on GitHub
Bi-directional LSTM model for relation extraction
☆23Jul 17, 2018Updated 8 years ago
jackdbd / geoviews-geopython-2018
View on GitHub
Material for my talk "Approaching geovisualization and remote sensing with GeoViews" @ GeoPython 2018.
☆15May 11, 2018Updated 8 years ago
fsxfreak / nlp-augment
View on GitHub
A collection of utilities used in exploring data augmentation of low-resource parallel corpuses. …
☆11Sep 6, 2017Updated 8 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
bencheeorg / benchee_csv
View on GitHub
Output your Benchee benchmarks as CSV to generate graphs in your favorite spreadsheet tool!
☆10Dec 10, 2023Updated 2 years ago
kazeto / phillip
View on GitHub
Abductive reasoner for NLP in C++
☆23Dec 17, 2018Updated 7 years ago
michielst / auto-poster
View on GitHub
Automatically post images from a subreddit to an instagram account.
☆10Feb 24, 2022Updated 4 years ago
ryansb / af3ro
View on GitHub
Afero-compliant interface to S3
☆10Sep 29, 2016Updated 9 years ago
rsling / texrex
View on GitHub
texrex web page cleaning & ClaraX random walk crawler
☆11Dec 13, 2021Updated 4 years ago
kno10 / WikipediaEntities
View on GitHub
UNSUPPORTED & OUTDATED: Derive named entities from Wikipedia
☆48Mar 18, 2019Updated 7 years ago
socialjack010 / sql_injection_bruteforce_login
View on GitHub
This is a program written in python which allows you to log in with different sql injections that allow you to bypass some login pages (F…
☆13Feb 23, 2023Updated 3 years ago
jni / streaming-talk
View on GitHub
Resources for a talk about streaming data analysis in Python
☆15Aug 29, 2015Updated 10 years ago
ESSS / pyboost_ipc
View on GitHub
Python bindings for Boost.Interprocess
☆10Dec 20, 2016Updated 9 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
drewhannay / dagger-android-sample
View on GitHub
Sample Android app project using dagger-android
☆10Jul 27, 2017Updated 8 years ago
datalib / libextract
View on GitHub
Extract data from websites using basic statistical magic
☆505Oct 2, 2020Updated 5 years ago
banyh / PyStanfordNLP
View on GitHub
A Python Wrapper of Stanford Chinese Segmenter
☆20Aug 2, 2017Updated 8 years ago
wothke / uade-2.13
View on GitHub
this project moved to bitbucket! WebAudio plugin of UADE
☆13Apr 16, 2021Updated 5 years ago
StanfordHCI / foundry
View on GitHub
Foundry is an interactive, real-time Javascript interface that allows flash teams to be assembled by anyone and tracked in real time.
☆30May 27, 2017Updated 9 years ago
gaskij / rpicampusmap
View on GitHub
Interactive map for the Rensselaer Polytechnic Institute campus.
☆10Jan 7, 2023Updated 3 years ago
formtools / api
View on GitHub
The Form Tools API.
☆15Nov 9, 2019Updated 6 years ago
0mp / kbfsd
View on GitHub
FreeBSD service daemon for KBFS, the Keybase filesystem
☆13Jul 22, 2021Updated 4 years ago
providence-replay / providence
View on GitHub
An open-source session replay tool for single-page applications that uses AI analysis, aggregated trends, and a RAG chatbot to help devel…
☆11Jan 23, 2026Updated 5 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xmb-cipher / fofe-ner
View on GitHub
FOFE NER
☆39Nov 4, 2017Updated 8 years ago
openmrs / openmrs-module-xforms
View on GitHub
A browser based forms module which adds XForms support and related services to XForms clients, like user and patient download.
☆15Apr 28, 2026Updated 2 months ago
arjoly / random-output-trees
View on GitHub
Randomized output tree for multilabel / multi-output regression tasks
☆23Dec 2, 2015Updated 10 years ago
tidwall / murmur3
View on GitHub
Murmur3 hash in Go
☆13Dec 15, 2017Updated 8 years ago
eXascaleInfolab / TRank
View on GitHub
Ranking Entity Types using the Web of Data
☆30Nov 22, 2016Updated 9 years ago
frictionlessdata / datapackage-ui
View on GitHub
Create and validate Data Packages in the browser
☆27Dec 20, 2021Updated 4 years ago
Lattyware / elm-minesweeper
View on GitHub
An implementation of the game "Minesweeper" in Elm.
☆20Jul 27, 2019Updated 6 years ago