xtannier/WebAnnotator

WebAnnotator is a tool for annotating Web pages. It is implemented as a Firefox extension (https://addons.mozilla.org/en-US/firefox/addon/webannotator/), allowing annotation of both offline and online pages. The HTML rendering is fully preserved, and all annotations consist of new HTML spans with specific styles. WebAnnotator provides …

☆48

Alternatives and similar repositories for WebAnnotator

Users that are interested in WebAnnotator are comparing it to the libraries listed below.

TeamHG-Memex / url-summary
Show summary of a large number of URLs in a Jupyter Notebook
☆19 · Apr 8, 2026 · Updated 3 months ago
scrapinghub / webstruct
NER toolkit for HTML data
☆259 · May 3, 2024 · Updated 2 years ago
scrapinghub / page_finder
Find which links on a web page are pagination links
☆29 · Jan 12, 2017 · Updated 9 years ago
seagatesoft / webdext
Intelligent Web Data Extractor
☆74 · Dec 5, 2022 · Updated 3 years ago
mattilyra / glove2h5
A small utility for converting Stanford GloVe vectors to HDF5 / NumPy
☆12 · Apr 4, 2017 · Updated 9 years ago
adsva / python-wapiti
Python bindings for libwapiti
☆67 · Dec 9, 2019 · Updated 6 years ago
scrapinghub / webpager
Paginating the web
☆37 · Feb 11, 2014 · Updated 12 years ago
TeamHG-Memex / extract-html-diff
Extract the difference between two HTML pages
☆33 · Apr 8, 2026 · Updated 3 months ago
pydepta / pydepta
A Python implementation of DEPTA
☆84 · Jan 14, 2017 · Updated 9 years ago
zytedata / flattering
Flatten, format, and export any JSON-like data to CSV (or any other string output).
☆17 · Sep 13, 2021 · Updated 4 years ago
rkrzr / dataset-popular
A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks.
☆15 · Feb 9, 2014 · Updated 12 years ago
ethanhe42 / named-entity-recognition
Named entity recognition with a recurrent neural network (RNN) in TensorFlow
☆16 · Feb 9, 2022 · Updated 4 years ago
scrapinghub / aile
Automatic Item List Extraction
☆85 · Jun 15, 2016 · Updated 10 years ago
matthewruttley / mozclassify
Algorithms for URL Classification
☆19 · Apr 13, 2015 · Updated 11 years ago
marcusklang / wikiforia
A Utility Library for Wikipedia dumps
☆33 · Feb 24, 2017 · Updated 9 years ago
EN10 / KerasInception
Google Inception-V3 with Keras
☆11 · Feb 17, 2018 · Updated 8 years ago
TeamHG-Memex / soft404
A classifier for detecting soft 404 pages
☆65 · Apr 8, 2026 · Updated 3 months ago
RubenVerborgh / Refine-NER-Extension
Named-Entity Recognition extension for Google Refine / OpenRefine
☆74 · Jun 21, 2017 · Updated 9 years ago
shritesh / brainfuck-rs-wasm
A Brainfuck interpreter written in Rust and compiled to WebAssembly
☆10 · Dec 4, 2017 · Updated 8 years ago
rgrishman / ice
Ice is a rapid information extraction customizer
☆15 · Apr 26, 2021 · Updated 5 years ago
chrislee973 / bible-semantic-search
☆17 · Mar 15, 2023 · Updated 3 years ago
datalib / StatsCounter
Python's missing statistical Swiss Army knife
☆15 · Aug 25, 2015 · Updated 10 years ago
SMerrony / daikin2mqtt
A software bridge between certain popular Daikin™ HVAC units and MQTT.
☆14 · May 6, 2022 · Updated 4 years ago
lestrrat-go / urlenc
Marshal/Unmarshal interface for structs that can encode/decode themselves to URL query strings
☆11 · Jun 6, 2018 · Updated 8 years ago
rmax / databrewer
The missing datasets manager. Like Homebrew but for datasets. CLI tool to search and discover datasets!
☆41 · May 29, 2017 · Updated 9 years ago
clips / hades
Repository for the CLiPS HAte speech DEtection System [HADES].
☆25 · Apr 5, 2018 · Updated 8 years ago
scrapinghub / skinfer
Skinfer is a tool for inferring and merging JSON schemas
☆141 · Apr 24, 2024 · Updated 2 years ago
liaocyintl / web-segment
Segment an HTML document into structural data
☆12 · Jan 15, 2019 · Updated 7 years ago
TeamHG-Memex / sitehound-frontend
Site Hound (previously THH) is a Domain Discovery Tool
☆24 · Apr 8, 2026 · Updated 3 months ago
antonyms / CMultiVec
Fast C++ implementation of multiple prototype word representation training based on Huang Socher 2012
☆21 · May 10, 2016 · Updated 10 years ago
ziman / idris-bytes
FFI-based byte buffers for Idris
☆10 · Jun 21, 2019 · Updated 7 years ago
dataarts / turnhttp
TURN Rest API Server
☆13 · Feb 6, 2015 · Updated 11 years ago
datamade / parserator
A toolkit for making domain-specific probabilistic parsers
☆812 · Sep 26, 2024 · Updated last year
citp / TheWebNeverForgets
Public code release for The Web Never Forgets paper
☆69 · Dec 13, 2021 · Updated 4 years ago
stevetjoa / musicsearch
Music search using locality sensitive hashing. Just a prototype; not for production. Old grad school code.
☆15 · Sep 15, 2014 · Updated 11 years ago
kxtells / vague-places
☆14 · Dec 24, 2016 · Updated 9 years ago
voider1 / hyperdav
WebDAV client for Rust
☆10 · Jun 6, 2018 · Updated 8 years ago
liaocyintl / hybrid-image-matching
The Hybrid Image Matching (HIM) method, which combines a deep learning approach with feature point matching for image classification.
☆15 · Jan 9, 2019 · Updated 7 years ago
piskvorky / sparsesvd
Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition
☆55 · Aug 16, 2013 · Updated 12 years ago