dossier / html-highlighter
Highlight and select phrases in HTML pages.
☆24 Updated 5 years ago
Alternatives and similar repositories for html-highlighter
Users who are interested in html-highlighter are comparing it to the libraries listed below.
 - ☆44 Updated 9 years ago
 - Index URLs in Common Crawl ☆195 Updated 8 years ago
 - Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri ☆48 Updated 3 years ago
 - An attempt at creating a silver/gold standard dataset for backtesting yesterday & today's content-extractors ☆35 Updated 10 years ago
 - Open source large document set visualization platform ☆270 Updated 2 years ago
 - Semanticizest: dump parser and client ☆20 Updated 9 years ago
 - FacetView is a pure javascript frontend for ElasticSearch. ☆291 Updated 10 years ago
 - Named-Entity Recognition extension for Google Refine / OpenRefine ☆73 Updated 8 years ago
 - mltk - Moz Language Tool Kit ☆12 Updated 10 years ago
 - An intelligent reading agent that understands text and translates it into Wikidata statements. ☆116 Updated 9 years ago
 - Scripts and microservice to feed an ElasticSearch with Wikidata and Inventaire entities, and keep those up-to-date ☆41 Updated 4 years ago
 - a pure javascript frontend for ElasticSearch search indices. ☆80 Updated 7 years ago
 - Solr Dictionary Annotator (Microservice for Spark) ☆71 Updated 5 years ago
 - Virtual patent marking crawler at iproduct.epfl.ch ☆15 Updated 8 years ago
 - command-line tool to extract taxonomies from Wikidata ☆128 Updated 6 years ago
 - "Old SFM" -- manage rules and streams from social data sources, starting with twitter. ☆86 Updated 2 years ago
 - Solrstrap is a Query-Result interface for Solr written in JavaScript, HTML and CSS ☆87 Updated 8 years ago
 - Browser add-on and web server to support collection and analysis of web browsing data. ☆13 Updated 9 years ago
 - Version 1.0 of the CrowdTruth Framework for crowdsourcing ground truth data, for training and evaluation of cognitive computing systems. … ☆60 Updated 7 years ago
 - Mirror of Apache Stanbol (incubating) ☆114 Updated last year
 - Pattern-of-Behavior Search Tool ☆11 Updated 3 years ago
 - IXA pipes Named Entity Tagger (http://ixa2.si.ehu.es/ixa-pipes). ☆33 Updated 6 years ago
 - SKOS Support for Apache Lucene and Solr ☆56 Updated 4 years ago
 - Extract statistics from Wikipedia Dump files. ☆26 Updated 4 years ago
 - A repo that contains outgoing links from DBpedia ☆50 Updated 5 years ago
 - General Architecture for Text Engineering ☆49 Updated 9 years ago
 - A cross-platform command line tool for parallelised content extraction and analysis. ☆247 Updated 2 weeks ago
 - Named Entities Recognition Annotator Tool for Europeana Newspapers ☆61 Updated 7 years ago
 - ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (image… ☆96 Updated 7 years ago
 - Quickly analyze and explore email with advanced analytics and visualization. ☆56 Updated 4 years ago