dossier / html-highlighterLinks
Highlight and select phrases in HTML pages.
☆24Updated 6 years ago
Alternatives and similar repositories for html-highlighter
Users that are interested in html-highlighter are comparing it to the libraries listed below
Sorting:
- Index URLs in Common Crawl☆198Updated 8 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆116Updated 9 years ago
- FacetView is a pure javascript frontend for ElasticSearch.☆291Updated 10 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Updated 4 years ago
- Open source large document set visualization platform☆270Updated 3 years ago
- An attempt at creating a gold standard dataset for backtesting yesterday & today's content-extractors☆35Updated 10 years ago
- ☆44Updated 10 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆73Updated 8 years ago
- a pure javascript frontend for ElasticSearch search indices.☆80Updated 7 years ago
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- A cross-platform command line tool for parallelised content extraction and analysis.☆252Updated 2 weeks ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆86Updated 2 years ago
- Traptor -- A distributed Twitter feed☆26Updated 3 years ago
- A tutorial about DBpedia and Linked Data in general☆23Updated 11 years ago
- Version 1.0 of the CrowdTruth Framework for crowdsourcing ground truth data, for training and evaluation of cognitive computing systems. …☆60Updated 7 years ago
- Entity Extraction Text Processor☆149Updated 2 years ago
- Automatically extracts and normalizes an online article or blog post publication date☆118Updated 2 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs).☆93Updated 3 months ago
- SKOS analysis for Elasticsearch☆54Updated 9 years ago
- Extraction Toolkit☆83Updated 4 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆15Updated 8 years ago
- Social Feed Manager user interface application.☆157Updated last year
- NER toolkit for HTML data☆259Updated last year
- Events and Situations Ontology☆14Updated 7 years ago
- Warcbase is an open-source platform for managing analyzing web archives☆162Updated 8 years ago
- Solrstrap is a Query-Result interface for Solr written in JavaScript, HTML and CSS☆87Updated 8 years ago
- General Architecture for Text Engineering☆49Updated 9 years ago
- Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.☆108Updated 10 months ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- ☆185Updated 7 years ago