opensemanticsearch / open-semantic-etl
Python-based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elasticsearch index & linked data graph database
☆271Updated 2 years ago
Alternatives and similar repositories for open-semantic-etl
Users who are interested in open-semantic-etl are comparing it to the libraries listed below.
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆99Updated 2 years ago
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e…☆195Updated 2 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Updated 3 years ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interface (UI) for discovery, exploration and visuali…☆86Updated 5 years ago
- Record Linkage ToolKit (Find and link entities)☆110Updated 2 years ago
- Download DIG to run on your laptop or server.☆104Updated 6 years ago
- Trying to generate name synonyms from wikidata☆34Updated 5 years ago
- Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java.☆133Updated 2 years ago
- LexPredict Legal Dictionaries☆124Updated 3 years ago
- LexPredict ContraxSuite☆174Updated 2 years ago
- Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.☆108Updated 5 months ago
- The GATE Embedded core API and GATE Developer application☆88Updated 10 months ago
- 🍊 Text Mining add-on for Orange3☆131Updated last week
- A cross-platform command line tool for parallelised content extraction and analysis.☆249Updated last month
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆46Updated 3 years ago
- Federated Knowledge Extraction Framework☆193Updated last year
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆116Updated 9 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆73Updated 8 years ago
- Evolutionary Graph Pattern Learner that learns SPARQL queries for a given set of source-target-pairs from an endpoint.☆91Updated 2 years ago
- This page is a companion for the paper titled Towards Automatic Structuring and Semantic Indexing of Legal Documents☆29Updated 6 years ago
- UMBEL (Upper Mapping and Binding Exchange Layer)☆101Updated last year
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- Now included in rigour☆151Updated last week
- Solr Relevance Ranking Analysis and Visualization Tool☆15Updated 5 years ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- 📦 The Knowledge Box - A data dependency management framework to help users to publish, find and install data models☆47Updated 2 months ago
- A Python library to detect and extract listing data from an HTML page.☆108Updated 8 years ago
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery☆57Updated last year
- Ergonomic line-by-line transcription of scanned text.☆54Updated 4 years ago
- KBPedia Knowledge Graph & Knowledge Ontology (KKO)☆227Updated 5 years ago