opensemanticsearch / open-semantic-etl
Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (entity extraction and named entity recognition), and data enrichment (annotation) pipelines, plus an ingestor to a Solr or Elasticsearch index and a linked data graph database
☆265 Updated 2 years ago
Alternatives and similar repositories for open-semantic-etl:
Users who are interested in open-semantic-etl are comparing it to the libraries listed below.
- Python/Django-based web apps and web user interfaces for search, structure (metadata management like thesaurus, ontologies, annotations a… ☆95 Updated 2 years ago
- Open source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e… ☆189 Updated 2 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) ontologies & SKOS thesauri ☆46 Updated 3 years ago
- Open Semantic Visual Linked Data Graph Explorer: open source tool (web app) and user interface (UI) for discovery, exploration and visuali… ☆83 Updated 5 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements. ☆113 Updated 8 years ago
- The GATE Embedded core API and GATE Developer application ☆82 Updated 2 months ago
- Federated Knowledge Extraction Framework ☆191 Updated last year
- Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java. ☆131 Updated last year
- Download DIG to run on your laptop or server. ☆101 Updated 6 years ago
- KBPedia Knowledge Graph & Knowledge Ontology (KKO) ☆220 Updated 4 years ago
- Trying to generate name synonyms from Wikidata ☆32 Updated 4 years ago
- Record Linkage ToolKit (Find and link entities) ☆107 Updated last year
- UMBEL (Upper Mapping and Binding Exchange Layer) ☆101 Updated last year
- A machine learning tool for fishing entities ☆255 Updated last week
- GROBID extension for identifying and normalizing physical quantities. ☆77 Updated 4 months ago
- The LKIF Core Ontology of Basic Legal Concepts ☆116 Updated 11 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders… ☆46 Updated 3 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums. ☆210 Updated last week
- Evolutionary Graph Pattern Learner that learns SPARQL queries for a given set of source-target-pairs from an endpoint. ☆85 Updated 2 years ago
- LexPredict ContraxSuite ☆167 Updated last year
- General Architecture for Text Engineering ☆46 Updated 8 years ago
- Open Semantic Search Appliance (VM) ☆12 Updated 4 years ago
- A Python library to detect and extract listing data from HTML pages. ☆109 Updated 7 years ago
- LexPredict Legal Dictionaries ☆114 Updated 2 years ago
- Solr Relevance Ranking Analysis and Visualization Tool ☆17 Updated 5 years ago
- Extract Data from Wikipedia Tables ☆33 Updated 7 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆148 Updated this week
- A set of workflows for corpus building through OCR, post-correction and normalisation ☆48 Updated 2 years ago
- Python library for information extraction of quantities from unstructured text ☆120 Updated last year
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG. ☆105 Updated 2 years ago