opensemanticsearch / open-semantic-etl
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
☆262Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for open-semantic-etl
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆95Updated 2 years ago
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e…☆181Updated 2 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆46Updated 2 years ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interace (UI) for discovery, exploration and visuali…☆78Updated 4 years ago
- Download DIG to run on your laptop or server.☆101Updated 5 years ago
- Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java.☆129Updated last year
- Federated Knowledge Extraction Framework☆191Updated last year
- KBPedia Knowledge Graph & Knowledge Ontology (KKO)☆217Updated 4 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆112Updated 8 years ago
- Python library for information extraction of quantities from unstructured text☆121Updated last year
- Trying to generate name synonyms from wikidata☆33Updated 4 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.☆204Updated this week
- Evolutionary Graph Pattern Learner that learns SPARQL queries for a given set of source-target-pairs from an endpoint.☆85Updated last year
- LexPredict Legal Dictionaries☆111Updated 2 years ago
- Extraction Toolkit☆81Updated 3 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.☆105Updated last year
- tool for collectively summarizing large discussions☆143Updated last year
- The LKIF Core Ontology of Basic Legal Concepts☆115Updated 11 years ago
- ☆25Updated 8 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆46Updated 2 years ago
- Record Linkage ToolKit (Find and link entities)☆106Updated last year
- A python client for connecting to all the services provided by https://dandelion.eu☆36Updated last year
- A collection of simple tutorials for using Fonduer☆100Updated 4 years ago
- Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.☆107Updated 7 months ago
- LinkedPipes ETL is an RDF based, lightweight ETL tool☆148Updated 2 months ago
- LexPredict ContraxSuite☆167Updated last year
- A python library detect and extract listing data from HTML page.☆109Updated 7 years ago
- A knowledge base construction engine for richly formatted data☆410Updated 3 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆145Updated this week
- Graph NLU is a natural language understanding tool that leverages the power of graph databases☆84Updated 6 years ago