opensemanticsearch / open-semantic-etl
Python-based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elasticsearch index & linked data graph database
☆266Updated 2 years ago
Alternatives and similar repositories for open-semantic-etl:
Users who are interested in open-semantic-etl are comparing it to the libraries listed below
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆96Updated 2 years ago
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e…☆194Updated 2 years ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interface (UI) for discovery, exploration and visuali…☆83Updated 5 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Updated 3 years ago
- Record Linkage ToolKit (Find and link entities)☆109Updated last year
- Trying to generate name synonyms from wikidata☆32Updated 4 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.☆217Updated this week
- A machine learning tool for fishing entities☆261Updated last week
- Download DIG to run on your laptop or server.☆101Updated 6 years ago
- Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java.☆131Updated last year
- The GATE Embedded core API and GATE Developer application☆82Updated 3 months ago
- Graph NLU is a natural language understanding tool that leverages the power of graph databases☆84Updated 7 years ago
- Python library for information extraction of quantities from unstructured text☆119Updated last year
- Solr Relevance Ranking Analysis and Visualization Tool☆17Updated 5 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆45Updated 3 years ago
- Federated Knowledge Extraction Framework☆191Updated last year
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.☆105Updated 2 years ago
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery☆55Updated 7 months ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆148Updated last month
- Information Integration Tool☆592Updated 11 months ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆113Updated 8 years ago
- KBPedia Knowledge Graph & Knowledge Ontology (KKO)☆221Updated 4 years ago
- 🍊 Text Mining add-on for Orange3☆130Updated last week
- Open Semantic Search Appliance (VM)☆12Updated 4 years ago
- 💫 REST microservices for various spaCy-related tasks☆240Updated 2 years ago
- Ergonomic line-by-line transcription of scanned text.☆51Updated 4 years ago
- ☆214Updated 2 years ago
- Textpipe: clean and extract metadata from text☆302Updated 3 years ago
- The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.☆142Updated last year