opensemanticsearch / open-semantic-etl
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
☆254Updated last year
Related projects: ⓘ
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆95Updated last year
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interace (UI) for discovery, exploration and visuali…☆77Updated 4 years ago
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e…☆174Updated last year
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆45Updated 2 years ago
- Download DIG to run on your laptop or server.☆101Updated 5 years ago
- Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java.☆127Updated last year
- A machine learning tool for fishing entities☆239Updated this week
- Federated Knowledge Extraction Framework☆189Updated 10 months ago
- KBPedia Knowledge Graph & Knowledge Ontology (KKO)☆212Updated 4 years ago
- Record Linkage ToolKit (Find and link entities)☆105Updated last year
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆112Updated 8 years ago
- The LKIF Core Ontology of Basic Legal Concepts☆115Updated 11 years ago
- Evolutionary Graph Pattern Learner that learns SPARQL queries for a given set of source-target-pairs from an endpoint.☆83Updated last year
- Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Tex…☆957Updated last year
- LinkedPipes ETL is an RDF based, lightweight ETL tool☆143Updated last month
- This repository has migrated to:☆100Updated last year
- GROBID extension for identifying and normalizing physical quantities.☆72Updated last week
- Trying to generate name synonyms from wikidata☆33Updated 4 years ago
- Silk Linked Data Integration Framework☆240Updated this week
- LexPredict ContraxSuite☆165Updated last year
- Extraction Toolkit☆81Updated 2 years ago
- 💫 REST microservices for various spaCy-related tasks☆240Updated 2 years ago
- Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.☆106Updated 5 months ago
- LexPredict Legal Dictionaries☆107Updated 2 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆73Updated 7 years ago
- Ergonomic line-by-line transcription of scanned text.☆47Updated 3 years ago
- Python library and command-line interface for inspecting and visualizing RDF models aka ontologies.☆220Updated 6 months ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.☆196Updated this week
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆142Updated 7 months ago
- The GATE Embedded core API and GATE Developer application☆76Updated 2 months ago