Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
☆281Oct 9, 2022Updated 3 years ago
Alternatives and similar repositories for open-semantic-etl
Users that are interested in open-semantic-etl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆46Jan 16, 2022Updated 4 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆100Oct 9, 2022Updated 3 years ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interace (UI) for discovery, exploration and visuali…☆93Jan 16, 2020Updated 6 years ago
- Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Tex…☆1,179Apr 19, 2025Updated last year
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e…☆202Oct 9, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Python wrapper for the nascent hypothes.is web API☆11Jan 28, 2026Updated 4 months ago
- SKOS Support for Apache Lucene and Solr☆55May 12, 2021Updated 5 years ago
- Extract Data from Wikipedia Tables☆34Aug 26, 2017Updated 8 years ago
- LDIF - Linked Data Integration Framework☆37Aug 2, 2016Updated 9 years ago
- Solr Relevance Ranking Analysis and Visualization Tool☆15Oct 27, 2019Updated 6 years ago
- Semantic faceted search using SPARQL☆19May 18, 2018Updated 8 years ago
- Trying to generate name synonyms from wikidata☆35Jun 28, 2020Updated 5 years ago
- neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität …☆70Feb 13, 2019Updated 7 years ago
- An RDF plugin for Solr☆114Jan 27, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- SKOS analysis for Elasticsearch☆54Jun 15, 2016Updated 10 years ago
- NLP text recommendation system built in Python using Gensim, spaCy, and Plotly Dash☆15Mar 8, 2018Updated 8 years ago
- EEA ElasticSearch RDF River Plugin☆65Dec 14, 2021Updated 4 years ago
- A project aiming "to significantly advance the state of the art with regard to indexing and querying biomedical data with freely availabl…☆80Feb 17, 2026Updated 4 months ago
- Process, enhance and evaluate multiple OCR output.☆24Dec 2, 2025Updated 6 months ago
- A text tagger based on Lucene / Solr, using FST technology☆176Dec 18, 2023Updated 2 years ago
- Underlay explorer 🧭☆13Jan 7, 2023Updated 3 years ago
- Database to RDF mapping engine and SPARQL server☆325Oct 20, 2019Updated 6 years ago
- lod-explorativ is a prototype of a Svelte webapp which let you explore bibliographic resources from a topic's point of view.☆15Jan 19, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Sep 12, 2021Updated 4 years ago
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.☆204May 25, 2026Updated 3 weeks ago
- The first Open Source document analysis platform☆65Aug 2, 2021Updated 4 years ago
- Specifications of the reconciliation API☆39Nov 10, 2025Updated 7 months ago
- Python scripts for interacting with the hypothes.is API☆49Jun 19, 2017Updated 9 years ago
- A collection of various discourse segmenters☆10Jun 30, 2017Updated 8 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.☆264Jun 10, 2026Updated last week
- 📖 Home for the Data2Services project documentation☆17Jun 17, 2022Updated 4 years ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆26Mar 13, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Panzoom extension for Cytoscape.js☆68Mar 14, 2026Updated 3 months ago
- A quick Elasticsearch/Logstash/Kibana (ELK) 7.x environment to quickly ingest realtime filtered tweets, perform Natural Language Processi…☆16Jun 18, 2024Updated 2 years ago
- This repo stores the code used in the article: "Coinbase API - A Introduction Guide". Code by Igor Radovanovic☆14Nov 25, 2020Updated 5 years ago
- A re-useable, stand-alone version of LittleSis network storytelling tool☆12Jan 30, 2016Updated 10 years ago
- Mirror of Apache Stanbol (incubating)☆117Feb 29, 2024Updated 2 years ago
- A heterogeneous data mapping language based on Shape Expressions☆22Jun 11, 2026Updated last week
- Java Wiktionary Library☆61Nov 19, 2022Updated 3 years ago