opensemanticsearch / open-semantic-etl
Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (entity extraction and named entity recognition), and data enrichment (annotation) pipelines, plus an ingestor for a Solr or Elasticsearch index and a linked data graph database
☆277 · Oct 9, 2022 · Updated 3 years ago
Alternatives and similar repositories for open-semantic-etl
Users who are interested in open-semantic-etl are comparing it to the libraries listed below.
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri ☆47 · Jan 16, 2022 · Updated 4 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆99 · Oct 9, 2022 · Updated 3 years ago
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e… ☆198 · Oct 9, 2022 · Updated 3 years ago
- Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Tex… ☆1,131 · Apr 19, 2025 · Updated 9 months ago
- Solr client and user interface for search ☆22 · Apr 25, 2024 · Updated last year
- Efficient indexing and retrieval of OCR bounding boxes in Solr ☆22 · Mar 13, 2019 · Updated 6 years ago
- SKOS Support for Apache Lucene and Solr ☆56 · May 12, 2021 · Updated 4 years ago
- A Python wrapper for the nascent hypothes.is web API ☆11 · Jan 28, 2026 · Updated 2 weeks ago
- Django SKOS-XL Thesaurus manager ☆13 · Oct 18, 2021 · Updated 4 years ago
- Extract Data from Wikipedia Tables ☆34 · Aug 26, 2017 · Updated 8 years ago
- Solr Relevance Ranking Analysis and Visualization Tool ☆15 · Oct 27, 2019 · Updated 6 years ago
- FoGFaaS: Add serverless computing (faas) to ifogsim ☆22 · Mar 30, 2025 · Updated 10 months ago
- React component for rendering RDF graphs and datasets using n3.js and cytoscape.js ☆10 · Nov 8, 2021 · Updated 4 years ago
- Extract Data from Wikipedia Lists ☆31 · Aug 27, 2017 · Updated 8 years ago
- An RDF plugin for Solr ☆114 · Jan 27, 2025 · Updated last year
- LDIF - Linked Data Integration Framework ☆37 · Aug 2, 2016 · Updated 9 years ago
- Semantic faceted search using SPARQL ☆19 · May 18, 2018 · Updated 7 years ago
- NLP text recommendation system built in Python using Gensim, spaCy, and Plotly Dash ☆15 · Mar 8, 2018 · Updated 7 years ago
- This repo stores the code used in the article: "Coinbase API - A Introduction Guide". Code by Igor Radovanovic ☆13 · Nov 25, 2020 · Updated 5 years ago
- How can we improve name matching in screening tools? ☆15 · Aug 13, 2025 · Updated 6 months ago
- A text mining tool for developing visual and interactive relationship networks from PubMed article information. ☆16 · Aug 1, 2024 · Updated last year
- 📖 Home for the Data2Services project documentation ☆17 · Jun 17, 2022 · Updated 3 years ago
- GitHub repo explorer app built with reactivesearch ☆37 · Dec 7, 2022 · Updated 3 years ago
- EEA ElasticSearch RDF River Plugin ☆64 · Dec 14, 2021 · Updated 4 years ago
- Trying to generate name synonyms from wikidata ☆35 · Jun 28, 2020 · Updated 5 years ago
- neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität … ☆69 · Feb 13, 2019 · Updated 7 years ago
- Mirror of Apache Stanbol (incubating) ☆116 · Feb 29, 2024 · Updated last year
- A project aiming "to significantly advance the state of the art with regard to indexing and querying biomedical data with freely availabl… ☆79 · Jul 30, 2024 · Updated last year
- SKOS analysis for Elasticsearch ☆54 · Jun 15, 2016 · Updated 9 years ago
- Cross-domain temporal information extractors: temporal expressions, events and temporal links. ☆21 · Oct 29, 2015 · Updated 10 years ago
- A text tagger based on Lucene / Solr, using FST technology ☆177 · Dec 18, 2023 · Updated 2 years ago
- Open Access PDF harvester ☆42 · May 3, 2024 · Updated last year
- The first Open Source document analysis platform ☆65 · Aug 2, 2021 · Updated 4 years ago
- ☆35 · Dec 19, 2025 · Updated last month
- European Parliament website Python scraper ☆12 · Oct 19, 2016 · Updated 9 years ago
- OSoMe API mashups ☆11 · Jan 29, 2019 · Updated 7 years ago
- Process, enhance and evaluate multiple OCR output. ☆24 · Dec 2, 2025 · Updated 2 months ago
- Database to RDF mapping engine and SPARQL server ☆323 · Oct 20, 2019 · Updated 6 years ago
- Firefox and Chrome compatible extension that acts as annotation tool for websites (Named Entity Recognition) ☆10 · Feb 17, 2019 · Updated 7 years ago