opensemanticsearch / open-semantic-search
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, …
☆1,040Updated last month
Alternatives and similar repositories for open-semantic-search
Users who are interested in open-semantic-search are comparing it to the libraries listed below.
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N…☆268Updated 2 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆97Updated 2 years ago
- Carrot2: Text Clustering Algorithms and Applications☆811Updated 2 weeks ago
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e…☆194Updated 2 years ago
- Websites crawler with built-in exploration and control web interface☆352Updated 2 weeks ago
- Textricator is a tool to extract text from documents and generate structured data.☆345Updated 2 months ago
- The low-code Knowledge Graph application platform. Apache license.☆542Updated this week
- Blazegraph High Performance Graph Database☆938Updated 2 years ago
- A self-hosted search engine for documents.☆634Updated this week
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.☆1,589Updated last month
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆319Updated last year
- Language, Knowledge, Cognition☆605Updated this week
- YAGO is a large semantic knowledge base, derived from Wikipedia, WordNet, WikiData, GeoNames, and other data sources☆736Updated 2 years ago
- The software used to extract structured data from Wikipedia☆897Updated 3 months ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interface (UI) for discovery, exploration and visuali…☆83Updated 5 years ago
- Information Integration Tool☆600Updated last month
- Ambar: Document Search Engine☆1,949Updated 3 years ago
- A cross-platform command line tool for parallelised content extraction and analysis.☆245Updated last week
- A web-based document annotation tool, powered by GPT-4☆260Updated last year
- The webprotege code base☆658Updated last year
- Software that makes labeling PDFs easy.☆416Updated last year
- 🦆 Contextually-keyed word vectors☆1,653Updated last month
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,176Updated 10 months ago
- INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management.☆636Updated this week
- A machine learning tool for fishing entities☆264Updated 2 weeks ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.☆228Updated this week
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆523Updated 7 months ago
- A post-processing tool for scanned sheets of paper.☆1,080Updated 10 months ago
- PDF to XML ALTO file converter☆240Updated last week
- Silk Linked Data Integration Framework☆249Updated this week