opensemanticsearch / open-semantic-search
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, …
☆1,100 Updated 7 months ago
Alternatives and similar repositories for open-semantic-search
Users interested in open-semantic-search are comparing it to the libraries listed below.
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N… ☆275 Updated 3 years ago
- Carrot2: Text Clustering Algorithms and Applications ☆835 Updated this week
- Textricator is a tool to extract text from documents and generate structured data. ☆350 Updated 8 months ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based ☆329 Updated 2 years ago
- PDF to XML ALTO file converter ☆257 Updated 2 weeks ago
- LexNLP by LexPredict ☆754 Updated last year
- The low-code Knowledge Graph application platform. Apache license. ☆578 Updated last week
- Information Integration Tool ☆603 Updated 7 months ago
- INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management. ☆668 Updated this week
- Content ExtRactor and MINEr ☆509 Updated 3 years ago
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip! ☆299 Updated 6 months ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interface (UI) for discovery, exploration and visuali… ☆87 Updated 5 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆99 Updated 3 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums. ☆245 Updated last week
- A cross-platform command line tool for parallelised content extraction and analysis. ☆247 Updated last month
- Language, Knowledge, Cognition ☆621 Updated 3 months ago
- A self‑hosted search engine for documents ☆668 Updated this week
- Social Network Analysis and Visualization software application. ☆239 Updated 6 months ago
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e… ☆197 Updated 3 years ago
- A web-based document annotation tool, powered by GPT-4 ☆265 Updated last year
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework. ☆131 Updated 2 weeks ago
- Software that makes labeling PDFs easy. ☆421 Updated last year
- A basic tool that extracts the structure from the PDF files of scientific articles. ☆76 Updated 3 years ago
- 🏖TagEditor - Annotation tool for spaCy ☆193 Updated 3 years ago
- ☆113 Updated last week
- Find legal citations in any block of text ☆182 Updated last month
- A list of memex-related tools and their repository URLs ☆156 Updated 7 years ago
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community. ☆1,633 Updated 7 months ago
- A curated list of ontology things ☆449 Updated 2 months ago
- A list of selected resources, methods, and tools dedicated to legal data schemes and ontologies. ☆141 Updated last year