opensemanticsearch / open-semantic-search
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, …
☆992 Updated last year
Alternatives and similar repositories for open-semantic-search:
Users who are interested in open-semantic-search compare it to the libraries listed below.
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N… ☆265 Updated 2 years ago
- Ambar: Document Search Engine ☆1,946 Updated 3 years ago
- Carrot2: Text Clustering Algorithms and Applications ☆790 Updated 3 months ago
- Textricator is a tool to extract text from documents and generate structured data. ☆347 Updated 2 months ago
- Language, Knowledge, Cognition ☆586 Updated 2 months ago
- A self-hosted search engine for documents. ☆606 Updated this week
- PDF to XML ALTO file converter ☆222 Updated last week
- INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management. ☆605 Updated this week
- Open-source Enterprise Grade Search Engine Software ☆503 Updated 2 years ago
- Blazegraph High Performance Graph Database ☆909 Updated last year
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based ☆306 Updated last year
- Information Integration Tool ☆591 Updated 9 months ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆95 Updated 2 years ago
- 🏖TagEditor - Annotation tool for spaCy ☆189 Updated 2 years ago
- YAGO is a large semantic knowledge base, derived from Wikipedia, WordNet, WikiData, GeoNames, and other data sources ☆733 Updated 2 years ago
- A web-based document annotation tool, powered by GPT-4 ☆256 Updated last year
- Data model and processing tools for investigative entity data ☆222 Updated this week
- A cross-platform command line tool for parallelised content extraction and analysis. ☆242 Updated last month
- LexNLP by LexPredict ☆706 Updated 7 months ago
- A multilingual, cross-domain temporal tagger developed at the Database Systems Research Group at Heidelberg University. ☆344 Updated last year
- A set of tools to allow PDF to XML conversion, utilising Apache Beam and other tools. The aim of this project is to bring multiple tools… ☆293 Updated 2 years ago
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip! ☆279 Updated 11 months ago
- Structr is an integrated low-code development and runtime environment that uses a graph database. ☆791 Updated this week
- Lightweight web scraping toolkit for documents and structured data. ☆310 Updated last year
- The software used to extract structured data from Wikipedia ☆870 Updated last week
- Heuristic based boilerplate removal tool ☆744 Updated 8 months ago
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e… ☆188 Updated 2 years ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interface (UI) for discovery, exploration and visuali… ☆83 Updated 5 years ago
- Arcade Analytics is the first Open Source Graph Analytics platform. Connect your Graph Database (Neo4j, OrientDB, Amazon Neptune, Microso… ☆189 Updated 2 years ago