opensemanticsearch / open-semantic-search
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, …
☆1,081 · Updated 5 months ago
Alternatives and similar repositories for open-semantic-search
Users who are interested in open-semantic-search are comparing it to the libraries listed below.
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N… ☆273 · Updated 3 years ago
- Carrot2: Text Clustering Algorithms and Applications ☆829 · Updated 2 weeks ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based ☆327 · Updated last year
- The software used to extract structured data from Wikipedia ☆907 · Updated this week
- INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management. ☆661 · Updated this week
- LexNLP by LexPredict ☆748 · Updated last year
- A self‑hosted search engine for documents ☆660 · Updated this week
- Find legal citations in any block of text ☆174 · Updated last week
- The webprotege code base ☆691 · Updated last year
- A list of memex-related tools and their repository URLs ☆153 · Updated 7 years ago
- YAGO is a large semantic knowledge base, derived from Wikipedia, WordNet, WikiData, GeoNames, and other data sources ☆741 · Updated 3 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆98 · Updated 3 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums. ☆237 · Updated last week
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e… ☆196 · Updated 3 years ago
- Textricator is a tool to extract text from documents and generate structured data. ☆350 · Updated 6 months ago
- Language, Knowledge, Cognition ☆617 · Updated 2 months ago
- A list of selected resources, methods, and tools dedicated to legal data schemes and ontologies. ☆133 · Updated last year
- The low-code Knowledge Graph application platform. Apache license. ☆569 · Updated this week
- PDF to XML ALTO file converter ☆253 · Updated last month
- Information Integration Tool ☆601 · Updated 5 months ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how? ☆526 · Updated 11 months ago
- A machine learning tool for fishing entities ☆264 · Updated 4 months ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interface (UI) for discovery, exploration and visuali… ☆86 · Updated 5 years ago
- extract text from any document. no muss. no fuss. ☆4,321 · Updated 10 months ago
- ACHE is a web crawler for domain-specific search. ☆473 · Updated last month
- A curated list of ontology things ☆436 · Updated last month
- LexPredict Legal Dictionaries ☆127 · Updated 3 years ago
- Open-source Enterprise Grade Search Engine Software ☆511 · Updated 3 years ago
- A cross-platform command line tool for parallelised content extraction and analysis. ☆249 · Updated this week
- Run a high-fidelity browser-based web archiving crawler in a single Docker container ☆888 · Updated this week