opensemanticsearch / open-semantic-search
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, …
☆1,045 · Updated 2 months ago
Alternatives and similar repositories for open-semantic-search
Users who are interested in open-semantic-search are comparing it to the libraries listed below.
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N… ☆268 · Updated 2 years ago
- Websites crawler with built-in exploration and control web interface ☆354 · Updated this week
- Language, Knowledge, Cognition ☆607 · Updated 3 weeks ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interface (UI) for discovery, exploration and visuali… ☆82 · Updated 5 years ago
- A self-hosted search engine for documents. ☆636 · Updated last week
- Carrot2: Text Clustering Algorithms and Applications ☆814 · Updated last month
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆99 · Updated 2 years ago
- The software used to extract structured data from Wikipedia ☆899 · Updated 4 months ago
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e… ☆194 · Updated 2 years ago
- Search and browse documents and data; find the people and companies you look for. ☆2,175 · Updated 2 weeks ago
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework. ☆115 · Updated 3 weeks ago
- News crawling with StormCrawler - stores content as WARC ☆348 · Updated 4 months ago
- brozzler - distributed browser-based web crawler ☆720 · Updated 2 weeks ago
- Open-source Enterprise Grade Search Engine Software ☆507 · Updated 2 years ago
- A machine learning tool for fishing entities ☆264 · Updated last month
- A cross-platform command line tool for parallelised content extraction and analysis. ☆245 · Updated 3 weeks ago
- Blazegraph High Performance Graph Database ☆942 · Updated 2 years ago
- The low-code Knowledge Graph application platform. Apache license. ☆542 · Updated last week
- INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management. ☆639 · Updated this week
- Core Python Web Archiving Toolkit for replay and recording of web archives ☆1,517 · Updated last month
- 🏖TagEditor - Annotation tool for spaCy ☆192 · Updated 2 years ago
- Lightweight web scraping toolkit for documents and structured data. ☆312 · Updated last year
- Run a high-fidelity browser-based web archiving crawler in a single Docker container ☆812 · Updated last week
- Heuristic based boilerplate removal tool ☆785 · Updated 4 months ago
- Streaming WARC/ARC library for fast web archive IO ☆416 · Updated 6 months ago
- A curated list of resources for graph databases and graph computing tools ☆1,217 · Updated 2 years ago
- Software that makes labeling PDFs easy. ☆415 · Updated last year
- 🦆 Contextually-keyed word vectors ☆1,654 · Updated 2 months ago
- spaCy REST API, wrapped in a Docker container. ☆267 · Updated 2 years ago
- 🪐 End-to-end NLP workflows from prototype to production ☆1,389 · Updated 8 months ago