opensemanticsearch / open-semantic-search
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, …
☆957Updated last year
Related projects: ⓘ
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N…☆254Updated last year
- Language, Knowledge, Cognition☆581Updated last month
- ☆743Updated this week
- The software used to extract structured data from Wikipedia☆850Updated last month
- YAGO is a large semantic knowledge base, derived from Wikipedia, WordNet, WikiData, GeoNames, and other data sources☆725Updated 2 years ago
- Carrot2: Text Clustering Algorithms and Applications☆764Updated last week
- Blazegraph High Performance Graph Database☆887Updated last year
- Textricator is a tool to extract text from documents and generate structured data.☆345Updated 9 months ago
- A curated list of Knowledge Graph related learning materials, databases, tools and other resources☆1,375Updated 3 months ago
- Heuristic based boilerplate removal tool☆717Updated 4 months ago
- PDF to XML ALTO file converter☆209Updated this week
- The low-code Knowledge Graph application platform. Apache license.☆489Updated this week
- INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management.☆587Updated this week
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.☆1,483Updated 5 months ago
- LexNLP by LexPredict☆691Updated 3 months ago
- A curated list of ontology things☆265Updated 7 months ago
- The webprotege code base☆625Updated 6 months ago
- A curated list of various semantic web and linked data resources.☆1,389Updated last week
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆95Updated last year
- Elasticsearch File System Crawler (FS Crawler)☆1,341Updated this week
- Open-source Enterprise Grade Search Engine Software☆499Updated 2 years ago
- Social Network Analysis and Visualization software application.☆204Updated 5 months ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆290Updated 11 months ago
- A curated list of resources for graph databases and graph computing tools☆1,154Updated last year
- A spaCy pipeline and model for NLP on unstructured legal text.☆634Updated 2 months ago
- NeoDash - a Dashboard Builder for Neo4j☆411Updated this week
- ACHE is a web crawler for domain-specific search.☆449Updated last year
- 1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.☆857Updated this week
- Protege Desktop☆998Updated 2 months ago
- Websites crawler with built-in exploration and control web interface☆328Updated 2 weeks ago