Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, …
☆1,140Apr 19, 2025Updated 10 months ago
Alternatives and similar repositories for open-semantic-search
Users that are interested in open-semantic-search are comparing it to the libraries listed below
Sorting:
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆99Oct 9, 2022Updated 3 years ago
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N…☆279Oct 9, 2022Updated 3 years ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interace (UI) for discovery, exploration and visuali…☆89Jan 16, 2020Updated 6 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Jan 16, 2022Updated 4 years ago
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e…☆200Oct 9, 2022Updated 3 years ago
- Ambar: Document Search Engine☆1,957Aug 26, 2021Updated 4 years ago
- Solr client and user interface for search☆22Apr 25, 2024Updated last year
- Solr Relevance Ranking Analysis and Visualization Tool☆15Oct 27, 2019Updated 6 years ago
- Hypertext-infused personal research productivity/database software (Mac/Win/Linux)☆175Mar 3, 2026Updated last week
- SKOS Support for Apache Lucene and Solr☆56May 12, 2021Updated 4 years ago
- A self‑hosted search engine for documents☆710Updated this week
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆10Dec 3, 2024Updated last year
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,266Updated this week
- The low-code Knowledge Graph application platform. Apache license.☆601Feb 28, 2026Updated last week
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…☆24,436Updated this week
- Search and browse documents and data; find the people and companies you look for.☆2,331Feb 20, 2026Updated 2 weeks ago
- Elasticsearch File System Crawler (FS Crawler)☆1,434Updated this week
- Django SKOS-XL Thesaurus manager☆13Oct 18, 2021Updated 4 years ago
- OPUS (opus.nlpl.eu) Python3 API☆18Nov 23, 2024Updated last year
- A browser extension providing Open Access bibliographical services☆18Dec 9, 2022Updated 3 years ago
- A curated list of various semantic web and linked data resources.☆1,613Updated this week
- Scientific articles using or citing Common Crawl data☆28Jan 9, 2026Updated 2 months ago
- This is a AUTOSAR documents specific retriever based on LLM and RAG.☆16Nov 12, 2024Updated last year
- A Comprehensive survey on business use cases of AI that help them thrive in the digital economy☆13Oct 7, 2020Updated 5 years ago
- Blacklight provides a discovery interface for any Solr (http://lucene.apache.org/solr) index.☆783Updated this week
- 💠 An index for linked open data & standard knowledge descriptions (ontologies, vocabularies, shapes, queries, mappings)☆45Nov 6, 2023Updated 2 years ago
- Fess is very powerful and easily deployable Enterprise Search Server.☆1,094Updated this week
- Language, Knowledge, Cognition☆632Updated this week
- Software for creating all the OpenCitations Indexes (e.g. COCI)☆15Updated this week
- OpenRefine is a free, open source power tool for working with messy data and improving it☆11,771Updated this week
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc…☆15,741Updated this week
- GitHub page for the TextBundle Markdown/text specification☆24Jul 30, 2014Updated 11 years ago
- BlackLab Frontend, a feature-rich corpus search interface for BlackLab.☆23Updated this week
- Tools for analysing and visualising activity around Twitter backchannels☆26Nov 10, 2012Updated 13 years ago
- Syllabus for EDCT GE 2550☆16Oct 3, 2019Updated 6 years ago
- Functional composable pipelines allowing clean separation of the business logic and its implementation☆11Sep 6, 2025Updated 6 months ago
- Consider is a parser for the ThinkGear protocol used by NeuroSky devices (MindSet, BrainBand and others).☆16Apr 3, 2012Updated 13 years ago
- ☆14Apr 26, 2025Updated 10 months ago
- Course materials for PSYC 11: Laboratory in Psychological Science, Dartmouth College (Instructor: Jeremy Manning)☆39Jun 9, 2025Updated 9 months ago