o19s / pdf-discovery-demo
Demonstration of searching PDF documents with Solr, Tika, and Tesseract
☆32 Updated last year
Alternatives and similar repositories for pdf-discovery-demo
Users who are interested in pdf-discovery-demo are comparing it to the libraries listed below
- Advanced desktop search/corpus exploration prototype ☆21 Updated 4 years ago
- A fast and simple JavaScript library specifically targeted at collecting search and search related browser events. ☆43 Updated 3 weeks ago
- A natural language search microservice ☆96 Updated 4 years ago
- Search Management UI ☆56 Updated 5 months ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services ☆27 Updated last month
- Efficient indexing and retrieval of OCR bounding boxes in Solr ☆22 Updated 6 years ago
- Solr Relevance Ranking Analysis and Visualization Tool ☆15 Updated 6 years ago
- Entity resolution for Elasticsearch. ☆164 Updated 2 months ago
- Solr AutoComplete implementation ☆59 Updated 8 years ago
- Search relevance evaluation toolkit ☆74 Updated 3 years ago
- Extracts a latent knowledge graph from text and index/query it in elasticsearch or solr ☆21 Updated 3 years ago
- TheMovieDB in Solr ☆22 Updated last year
- A text tagger based on Lucene / Solr, using FST technology ☆177 Updated last year
- SOLR bulk indexing utility for the command line. ☆45 Updated 3 weeks ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri ☆47 Updated 3 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate… ☆52 Updated 5 years ago
- Java library for reading and writing WARC files with a typed API ☆51 Updated this week
- Solr Query Segmenter for structuring unstructured queries ☆22 Updated 4 years ago
- Search relevance evaluation toolkit ☆34 Updated 3 years ago
- Query preprocessor for Java-based search engines (Querqy Core and Lucene implementation) ☆189 Updated last week
- Search a single field with different query time analyzers in Solr ☆25 Updated 5 years ago
- Highlighting various OCR formats directly in Solr ☆86 Updated last week
- An HTTP proxy for Elasticsearch, Solr (etc.) to prevent a 100% full disk situation. ☆11 Updated 7 years ago
- The Solr Package Directory and Sanctuary ☆13 Updated last month
- Elasticsearch/Solr Sandbox for exploring explain information and tweaking ☆139 Updated last year
- Solr Dictionary Annotator (Microservice for Spark) ☆71 Updated 5 years ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup ☆70 Updated 6 months ago
- Towards an open source stack for e-commerce search ☆150 Updated 2 months ago
- A set of workflows for corpus building through OCR, post-correction and normalisation ☆49 Updated 3 years ago
- Export SOLR documents efficiently with cursors. ☆38 Updated 2 months ago