mitre / rhapsodeLinks
Advanced desktop search/corpus exploration prototype
☆21Updated 4 years ago
Alternatives and similar repositories for rhapsode
Users that are interested in rhapsode are comparing it to the libraries listed below
Sorting:
- Search relevance evaluation toolkit☆73Updated 3 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Updated 6 years ago
- Demonstration of searching PDF document with Solr, Tika, and Tesseract☆31Updated 10 months ago
- A fast and simple JavaScript library specifically targeted at collecting search and search related browser events.☆42Updated last year
- Standalone versions of LUCENE_5205 and other patches: SpanQueryParser, Concordance and Co-occurrence stats☆18Updated 4 years ago
- 📦 The Knowledge Box - A data dependency management framework to help users to publish, find and install data models☆47Updated last month
- An HTTP proxy for Elasticsearch, Solr (etc.) to prevent a 100% full disk situation.☆11Updated 6 years ago
- The Solr Package Directory and Sanctuary☆13Updated last month
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆38Updated last year
- Search a single field with different query time analyzers in Solr☆25Updated 5 years ago
- Wikidata embedding☆51Updated 9 months ago
- Common web archive utility code.☆56Updated last month
- A set of workflows for corpus building through OCR, post-correction and normalisation☆50Updated 2 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- Federated Knowledge Extraction Framework☆193Updated last year
- Java library for reading and writing WARC files with a typed API☆50Updated last month
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework.☆131Updated last month
- Semantic Web related concepts converted to Natural language☆44Updated 8 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- Solr Query Segmenter for structuring unstructured queries☆22Updated 4 years ago
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in t…☆128Updated last month
- Trying to generate name synonyms from wikidata☆32Updated 5 years ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆98Updated 2 months ago
- Homebase of the IPTC EXTRA project about rule-based text categorization☆13Updated 8 years ago
- Solr Relevance Ranking Analysis and Visualization Tool☆17Updated 5 years ago
- Multi Tier Annotation Search☆12Updated last year
- Github mirror of "wikidata/query/rdf" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆151Updated 2 weeks ago
- The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.☆145Updated last year
- Open-Source Information Retrieval Reproducibility Challenge☆50Updated 9 years ago