adsabs / montysolr
Solr for Astrophysics Data System
☆55Updated last week
Alternatives and similar repositories for montysolr
Users who are interested in montysolr are comparing it to the libraries listed below.
- Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading p…☆142Updated 3 years ago
- [not maintained] Custom Twitter Search via ElasticSearch & Wicket☆60Updated 4 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆283Updated 7 years ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆159Updated 2 years ago
- SIREn - Semi-Structured Information Retrieval Engine☆108Updated 4 years ago
- A Python wrapper for Cascading☆222Updated 5 years ago
- Zohmg is a data store for aggregation of multi-dimensional time series data, built on top of Hadoop, Dumbo and HBase.☆173Updated 12 years ago
- Bulk loading for Elasticsearch☆185Updated last year
- Elasticsearch Index Termlist☆118Updated 6 years ago
- A Lazy Data Flow Framework (no longer active - see Apache TinkerPop)☆279Updated 4 years ago
- Machine learning and natural language processing with Apache Pig☆53Updated 11 years ago
- Search a single field with different query time analyzers in Solr☆25Updated 5 years ago
- An experiment in visualizing your Solr index via term counts, document counts, and memory usage per field and data type.☆15Updated 10 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- A very memory-efficient trie (radix tree) implementation☆47Updated 13 years ago
- Java implementation of a probabilistic set data structure☆144Updated 8 years ago
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆27Updated 6 years ago
- Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi☆41Updated 15 years ago
- Use Solr clients/tools with ElasticSearch☆77Updated 12 years ago
- A distributed task queue worker designed for throughput, parallelism, and clustering.☆238Updated 2 years ago
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format)☆65Updated 9 years ago
- iSAX Indexing persisted in HBase☆39Updated 14 years ago
- Implementation of Tyler Neylon's Locality-Specific Hash based on simplex tessellations☆28Updated 13 years ago
- ☆33Updated 6 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 9 years ago
- A REST API for Mozilla Metrics services.☆57Updated 6 years ago
- Common Crawl support library to access 2008-2012 crawl archives (ARC files)☆500Updated 7 years ago
- Realtime Analytics☆68Updated 12 years ago
- Some utilities for Lucene☆111Updated 12 years ago
- A library that adds some NLP capabilities to the Lucene search engine☆50Updated 12 years ago