lucidworks / data-quality
Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities
☆26Updated 2 months ago
Alternatives and similar repositories for data-quality:
Users that are interested in data-quality are comparing it to the libraries listed below
- Ambari View for the Ambari Store☆15Updated 9 years ago
- Katta - distributed Lucene☆60Updated 11 years ago
- Cascading on Apache Flink®☆54Updated last year
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases☆13Updated 9 years ago
- Chorus, now for Elasticsearch!☆16Updated 9 months ago
- Ambari stack service for easily installing and managing Solr on HDP cluster☆19Updated 6 years ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services☆26Updated last month
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- The next generation of open source search☆91Updated 7 years ago
- A collection of datasets and databases☆24Updated 6 years ago
- A template-based cluster provisioning system☆61Updated 2 years ago
- solr-logstash☆43Updated 9 years ago
- Connect DBVisualizer to Hortonwork HiveServer2☆9Updated 10 years ago
- Toolkit that can bundle any Spring Boot application into an Apache Ambari Service, enabling Ambari to provision, manage and monitor the s…☆13Updated 9 years ago
- Visualization of result returning by Solr 6 graph query☆10Updated 8 years ago
- A java library for stored queries☆16Updated last year
- Storm / Solr Integration☆19Updated last year
- Sample custom Nifi processor to process tcpdump☆18Updated 9 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- ElasticSearch plugin to watch segment dynamics (additions, merges, deletes)☆136Updated 8 years ago
- ☆20Updated 2 years ago
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- Examples of user defined functions for Apache Drill☆18Updated 7 years ago
- Code to index HDFS to Solr using MapReduce☆52Updated 6 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago
- Distributed processing framework for search solutions☆81Updated 2 years ago
- Solr Redis Extensions☆52Updated last year
- Distributed Dexecutor Using Ignite☆10Updated 7 years ago
- Java code for Apache Nifi processors☆11Updated 7 years ago