lucidworks / data-quality
Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities
☆26 · Updated 4 months ago
Alternatives and similar repositories for data-quality
Users that are interested in data-quality are comparing it to the libraries listed below
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services ☆26 · Updated 3 months ago
- Ambari stack service for easily installing and managing Solr on HDP cluster ☆19 · Updated 6 years ago
- VoltDB Click Stream Processing Example. ☆16 · Updated 7 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases ☆13 · Updated 9 years ago
- Mirror of Apache MetaModel Membrane ☆16 · Updated 6 years ago
- Toolkit that can bundle any Spring Boot application into an Apache Ambari Service, enabling Ambari to provision, manage and monitor the s… ☆13 · Updated 9 years ago
- Storm / Solr Integration ☆19 · Updated last year
- Solr Redis Extensions ☆53 · Updated last year
- A template-based cluster provisioning system ☆61 · Updated 2 years ago
- Examples of user defined functions for Apache Drill ☆18 · Updated 8 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆41 · Updated 2 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase ☆14 · Updated 9 years ago
- The next generation of open source search ☆92 · Updated 8 years ago
- Chorus, now for Elasticsearch! ☆16 · Updated last year
- Visualization of result returning by Solr 6 graph query ☆10 · Updated 9 years ago
- A java library for stored queries ☆16 · Updated last year
- Ambari View for the Ambari Store ☆15 · Updated 9 years ago
- Cascading on Apache Flink® ☆54 · Updated last year
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ. ☆32 · Updated 2 years ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem ☆12 · Updated 4 months ago
- solr-logstash ☆43 · Updated 9 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm ☆21 · Updated 4 years ago
- phData Pulse application log aggregation and monitoring ☆13 · Updated 5 years ago
- ☆9 · Updated 9 years ago
- A collection of datasets and databases ☆24 · Updated 7 years ago
- ElasticSearch plugin to watch segment dynamics (additions, merges, deletes) ☆136 · Updated 9 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆29 · Updated 9 years ago
- sql interface for solr cloud ☆40 · Updated 2 years ago
- Distributed Dexecutor Using Ignite ☆10 · Updated 7 years ago
- Provides a Pythonic interface for reading and writing Avro schemas ☆27 · Updated 2 years ago