eBay / griffin
Model driven data quality service
☆240Updated 7 years ago
Alternatives and similar repositories for griffin:
Users that are interested in griffin are comparing it to the libraries listed below
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces☆320Updated 2 years ago
- Hadoop Job for schemaless incremental loading of messages from Kafka topics onto hdfs with configurable output partitioning.☆90Updated 8 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆283Updated 6 years ago
- Mirror of Apache Atlas (Incubating)☆94Updated last year
- Some information about Apache Kylin interaction with Pentaho Mondrian☆329Updated 9 years ago
- SparkOnHBase☆279Updated 3 years ago
- Plugins for Azkaban.☆131Updated 6 years ago
- StreamLine - Streaming Analytics☆164Updated last year
- spark summit 2017 SanFrancisco☆97Updated 7 years ago
- ☆121Updated 3 weeks ago
- This repository trackes the code and files for building docker image with Apache Kylin.☆126Updated 3 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas☆267Updated 2 years ago
- Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN.☆417Updated last year
- Hive UDFs for funnel analysis☆83Updated 2 years ago
- Mirror of Apache Eagle☆411Updated 4 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆159Updated 2 years ago
- This code base is retained for historical interest only, please visit Apache Incubator Repo for latest one☆560Updated 2 years ago
- MySQL-like queries for Druid built on top of Plywood☆147Updated 5 years ago
- Mirror of Apache Falcon☆103Updated 6 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…☆173Updated 2 years ago
- An open-source columnar data format designed for fast & realtime analytic with big data.☆453Updated 2 years ago
- Unified SQL Analytics Engine Based on SparkSQL☆210Updated 2 years ago
- ☆76Updated 11 years ago
- ☆77Updated 6 years ago
- Stream computing platform for bigdata☆401Updated 11 months ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆153Updated last year
- Mirror of Apache Sentry☆120Updated 4 years ago
- A Maven-based example of using Cloudera Impala's JDBC driver☆118Updated 8 years ago
- Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.☆167Updated last year
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆283Updated 6 years ago