eBay/griffin

Model driven data quality service

☆239

Alternatives and similar repositories for griffin

Users who are interested in griffin are comparing it to the repositories listed below.

apache / griffin
Mirror of Apache griffin
☆1,172 · Aug 3, 2025 · Updated 11 months ago
apache / eagle
Mirror of Apache Eagle
☆411 · Aug 22, 2020 · Updated 5 years ago
eBay / WebRex
WebRex is a tool for web resource aggregation and optimization in runtime. In comparison with other open source optimizers like wro4j, it…
☆14 · Jun 19, 2018 · Updated 8 years ago
yfwangpeng / hammal
hammal is a framework in which you can extract data from different data sources to different data destinations
☆11 · Nov 1, 2014 · Updated 11 years ago
KylinOLAP / Kylin
This code base is retained for historical interest only, please visit Apache Incubator Repo for latest one
☆559 · Oct 5, 2022 · Updated 3 years ago
FRosner / drunken-data-quality
Spark package for checking data quality
☆220 · Feb 28, 2020 · Updated 6 years ago
mustangore / kylin-mondrian-interaction
Some information about Apache Kylin interaction with Pentaho Mondrian
☆326 · Nov 5, 2015 · Updated 10 years ago
apache / falcon
Mirror of Apache Falcon
☆104 · Mar 7, 2019 · Updated 7 years ago
byzer-org / byzer-lang
Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.
☆1,835 · May 29, 2024 · Updated 2 years ago
datacleaner / DataCleaner
The premier open source Data Quality solution
☆651 · Jun 30, 2026 · Updated last month
linkedin / dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
☆1,370 · Aug 22, 2023 · Updated 2 years ago
apache / kylin
Apache Kylin
☆3,771 · Jul 16, 2026 · Updated last week
apache / carbondata
High performance data store solution
☆1,448 · Jul 4, 2026 · Updated 3 weeks ago
apache / apex-core
Mirror of Apache Apex core
☆350 · Jun 7, 2021 · Updated 5 years ago
shunfei / indexr
An open-source columnar data format designed for fast & realtime analytic with big data.
☆447 · Nov 16, 2022 · Updated 3 years ago
BriData / DBus
DBus
☆1,214 · Dec 6, 2022 · Updated 3 years ago
uber-archive / AthenaX
SQL-based streaming analytics platform at scale
☆1,223 · Jun 21, 2020 · Updated 6 years ago
edp963 / wormhole
Wormhole is a SPaaS (Stream Processing as a Service) Platform
☆975 · Nov 16, 2022 · Updated 3 years ago
guofei1219 / BinlogAnalysis
Parses MySQL binlog logs and sends them to Kafka
☆23 · Nov 25, 2016 · Updated 9 years ago
Teradata / kylo
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…
☆1,111 · Jan 12, 2023 · Updated 3 years ago
harbby / sylph
Stream computing platform for bigdata
☆406 · Apr 24, 2024 · Updated 2 years ago
uber-archive / chaperone
A Kafka audit system
☆635 · Jan 20, 2021 · Updated 5 years ago
Ctrip-DI / Hue-Ctrip-DI
Ctrip Data Infrastructure team works for hue
☆16 · Dec 10, 2014 · Updated 11 years ago
Huawei-Spark / Spark-SQL-on-HBase
Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces
☆316 · Apr 12, 2022 · Updated 4 years ago
azkaban / azkaban
Azkaban workflow manager.
☆4,508 · Jul 3, 2024 · Updated 2 years ago
criccomini / ezdb
EZDB provides a nice Java wrapper around LevelDB, RocksDB, and LMDB.
☆65 · Nov 2, 2023 · Updated 2 years ago
huawei-noah / streamDM
Stream Data Mining Library for Spark Streaming
☆497 · Apr 16, 2023 · Updated 3 years ago
WeBankFinTech / Scriptis
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, res…
☆813 · Dec 11, 2024 · Updated last year
openbigdatagroup / speedo
Parallelizing Stochastic Gradient Descent for Deep Convolutional Neural Network
☆45 · Apr 8, 2016 · Updated 10 years ago
CHINA-JD / presto
Distributed big data SQL query engine, suited to interactive analytic queries
☆408 · Jun 13, 2016 · Updated 10 years ago
spiculedata / saiku
Open-source semantic layer: one cube for Excel (MDX/XMLA), dashboards, and AI agents (MCP). Mondrian + Apache Calcite.
☆1,313 · Updated this week
sunlet / jplugin
Server side plugin framework for java
☆127 · Dec 16, 2023 · Updated 2 years ago
TalkingData / Fregata
A light weight, super fast, large scale machine learning library on spark.
☆676 · Mar 23, 2018 · Updated 8 years ago
shirdrn / libsvm-dp
Refactored version for https://github.com/shirdrn/document-processor.git
☆15 · Apr 5, 2017 · Updated 9 years ago
pulsarIO / jetstream-esper
Jetstream Esper Processor implementation
☆23 · Aug 28, 2015 · Updated 10 years ago
datahub-project / datahub
The Context Platform for your Data and AI Stack
☆12,369 · Updated this week
apache / zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
☆6,648 · Updated this week
uavorg / uavstack
UAVStack Open Source All in One Repository
☆717 · Dec 16, 2022 · Updated 3 years ago
apache / incubator-heron
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
☆3,629 · Mar 1, 2023 · Updated 3 years ago