Model driven data quality service
☆240Dec 4, 2017Updated 8 years ago
Alternatives and similar repositories for griffin
Users that are interested in griffin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mirror of Apache griffin☆1,171Aug 3, 2025Updated 9 months ago
- Mirror of Apache Eagle☆411Aug 22, 2020Updated 5 years ago
- WebRex is a tool for web resource aggregation and optimization in runtime. In comparison with other open source optimizers like wro4j, it…☆14Jun 19, 2018Updated 7 years ago
- hammal is a framework in which you can extract data from different data sources to different data destinations☆11Nov 1, 2014Updated 11 years ago
- Spark package for checking data quality☆221Feb 28, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A simple Java API and command line interface for importing, managing and retrieving data from HBase.☆51Sep 28, 2014Updated 11 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,263May 19, 2026Updated last week
- Some information about Apache Kylin interaction with Pentaho Mondrian☆326Nov 5, 2015Updated 10 years ago
- Mirror of Apache Falcon☆104Mar 7, 2019Updated 7 years ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,839May 29, 2024Updated 2 years ago
- The premier open source Data Quality solution☆653May 6, 2026Updated 3 weeks ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,369Aug 22, 2023Updated 2 years ago
- High performance data store solution☆1,446May 15, 2026Updated 2 weeks ago
- Mirror of Apache Apex core☆350Jun 7, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An open-source columnar data format designed for fast & realtime analytic with big data.☆450Nov 16, 2022Updated 3 years ago
- DBus☆1,216Dec 6, 2022Updated 3 years ago
- Apache Kylin☆3,767May 15, 2026Updated 2 weeks ago
- SQL-based streaming analytics platform at scale☆1,224Jun 21, 2020Updated 5 years ago
- Wormhole is a SPaaS (Stream Processing as a Service) Platform☆976Nov 16, 2022Updated 3 years ago
- 解析Mysql binlog日志并发至Kafka☆23Nov 25, 2016Updated 9 years ago
- A Kafka audit system☆637Jan 20, 2021Updated 5 years ago
- Stream computing platform for bigdata☆408Apr 24, 2024Updated 2 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆1,112Jan 12, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Stream Data Mining Library for Spark Streaming☆497Apr 16, 2023Updated 3 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces☆316Apr 12, 2022Updated 4 years ago
- Azkaban workflow manager.☆4,508Jul 3, 2024Updated last year
- EZDB provides a nice Java wrapper around LevelDB, RocksDB, and LMDB.☆65Nov 2, 2023Updated 2 years ago
- Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, res…☆814Dec 11, 2024Updated last year
- A light weight, super fast, large scale machine learning library on spark .☆677Mar 23, 2018Updated 8 years ago
- Saiku Analytics - The Worlds Greatest Open Source OLAP Browser☆1,305Updated this week
- ☆12Jul 3, 2019Updated 6 years ago
- 分布式大数据SQL查询引擎,适用于交互式分析查询☆412Jun 13, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Parallelizing Stochastic Gradient Descent for Deep Convolutional Neural Network☆45Apr 8, 2016Updated 10 years ago
- Server side plugin framework for java☆127Dec 16, 2023Updated 2 years ago
- Refactored version for https://github.com/shirdrn/document-processor.git☆15Apr 5, 2017Updated 9 years ago
- Jetstream Esper Processor implementation☆23Aug 28, 2015Updated 10 years ago
- A Spark Reliability Testing Suite☆13Jan 10, 2017Updated 9 years ago
- The Context Platform for your Data and AI Stack☆11,995Updated this week
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,621Updated this week