apache / submarine
Submarine is Cloud Native Machine Learning Platform.
☆691Updated 5 months ago
Related projects: ⓘ
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆862Updated this week
- Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond☆894Updated this week
- Apache YuniKorn Core☆819Updated this week
- Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow, PyTorch, etc) to enable distributed deep le…☆687Updated 2 weeks ago
- World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.☆921Updated this week
- Scalable, redundant, and distributed object store for Apache Hadoop☆827Updated this week
- TonY is a framework to natively run deep learning frameworks on Apache Hadoop.☆703Updated 11 months ago
- Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.☆838Updated this week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,143Updated this week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,071Updated this week
- 汇总Apache Hudi相关资料☆535Updated this week
- Machine learning library of Apache Flink☆299Updated 5 months ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆780Updated 2 weeks ago
- ☆488Updated last year
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆252Updated last year
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆880Updated this week
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆370Updated this week
- Mirror of Apache griffin☆1,123Updated last week
- Apache Tez☆471Updated this week
- ☆568Updated 10 months ago
- FeatHub - A stream-batch unified feature store for real-time machine learning☆313Updated 3 months ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,830Updated 3 months ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆266Updated last month
- The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are c…☆840Updated last year
- A Spark Atlas connector to track data lineage in Apache Atlas☆264Updated last year
- Apache Atlas☆1,813Updated 2 weeks ago
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and…☆608Updated last week
- Compass is a task diagnosis platform for bigdata☆348Updated last month
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆785Updated this week
- Stream computing platform for bigdata☆404Updated 4 months ago