A library based on delta for Spark and MLSQL
☆60 Dec 24, 2020 Updated 5 years ago
Alternatives and similar repositories for delta-plus
Users who are interested in delta-plus compare it to the libraries listed below.
- A library for querying Binlog with Apache Spark Structured Streaming, for Spark SQL, DataFrames and [MLSQL](https://www.mlsql.tech). ☆152 Apr 21, 2023 Updated 2 years ago
- ☆13 Jun 17, 2022 Updated 3 years ago
- Big data smart alarm by sql ☆12 May 11, 2021 Updated 4 years ago
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A… ☆49 Apr 21, 2023 Updated 2 years ago
- ☆22 Jun 21, 2022 Updated 3 years ago
- This is a library for SQL optimizing/rewriting including Materialized View rewrite ☆69 Jun 21, 2022 Updated 3 years ago
- A library based on Hudi for Spark. ☆10 Nov 30, 2021 Updated 4 years ago
- An HBase datasource implementation for Spark and [MLSQL](http://www.mlsql.tech). ☆15 Sep 29, 2023 Updated 2 years ago
- ☆19 Jun 16, 2021 Updated 4 years ago
- hudi-spark-utilities-plus ☆11 Jul 29, 2022 Updated 3 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark ☆35 Jul 9, 2025 Updated 8 months ago
- Wormhole is a SPaaS (Stream Processing as a Service) Platform ☆976 Nov 16, 2022 Updated 3 years ago
- Byzer (formerly MLSQL): A low-code open-source programming language for data pipeline, analytics and AI. ☆1,842 May 29, 2024 Updated last year
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa… ☆182 Apr 6, 2022 Updated 3 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table ☆23 May 7, 2018 Updated 7 years ago
- Example Application to consume a Twitter stream with Neo4j ☆11 Mar 16, 2017 Updated 9 years ago
- Stream computing platform for big data ☆407 Apr 24, 2024 Updated last year
- Spark Structured Streaming Kafka 0.8 Source Implementation ☆35 Apr 27, 2017 Updated 8 years ago
- Chinese-language Spark study notes ☆13 Mar 26, 2019 Updated 6 years ago
- Intelligent Data Exploration Service: a one-stop Data + AI data solution! ☆35 Jul 10, 2023 Updated 2 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark. ☆136 Mar 6, 2023 Updated 3 years ago
- ☆235 Sep 15, 2022 Updated 3 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 Sep 8, 2022 Updated 3 years ago
- Moonbox is a DVtaaS (Data Virtualization as a Service) Platform ☆506 Apr 14, 2023 Updated 2 years ago
- an open source dataworks platform ☆21 Jun 4, 2021 Updated 4 years ago
- Real-time synchronization of incremental MySQL data to HDFS/Hive ☆11 Jul 24, 2018 Updated 7 years ago
- Apache CarbonData Learning ☆53 Mar 5, 2020 Updated 6 years ago
- Source code written for and explained in a recorded Spark video course: https://edu.hellobi.com/course/107/overview ☆13 Apr 23, 2019 Updated 6 years ago
- Flink parcel for Cloudera Manager ☆22 Aug 1, 2019 Updated 6 years ago
- My Blog ☆76 May 3, 2018 Updated 7 years ago
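- A Spark self-study handbook covering Spark Core, Spark SQL, Spark Streaming, spark-kafka, and delta-lake, plus basic Scala exercises and source-code analyses (for example master and shuffle), summaries, and translations. ☆18 Jul 19, 2023 Updated 2 years ago
- A simple, easy-to-use ETL tool ☆17 Mar 28, 2019 Updated 6 years ago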
- flinksql-platform ☆19 Mar 22, 2021 Updated 5 years ago
- A Spark scaffolding project that standardizes the Spark development, deployment, and testing workflow. ☆94 Oct 10, 2024 Updated last year
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,369 Aug 22, 2023 Updated 2 years ago
- Study Notes ☆56 Sep 23, 2019 Updated 6 years ago
- ☆17 Mar 19, 2024 Updated 2 years ago
- A RPC framework leveraging Spark RPC module ☆209 Mar 13, 2019 Updated 7 years ago
- Mirror of Apache griffin ☆1,171 Aug 3, 2025 Updated 7 months ago