Aiden-Dong / spark-source-code-analyze
spark 源码解读, 基于2.4.0
☆19Updated last year
Alternatives and similar repositories for spark-source-code-analyze:
Users that are interested in spark-source-code-analyze are comparing it to the libraries listed below
- ☆190Updated 3 years ago
- A data integration framework☆4,045Updated last month
- Flink CDC is a streaming data integration tool☆6,036Updated this week
- Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch …☆2,755Updated this week
- 这是我自己的Flink中文社区翻译稿存储仓库,用于提供给需要朋友进行二次创作。同时提供Flink一些课外的相关知识文档供大家学习☆370Updated 4 months ago
- 该仓库专注 于让读者秒懂Flink组件,包含Flink实战代码和文档、200个Flink教程知识点,Flink Datastream、Flink Table、Flink Window、Flink State、Flink Checkpoint、Flink Metrics、Fli…☆727Updated 10 months ago
- Fluss is a streaming storage built for real-time analytics.☆1,109Updated this week
- ☆455Updated 2 years ago
- 基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法☆2,046Updated last year
- Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.☆954Updated this week
- Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.☆3,401Updated this week
- 基于flink的实时流计算web平台☆1,834Updated 7 months ago
- CloudEon uses Kubernetes to install and deploy open-source big data components, enabling the containerized operation of an open-source bi…☆457Updated last month
- 汇总Apache Hudi相关资料☆550Updated this week
- Apache InLong - a one-stop, full-scenario integration framework for massive data☆1,432Updated this week
- The next generation of cloud-native big data management expert , Aims to help users rapidly build stable, efficient, and scalable cloud-n…☆1,200Updated 8 months ago
- ☆202Updated last week
- Flink 中文视频课程(持续更新...)☆4,582Updated 4 years ago
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆947Updated this week
- The java sdk for operating Apache Dolphinscheduler.☆67Updated 2 months ago
- Flink Connector for Apache Doris☆343Updated this week
- QLExpress is a powerful, lightweight, dynamic language for the Java platform aimed at improving developers’ productivity in different bus…☆5,025Updated this week
- Make stream processing easier! Easy-to-use streaming application development framework and operation platform.☆4,049Updated 2 weeks ago
- RocketMQ integration for Apache Flink. This module includes the RocketMQ source and sink that allows a flink job to either write messages…☆152Updated last month
- 懒松鼠Flink-Boot 脚手架让Flink全面拥抱Spring生态体系,使得开发者可以以Java WEB开发模式开发出分布式运行的流处理程序,懒松鼠让跨界变得更加简单。懒松鼠旨在让开发者以更底上手成本(不需要理解分布式计算的理论知识和Flink框架的细节)便可以快速编写…☆808Updated 11 months ago
- DataX集成可视化页面,选择数据源即可一键生成数据同步任务,支持RDBMS、Hive、HBase、ClickHouse、MongoDB等数据源,批量创建RDBMS数据同步任务,集成开源调度系统,支持分布式、增量同步数据、实时查看运行日志、监控执行器资源、KILL运行进程、…☆5,747Updated 10 months ago
- Flink SQL connector for ClickHouse. Support ClickHouseCatalog and read/write primary data, maps, arrays to clickhouse.☆389Updated this week
- A fast and versatile ETL tool that can transfer data between RDBMS and NoSQL seamlessly☆1,262Updated this week
- SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offlin…☆672Updated last week
- datax-kuduwriter☆11Updated last year