s1mplecc / spark-hadoop-docker
☆139Updated 2 years ago
Alternatives and similar repositories for spark-hadoop-docker:
Users that are interested in spark-hadoop-docker are comparing it to the libraries listed below
- A Hadoop cluster based on Docker, including Hive and Spark.☆77Updated 2 years ago
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.☆138Updated 4 months ago
- 一个实时数仓项目,从0到1搭建实时数仓☆55Updated 3 years ago
- 电商平台数据仓库搭建☆125Updated 2 years ago
- 大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse☆229Updated last month
- Docker images for building hadoop3.2, hive 3.1, hbase2.3, presto 0.247, flink1.11.3 on yarn, etc.☆30Updated last year
- ☆126Updated 3 years ago
- FlinkSQL数据脱敏和行级权限解决方案及源码,支持面向用户级别的数据脱敏和行级数据访问控制,即特定用户只能访问到脱敏后的数据或授权过的行。此方案是实时领域Flink的解决方案,类似于离线数仓Hive Ranger中的Row-level Filter和Column Mas…☆131Updated last year
- 从本地IDEA提交Flink/Spark任务到Yarn/k8s集群☆161Updated 3 years ago
- Apache StreamPark quickstart☆68Updated this week
- 基于 Flink 的 sqlSubmit 程序☆144Updated 10 months ago
- Flink SQL connector for ClickHouse. Support ClickHouseCatalog and read/write primary data, maps, arrays to clickhouse.☆377Updated this week
- 大数据组件 All-in-One 的 Dockerfile☆92Updated 2 months ago
- 基 于Flink流处理的动态实时亿级全端用户画像系统☆474Updated 2 years ago
- Demo: Build End-to-End Streaming Application using Flink SQL☆255Updated 2 years ago
- Flink Tutorial Project☆197Updated 6 months ago
- The Lineage Analysis system for FlinkSQL supports advanced syntax such as Watermark, UDTF, CEP, Windowing TVFs, and CTAS.☆375Updated 8 months ago
- 分享一些在工作中的大数据实战案例,包括flink、kafka、hadoop、presto等等。欢迎大家关注我的公众号【大数据技术与应用实战】,一起成长。☆263Updated 11 months ago
- 数据治理、数据质量检核/监控平台(Django+jQuery+MySQL)☆183Updated 2 years ago
- 基于antlr4的sql解析,实现格式化,元数据,血源等自定义解析,包括hive☆110Updated 2 years ago
- 本 GitHub 项目是 Flink Forward Asia Hackathon (2021) 的投票专用项目。☆122Updated 3 years ago
- Using Flink SQL to build ETL job☆200Updated last year
- Stock analysis MLOps system based on DolphinScheduler☆12Updated 2 years ago
- ☆195Updated 2 weeks ago
- jdbc2 datasource suport DUPLICATE KEY incrment☆18Updated 4 years ago
- 基于Docker构建的Hadoop开发测试环境,包含Hadoop,Hive,HBase,Spark☆301Updated 5 years ago
- flink简易使用教程,结合官方仓库的example样例,结合常见场景,使用flink的基本功能☆111Updated 2 years ago
- flink 集成CDH5的自定义paracels☆71Updated 2 years ago
- dataService platform is a low-code platform, which only needs to write SQL to realize the development of API services, solve the unificat…☆110Updated last year
- ☆186Updated 3 years ago