hrchlhck / k8s-bigdata
Apache Spark with HDFS cluster within Kubernetes
☆12Updated last year
Alternatives and similar repositories for k8s-bigdata:
Users that are interested in k8s-bigdata are comparing it to the libraries listed below
- k8s hadoop,在k8s上快速搭建一个hadoop/hbase/hive环境,很早的项目自已用,腾讯tbds培训,以此为基础(多了一个kafka/flink)搭一套环境练习,又捡起来了☆21Updated 3 years ago
- hadoop on kubernetes. It contains the configuration of HDFS and Yarn☆29Updated 6 years ago
- ☆19Updated last year
- 基于argo的云原生调度,项目管理,在线notebook,在线镜像构建,拖拉拽编排pipeline,定时调度,实例管理☆66Updated last year
- ☆23Updated 2 years ago
- flink rest api的spring-boot-starter☆17Updated last year
- 最简单的 spark sql on kubernetes 生产环境部署方案☆18Updated last year
- ☆61Updated last month
- flink iceberg integration tests, jobs running on yarn.☆38Updated 3 years ago
- ☆28Updated 3 years ago
- ☆41Updated this week
- ☆14Updated 2 years ago
- 通过观看尚硅谷的Flink实战视频,开了一个仓库,记录源码和一些所需要的数据文件,也欢迎大家积极讨论☆17Updated 4 years ago
- 用户行为分析-用户关联☆14Updated 4 years ago
- 基于Flink的批流处理实战案例☆37Updated last year
- 数据仓库实战:Hive、HBase、Kylin、ClickHouse☆21Updated 5 months ago
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆31Updated 4 years ago
- 此项目主要应用于数据中台或数据平台的数据总线,支持直接实时监听MySQL、MongoDB、PostgreSQL、Oracle、SQL Server、Db2和Cassandra等数据库的数据变更。☆62Updated last year
- Helm chart from stable/hadoop, updated to hadoop 3.2.1☆22Updated 5 years ago
- DataX分布式集群与负载均衡、任务执行/统计,基于DataX的通用数据同步微服务,一个Restful接口搞定所有通用数据同步☆42Updated 4 years ago
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an…☆105Updated 2 months ago
- Apache StreamPark quickstart☆69Updated last month
- 自助搭建的 hadoop + spark + kafka + zookeeper + storm + hbase + hive + flume 集群,一主两从。☆30Updated 6 years ago
- Here is my git repo for my Docker files related to Cloudera Hadoop CDH, to start, best is to check the documentation on https://github.co…☆56Updated 6 years ago
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.☆139Updated 5 months ago
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.☆18Updated 9 months ago
- DOP是一个基于蓝鲸智云开发的数据管理工具,旨在简化各类大数据组件的日常运维操作、降低使用门槛、提高运维效率,目前支持Elasticsearch、Kafka、Hadoop。☆14Updated 2 years ago
- Stock analysis MLOps system based on DolphinScheduler☆12Updated 2 years ago
- Apache DolphinScheduler Python API, aka PyDolphinscheduler.☆54Updated last month
- EOI数据中台产品☆30Updated 2 years ago