hrchlhck / k8s-bigdata
Apache Spark with HDFS cluster within Kubernetes
☆11Updated 2 years ago
Alternatives and similar repositories for k8s-bigdata
Users that are interested in k8s-bigdata are comparing it to the libraries listed below
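- k8s hadoop: quickly set up a hadoop/hbase/hive environment on k8s. An old project originally built for my own use; picked up again for Tencent TBDS training, using it as the base (plus kafka/flink) to build a practice environment☆22Updated 4 years ago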
- hadoop on kubernetes. It contains the configuration of HDFS and Yarn☆30Updated 7 years ago
- ☆19Updated 2 years ago
- ☆48Updated 2 years ago
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an…☆112Updated 8 months ago
- Here is my git repo for my Docker files related to Cloudera Hadoop CDH, to start, best is to check the documentation on https://github.co…☆56Updated 7 years ago
- flink iceberg integration tests, jobs running on yarn.☆38Updated 4 years ago
- KDP(Kubernetes Data Platform) delivers a modern, hybrid and cloud-native data platform based on Kubernetes.☆210Updated 8 months ago
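- Argo-based cloud-native scheduling: project management, online notebooks, online image building, drag-and-drop pipeline orchestration, scheduled runs, instance management☆73Updated 2 years ago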
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.☆149Updated last year
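- The simplest production deployment approach for Spark SQL on Kubernetes☆19Updated 2 years ago
- User behavior analysis: user association☆14Updated 5 years ago
- A repository started while following the 尚硅谷 (Atguigu) Flink hands-on videos, recording the source code and some of the required data files; discussion is welcome☆17Updated 4 years ago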
- Stock analysis MLOps system based on DolphinScheduler☆12Updated 3 years ago
- Visual editing and management of Airflow DAGs☆45Updated 3 years ago
- Apache DolphinScheduler Python API, aka PyDolphinscheduler.☆65Updated 6 months ago
- ☆62Updated 2 months ago
- ☆14Updated 3 years ago
- HiveReader for alibaba DataX☆17Updated 2 years ago
- Make data connection easier☆22Updated 3 years ago
- A distributed data factory, providing data access, etl, scheduling. Easily manage tasks such as hive, spark, clickhouse, flink, shell, py…☆33Updated 3 years ago
- CDAP UI☆20Updated last month
- flink endpoint for open world☆28Updated 2 months ago
- A library developed to ease the data ETL development process.☆134Updated last month
- ☆17Updated 6 months ago
- ☆29Updated 2 weeks ago
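- DataTunnel is a very high-performance distributed data integration tool built on the Spark engine that supports synchronizing massive volumes of data. Its DSL syntax, built on Spark extensions and combined with Spark SQL, fits more conveniently into the data warehouse ETLT process and is simple to use.☆35Updated 2 weeks ago
- Low-code platform covering both front-end and back-end low code; Python back-end framework, React front-end framework☆65Updated 3 years ago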
- [Merged into 至轻云 (Zhiqingyun)]☆25Updated 6 months ago
- Apache Flink docker image☆197Updated 3 years ago