hyeonsangjeon / dataplatform
Hadoop 3.2 in single-node or cluster mode with the gotty web terminal, Spark, Jupyter PySpark, Hive, and other ecosystem components.
☆11 Updated 6 years ago
Alternatives and similar repositories for dataplatform
Users who are interested in dataplatform are comparing it to the repositories listed below
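- k8s hadoop: quickly set up a hadoop/hbase/hive environment on k8s. An old project for personal use; after Tencent TBDS training, I picked it up again as the basis for a practice environment (with kafka/flink added) ☆22 Updated 4 years ago
- [Yiche (易车)] Spark, flink, HBase, Hive, and flume, with demos of some native Hadoop APIs (currently just HDFS and MapReduce), plus tests of some exception-related behavior ☆16 Updated 6 years ago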
- Fast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox) ☆22 Updated 2 years ago
- HokStack - Run Hadoop Stack on Kubernetes ☆25 Updated 5 years ago
- Data analysis using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. These are my daily notes, code, and demos. Don't fork, just star! ☆37 Updated last year
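- DataX distributed clustering and load balancing, task execution and statistics; a general-purpose data synchronization microservice based on DataX, with a single RESTful interface that handles all general data synchronization ☆43 Updated 5 years ago
- Deploy Apache Doris (incubating) (formerly Baidu Palo) on K8S ☆12 Updated 6 years ago
- Hands-on learning of common big data tools, including Hadoop, HBase, Kafka, ClickHouse, Hive, Redis, Zookeeper... ☆23 Updated 3 years ago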
- ns4_chatbox is a communication component that integrates qqbot, wxchat, rasa, and Web Services ☆52 Updated 3 years ago
- Data quality control system with embedded AI ☆49 Updated 4 years ago
- kafka connector plugin that accepts mysql binlog and json input and writes it to ClickHouse. Continuously updated ☆44 Updated 5 years ago
- User behavior log analysis system based on Flink ☆24 Updated 5 years ago
- hadoop on kubernetes. It contains the configuration of HDFS and Yarn ☆30 Updated 7 years ago
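- Automated big data deployment, including automated deployment of hadoop, hive, hbase, spark, storm, and a range of other components ☆71 Updated 7 years ago
- SuperBI is an enterprise-grade rapid BI application developed by 达闼科技 (CloudMinds) on top of the open source project superset. Its extensible framework design supports multiple DBMS data sources and makes data BI simpler. superbi provides an intuitive UI, drag-and-drop editing, and configuration-based chart creation, making it easy to build data visualization dashboards. ☆48 Updated 4 years ago
- Cloud-native scheduling based on argo: project management, online notebooks, online image building, drag-and-drop pipeline orchestration, scheduled runs, instance management ☆74 Updated 2 years ago
- DataX is an offline data synchronization tool/platform widely used within Alibaba Group. It provides efficient data synchronization between heterogeneous data sources, including MySQL, Oracle, HDFS, Hive, OceanBase, HBase, OTS, and ODPS. ☆23 Updated 4 years ago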
- Yarn on Docker - Managing Hadoop Yarn cluster with Docker Swarm. ☆37 Updated 4 years ago
- A small tool for syncing Hive data warehouse data to Elasticsearch ☆21 Updated 8 years ago
- compare elastic and clickhouse ☆24 Updated 4 years ago
- Deploy a simple Multi-Node Clickhouse Cluster with docker-compose in minutes. ☆17 Updated 4 years ago
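- Building an IoT platform from scratch ☆26 Updated 8 years ago
- A record of Docker-compose files I have used ☆23 Updated last year
- Self-built hadoop + spark + kafka + zookeeper + storm + hbase + hive + flume cluster, with one master and two workers. ☆31 Updated 7 years ago
- docker-hadoop-spark-hive: quickly build your big data environment ☆21 Updated 6 years ago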
- Here is my git repo for my Docker files related to Cloudera Hadoop CDH, to start, best is to check the documentation on https://github.co… ☆56 Updated 7 years ago
- Apache Spark Docker Image ☆68 Updated 7 years ago
- Kernel for Kubeflow in Jupyter Notebook ☆65 Updated 6 years ago
- Learning big data components, including dataflow, spring cloud stream, elasticsearch, flink, spark, kafka, phoenix, Hive, and Hbase ☆22 Updated 3 years ago
- An example of using OpenTelemetry Collector in Java ☆12 Updated 2 years ago