eiswar / hadoop-on-k8s
Kubernetes manifest files for building Hadoop clusters
☆9Updated 6 years ago
Alternatives and similar repositories for hadoop-on-k8s:
Users that are interested in hadoop-on-k8s are comparing it to the libraries listed below
- k8s hadoop,在k8s上快速搭建一个hadoop/hbase/hive环境,很早的项目自已用,腾讯tbds培训,以此为基础(多了一个kafka/flink)搭一套环境练习,又捡起来了☆22Updated 4 years ago
- Serializable ACID transactions on streaming data☆24Updated 2 years ago
- Eskimo is a state of the art Big Data Infrastructure and Management Web Console to build, manage and operate Big Data 2.0 Analytics clust…☆26Updated last year
- Spark Connector to read and write with Pulsar☆113Updated 5 months ago
- flink iceberg integration tests, jobs running on yarn.☆38Updated 4 years ago
- Demo code for implementing and showcasing a Fraud Detection Engine with Apache Flink.☆32Updated 2 years ago
- ACL Management for Apache Spark SQL with Apache Ranger☆17Updated 4 years ago
- Docker image for Apache Hive Metastore☆71Updated 2 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 6 years ago
- ☆66Updated 2 years ago
- Helm chart: single-node, pseudo-distributed, kerberized, hadoop cluster: K8S☆19Updated 7 years ago
- A web application for submitting spark application☆8Updated 3 years ago
- Here is my git repo for my Docker files related to Cloudera Hadoop CDH, to start, best is to check the documentation on https://github.co…☆56Updated 6 years ago
- ☆47Updated last year
- ☆39Updated 6 years ago
- A library based on delta for Spark and MLSQL☆61Updated 4 years ago
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆54Updated 3 years ago
- 反应式 海量数据治理平台☆40Updated 4 years ago
- DataTunnel 是一个基于spark引擎的超高性能的分布式数据集成软件,支持海量数据的同步。基于spark extensions 扩展的DSL语法,结合的Spark SQL,更加便捷融入数仓 ETLT 过程中,简单易用。☆24Updated this week
- Apache Flink docker image☆193Updated 2 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated last year
- Ranger Hive Metastore Plugin☆18Updated last year
- This is a library for SQL optimizing/rewriting including Materialized View rewrite☆68Updated 2 years ago
- Java library to integrate Flink and Kudu☆54Updated 7 years ago
- hadoop on kubernetes. It contains the configuration of HDFS and Yarn☆29Updated 7 years ago
- Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers☆69Updated 2 years ago
- Instructions for getting started with Ververica Platform on minikube.☆91Updated 3 months ago
- Import data from clickhouse to hadoop with pure SQL☆36Updated 6 years ago
- Flink native Kubernetes Operator is a java based control plane for running Apache Flink native application on Kubernetes.☆52Updated 2 years ago
- ☆79Updated last year