Segence / docker-hadoopLinks
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
☆67Updated 5 years ago
Alternatives and similar repositories for docker-hadoop
Users that are interested in docker-hadoop are comparing it to the libraries listed below
Sorting:
- Multiple node cluster on Docker for self development.☆93Updated 7 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆183Updated 2 years ago
- Docker build for Zeppelin, a web-based Spark notebook☆221Updated 5 years ago
- ☆240Updated 3 years ago
- Apache Flink docker image☆195Updated 3 years ago
- [EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a…☆694Updated 4 years ago
- The Internals of Spark Structured Streaming☆419Updated 2 years ago
- Examples of Spark 2.0☆211Updated 3 years ago
- Dockerized HDP Cluster☆84Updated 7 years ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆199Updated 7 years ago
- Serverless proxy for Spark cluster☆326Updated 4 years ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆98Updated 5 years ago
- spark on kubernetes☆104Updated 2 years ago
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE☆292Updated 2 years ago
- Docker image for main Apache Hadoop components (Yarn/Hdfs)☆56Updated 2 years ago
- Docker build for Apache Spark☆673Updated 3 years ago
- Docker image for Apache Spark☆76Updated 5 years ago
- Docker image with Ambari☆291Updated 7 years ago
- StreamSets Tutorials☆350Updated 11 months ago
- Docker Cloudera Quick Start Image☆93Updated 7 years ago
- Simple examle for Spark Streaming over Kafka topic☆106Updated 4 years ago
- The Internals of Apache Kafka☆131Updated 2 years ago
- ☆105Updated 5 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆183Updated 2 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆83Updated 5 years ago
- The Internals of Delta Lake☆184Updated 6 months ago
- StreamLine - Streaming Analytics☆164Updated last year
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆242Updated 10 years ago
- Mirror of Apache Bahir☆336Updated 2 years ago
- A modern real-time streaming application serving as a reference framework for developing a big data pipeline, complete with a broad range…☆42Updated 5 years ago