big-data-europe / docker-spark-notebook
Spark Notebook docker image
☆10 · Updated 7 years ago
Alternatives and similar repositories for docker-spark-notebook
Users who are interested in docker-spark-notebook are comparing it to the repositories listed below.
- [EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a… ☆694 · Updated 4 years ago
- Docker container for Kafka - Spark Streaming - Cassandra ☆98 · Updated 6 years ago
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager) ☆120 · Updated 3 years ago
- Docker build for Apache Spark ☆673 · Updated 3 years ago
- A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python,… ☆209 · Updated 6 years ago
- ☆26 · Updated 2 years ago
- ☆32 · Updated 7 years ago
- Apache Flink docker image ☆195 · Updated 3 years ago
- Examples of Spark 2.0 ☆211 · Updated 3 years ago
- A simple spark standalone cluster for your testing environment purposes ☆572 · Updated last year
- Spark Structured Streaming / Kafka / Cassandra / Elastic ☆183 · Updated 2 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR ☆174 · Updated last month
- Code for docker images ☆39 · Updated 2 years ago
- Docker build for Zeppelin, a web-based Spark notebook ☆221 · Updated 5 years ago
- Apache Spark™ and Scala Workshops ☆264 · Updated 11 months ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka. ☆199 · Updated 7 years ago
- Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra ☆85 · Updated 8 years ago
- ☆75 · Updated 5 years ago
- Examples for High Performance Spark ☆511 · Updated 8 months ago
- Jupyter kernel for scala and spark ☆189 · Updated last year
- Docker image for Apache Spark ☆76 · Updated 5 years ago
- Spark + HDFS cluster using docker compose ☆48 · Updated 6 years ago
- A full big data pipeline (Lambda Architecture) with Spark, Kafka, HDFS and Cassandra. ☆179 · Updated 2 weeks ago
- A boilerplate for writing PySpark Jobs ☆394 · Updated last year
- ☆129 · Updated 8 years ago
- The Internals of Spark Structured Streaming ☆419 · Updated 2 years ago
- ☆247 · Updated 5 years ago
- How to manage Slowly Changing Dimensions with Apache Hive ☆55 · Updated 5 years ago
- spark on kubernetes ☆104 · Updated 2 years ago
- Vagrant project to spin up a single node VM running current versions of Hadoop, Hive and Spark ☆67 · Updated 3 years ago