big-data-europe / docker-zeppelinLinks
☆26Updated 2 years ago
Alternatives and similar repositories for docker-zeppelin
Users that are interested in docker-zeppelin are comparing it to the libraries listed below
Sorting:
- A simple spark standalone cluster for your testing environment purposses☆572Updated last year
- ☆32Updated 7 years ago
- [EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a…☆694Updated 4 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆174Updated last month
- Vagrant project to spin up a single node VM running current versions of Hadoop, Hive and Spark☆67Updated 3 years ago
- Postgresql configured to work as metastore for Hive.☆32Updated 2 years ago
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)☆120Updated 3 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆495Updated 2 years ago
- Apache Flink docker image☆195Updated 3 years ago
- Code for docker images☆39Updated 2 years ago
- spark on kubernetes☆104Updated 2 years ago
- Spark Notebook docker image☆10Updated 7 years ago
- Oozie Workflow to Airflow DAGs migration tool☆87Updated 4 months ago
- Multi-container environment with Hadoop, Spark and Hive☆217Updated 2 months ago
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.☆142Updated last year
- Docker image for main Apache Hadoop components (Yarn/Hdfs)☆56Updated 2 years ago
- General README for the Big Data Europe project's sources☆83Updated last year
- A Spark cluster setup running on Docker containers☆60Updated 5 years ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 3 years ago
- Docker image for Apache Hive Metastore☆71Updated 2 years ago
- ☆252Updated 2 years ago
- A boilerplate for writing PySpark Jobs☆394Updated last year
- Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide)☆226Updated 2 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆587Updated last year
- The Internals of Spark SQL☆469Updated last week
- Jupyter kernel for scala and spark☆189Updated last year
- Ansible roles to install an Spark Standalone cluster (HDFS/Spark/Jupyter Notebook) or Ambari based Spark cluster☆61Updated last year
- A collection of templates for use with Apache NiFi.☆279Updated 8 years ago
- StreamSets Tutorials☆350Updated 11 months ago
- Apache Airflow CI pipeline☆19Updated 6 years ago