edbullen / DockerSpark245
Spark cluster in docker containers with sample training Jupyter notebooks
☆27Updated 2 years ago
Alternatives and similar repositories for DockerSpark245:
Users that are interested in DockerSpark245 are comparing it to the libraries listed below
- ☆36Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflow☆47Updated 2 years ago
- Docker with Airflow and Spark standalone cluster☆255Updated last year
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆33Updated 4 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Updated 4 years ago
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand☆49Updated last year
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆112Updated 2 weeks ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆23Updated 3 years ago
- The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆63Updated 2 years ago
- ☆87Updated 2 years ago
- SQL Queries & Alerts for Databricks System Tables access.audit Logs☆26Updated 6 months ago
- Delta Lake examples☆224Updated 6 months ago
- trino + hive + minio with postgres in docker compose☆21Updated last year
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆91Updated last month
- ☆86Updated 2 months ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆488Updated 2 years ago
- This repo contains a spark standalone cluster on docker for anyone who wants to play with PySpark by submitting their applications.☆34Updated last year
- Code samples, etc. for Databricks☆63Updated 3 weeks ago
- Examples surrounding Databricks.☆57Updated 9 months ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆60Updated last year
- Resources for video demonstrations and blog posts related to DataOps on AWS☆175Updated 3 years ago
- This project is for demonstrating knowledge of Data Engineering tools and concepts and also learning in the process☆46Updated 2 years ago
- Guide for databricks spark certification☆58Updated 3 years ago
- ☆47Updated last year
- ☆50Updated last year
- ☆43Updated 3 years ago
- Writes the CSV file to Postgres, read table and modify it. Write more tables to Postgres with Airflow.☆35Updated last year
- Notebooks to learn Databricks Lakehouse Platform☆22Updated 2 weeks ago
- ☆28Updated last year
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆41Updated last year