rockthejvm / spark-cluster-dockerLinks
☆10Updated 4 years ago
Alternatives and similar repositories for spark-cluster-docker
Users that are interested in spark-cluster-docker are comparing it to the libraries listed below
Sorting:
- The official repository for the Rock the JVM Spark Optimization with Scala course☆58Updated 2 years ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆42Updated 2 years ago
- Spark Examples☆126Updated 3 years ago
- Source code for the "Scala For Beginners" book. https://leanpub.com/scalaforbeginners/☆13Updated 6 years ago
- The official repository for the Rock the JVM Spark Essentials with Scala course☆278Updated 4 months ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 5 years ago
- Source code examples for the Second Edition of the Scala Cookbook☆47Updated 3 years ago
- An example project that implements a data pipeline using Scala, Akka, and Spark and works with document-oriented and graph databases to l…☆11Updated 6 years ago
- Apache Spark Course Material☆96Updated 2 years ago
- A Python PySpark Projet with Poetry☆24Updated 6 months ago
- Scala Programming Projects, published by Packt☆84Updated 3 years ago
- The official repository for the Rock the JVM Spark Streaming course☆19Updated 2 years ago
- Exercises for the "Functional Programming Principles in Scala", part of the FP in Scala specialized program by EPFL.☆165Updated last year
- Delta lake and filesystem helper methods☆50Updated last year
- The official repository for the Scala & Functional Programming Practice course☆84Updated last year
- For Udemy students: the official repository of Rock the JVM's Spark Streaming course☆26Updated 3 years ago
- Flowchart for debugging Spark applications☆106Updated last year
- Code snippets used in demos recorded for the blog.☆37Updated 2 weeks ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated 2 years ago
- Spark DataFrame transformation and UDF test examples☆22Updated 2 years ago
- A tool to validate data, built around Apache Spark.☆100Updated this week
- Building Big Data Pipelines with Apache Beam, published by Packt☆89Updated 2 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆61Updated 2 years ago
- ☆65Updated last year
- An ETL framework in Scala for Data Engineers☆23Updated 3 years ago
- Apache Spark 3 - Structured Streaming Course Material☆46Updated 5 years ago
- Data quality control tool built on spark and deequ☆25Updated last week
- The official repository for the Rock the JVM Flink course☆31Updated last month
- type-class based data cleansing library for Apache Spark SQL☆78Updated 6 years ago
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)☆120Updated 4 years ago