spark-examples / spark-scala-examplesLinks
This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language
☆566Updated last year
Alternatives and similar repositories for spark-scala-examples
Users that are interested in spark-scala-examples are comparing it to the libraries listed below
Sorting:
- Spark Examples☆125Updated 3 years ago
- The Internals of Spark SQL☆469Updated last week
- ☆310Updated 6 years ago
- A Spark plugin for reading and writing Excel files☆504Updated last week
- The Internals of Spark Structured Streaming☆419Updated 2 years ago
- Databricks - Apache Spark™ - 2X Certified Developer☆266Updated 4 years ago
- The official repository for the Rock the JVM Spark Essentials with Scala course☆272Updated 5 months ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆216Updated 2 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆587Updated last year
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆447Updated 2 weeks ago
- Spark style guide☆258Updated 9 months ago
- Stream Processing with Apache Flink - Scala Examples☆405Updated last year
- Essential Spark extensions and helper methods ✨😲☆761Updated last week
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆199Updated 7 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆183Updated 2 years ago
- Apache Spark Course Material☆95Updated 2 years ago
- Scala examples for learning to use Spark☆445Updated 4 years ago
- Docker multi-nodes Hadoop cluster with Spark 2.4.1 on Yarn☆51Updated 4 years ago
- Apache Spark 3 - Structured Streaming Course Material☆45Updated 4 years ago
- Apache Spark™ and Scala Workshops☆264Updated 11 months ago
- Examples for High Performance Spark☆511Updated 8 months ago
- Examples of Spark 3.0☆47Updated 4 years ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆769Updated last month
- The official repository for the Rock the JVM Spark Optimization with Scala course☆58Updated last year
- The Internals of Delta Lake☆184Updated 6 months ago
- Multi-container environment with Hadoop, Spark and Hive☆217Updated 2 months ago
- ETL pipeline using pyspark (Spark - Python)☆117Updated 5 years ago
- Qubole Sparklens tool for performance tuning Apache Spark☆579Updated last year
- A simple Spark-powered ETL framework that just works 🍺☆181Updated 2 weeks ago
- The Internals of Apache Spark☆1,507Updated last week