spark-examples / spark-scala-examples
This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language
☆561Updated 11 months ago
Alternatives and similar repositories for spark-scala-examples:
Users that are interested in spark-scala-examples are comparing it to the libraries listed below
- Spark Examples☆125Updated 3 years ago
- The Internals of Spark SQL☆460Updated last month
- Apache Spark Course Material☆87Updated last year
- The official repository for the Rock the JVM Spark Essentials with Scala course☆268Updated 3 weeks ago
- The Internals of Spark Structured Streaming☆418Updated 2 years ago
- A Spark plugin for reading and writing Excel files☆480Updated this week
- ☆307Updated 6 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆586Updated last year
- Apache Spark 3 - Structured Streaming Course Material☆121Updated last year
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆439Updated last week
- Scala examples for learning to use Spark☆444Updated 4 years ago
- Examples for High Performance Spark☆506Updated 4 months ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆211Updated last year
- Spark style guide☆258Updated 5 months ago
- Databricks - Apache Spark™ - 2X Certified Developer☆265Updated 4 years ago
- Essential Spark extensions and helper methods ✨😲☆757Updated 4 months ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆443Updated 4 months ago
- The Internals of Delta Lake☆183Updated last month
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆732Updated 3 weeks ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆199Updated 6 years ago
- Apache Spark™ and Scala Workshops☆264Updated 7 months ago
- Multi-container environment with Hadoop, Spark and Hive☆205Updated last year
- Qubole Sparklens tool for performance tuning Apache Spark☆571Updated 8 months ago
- Apache Spark 3 - Structured Streaming Course Material☆45Updated 4 years ago
- ETL pipeline using pyspark (Spark - Python)☆112Updated 4 years ago
- Pyspark RDD, DataFrame and Dataset Examples in Python language☆1,227Updated 11 months ago
- A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.☆677Updated 2 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆57Updated last year
- Apache Spark (PySpark) Practice on Real Data☆273Updated 5 years ago
- The Internals of Apache Spark☆1,491Updated 5 months ago