This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language
☆566Mar 20, 2024Updated last year
Alternatives and similar repositories for spark-scala-examples
Users that are interested in spark-scala-examples are comparing it to the libraries listed below
Sorting:
- Pyspark RDD, DataFrame and Dataset Examples in Python language☆1,346Dec 7, 2025Updated 2 months ago
- Spark Examples☆127Feb 1, 2022Updated 4 years ago
- Apache Spark Course Material☆96Apr 21, 2023Updated 2 years ago
- The official repository for the Rock the JVM Spark Essentials with Scala course☆278Sep 10, 2025Updated 5 months ago
- ☆12Sep 25, 2024Updated last year
- Apache Spark 3 - Structured Streaming Course Material☆46Sep 8, 2020Updated 5 years ago
- The Internals of Spark SQL☆486Jan 25, 2026Updated last month
- Fundamentals of Spark with Python (using PySpark), code examples☆362Oct 29, 2022Updated 3 years ago
- A free tutorial for Apache Spark.☆992Jan 5, 2026Updated 2 months ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆58Dec 4, 2023Updated 2 years ago
- Apache Spark Connector for SQL Server and Azure SQL☆287Feb 27, 2025Updated last year
- ☆20Aug 17, 2019Updated 6 years ago
- ☆202Feb 18, 2026Updated 2 weeks ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Jan 22, 2024Updated 2 years ago
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 5 months ago
- The Internals of Apache Spark☆1,541Jul 5, 2025Updated 8 months ago
- Repository used for Spark Trainings☆54Apr 21, 2023Updated 2 years ago
- Spark implementation of Slowly Changing Dimension type 2☆11Jan 8, 2019Updated 7 years ago
- The Internals of Spark Structured Streaming☆422Updated this week
- A Spark plugin for reading and writing Excel files☆520Feb 12, 2026Updated 3 weeks ago
- Snowflake Data Source for Apache Spark.☆229Updated this week
- A curated list of awesome Apache Spark packages and resources.☆1,863Feb 27, 2026Updated last week
- Spark with Scala example projects☆34Apr 17, 2019Updated 6 years ago
- 《Spark: The Definitive Guide Big Data Processing Made Simple》学习心得,说翻译嘛也不算完全翻译吧,只能说以个人经验和理解重新叙述一遍。同步更新在掘金上,点链接可跳转☆36Aug 4, 2019Updated 6 years ago
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆454Feb 8, 2026Updated 3 weeks ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,608Updated this week
- Supporting code for the tutorials on https://www.baeldung.com/scala☆348Feb 23, 2026Updated last week
- Scala examples for learning to use Spark☆445Sep 17, 2020Updated 5 years ago
- All Algorithms implemented in Scala☆1,103Oct 6, 2024Updated last year
- A connector for Spark that allows reading and writing to/from Redis cluster☆948Oct 22, 2024Updated last year
- Apache Spark - A unified analytics engine for large-scale data processing☆42,933Updated this week
- ☆13Dec 16, 2020Updated 5 years ago
- A complete data engineering project demonstrating modern data stack practices with Apache Flink, Iceberg, Trino and Superset☆20Sep 29, 2025Updated 5 months ago
- ☆151Apr 4, 2018Updated 7 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,308Updated this week
- A Data Engineering & Machine Learning Knowledge Hub☆1,140Feb 2, 2024Updated 2 years ago
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,666Mar 16, 2024Updated last year
- Project for James' Apache Spark with Scala course☆125Jul 6, 2020Updated 5 years ago
- Delta Lake examples☆240Oct 8, 2024Updated last year