spark-examples / spark-scala-examples
This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language
☆556Updated 6 months ago
Related projects: ⓘ
- Spark Examples☆124Updated 2 years ago
- The Internals of Spark SQL☆448Updated last month
- The official repository for the Rock the JVM Spark Essentials with Scala course☆260Updated 8 months ago
- The Internals of Spark Structured Streaming☆415Updated last year
- A Spark plugin for reading and writing Excel files☆461Updated this week
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆429Updated last week
- XML data source for Spark SQL and DataFrames☆500Updated last month
- Examples for High Performance Spark☆497Updated 3 weeks ago
- Essential Spark extensions and helper methods ✨😲☆747Updated 2 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆581Updated 7 months ago
- ☆303Updated 5 years ago
- Databricks - Apache Spark™ - 2X Certified Developer☆261Updated 4 years ago
- Apache Spark Course Material☆84Updated last year
- Qubole Sparklens tool for performance tuning Apache Spark☆561Updated 2 months ago
- Spark style guide☆255Updated last year
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆202Updated last year
- Stream Processing with Apache Flink - Scala Examples☆392Updated 10 months ago
- Scala examples for learning to use Spark☆444Updated 4 years ago
- The Internals of Apache Spark☆1,461Updated this week
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆445Updated last year
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆693Updated last month
- This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]☆1,172Updated 4 months ago
- Pyspark RDD, DataFrame and Dataset Examples in Python language☆1,148Updated 5 months ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆184Updated last year
- ☆582Updated 2 years ago
- A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.☆673Updated 2 years ago
- This repository contains the notebooks and presentations we use for our Databricks Tech Talks☆689Updated last year
- ☆235Updated this week
- ETL pipeline using pyspark (Spark - Python)☆106Updated 4 years ago
- Apache Spark 3 - Structured Streaming Course Material☆120Updated last year