Spark Examples
☆127Feb 1, 2022Updated 4 years ago
Alternatives and similar repositories for spark-examples
Users that are interested in spark-examples are comparing it to the libraries listed below
Sorting:
- This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language☆566Mar 20, 2024Updated last year
- ☆10Aug 2, 2021Updated 4 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Jan 22, 2024Updated 2 years ago
- Atomic Scala Book Solutions - for Beginners and first time Functional Programmers☆12Mar 10, 2020Updated 5 years ago
- Java OutOfMemory Example☆11Jun 19, 2021Updated 4 years ago
- ☆23Apr 22, 2019Updated 6 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Dec 13, 2017Updated 8 years ago
- Apache Spark Course Material☆96Apr 21, 2023Updated 2 years ago
- ☆20Dec 19, 2023Updated 2 years ago
- ☆16Aug 31, 2019Updated 6 years ago
- The official repository for the Rock the JVM Spark Essentials with Scala course☆278Sep 10, 2025Updated 5 months ago
- Open Source Capital Markets Platform: Unified Cross-Asset Trading, Risk Management & Post-Trade Operations. Modular, Auditable, Sovereign…☆16Feb 21, 2026Updated last week
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated last year
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆48Jan 7, 2025Updated last year
- Spark Databricks Notebooks☆14Dec 19, 2020Updated 5 years ago
- ☆24Oct 3, 2023Updated 2 years ago
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 5 months ago
- Source code examples for the Second Edition of the Scala Cookbook☆47Sep 30, 2022Updated 3 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆49Dec 2, 2023Updated 2 years ago
- Standalone examples shown in the book "Practical FP in Scala: A hands-on approach"☆199Jul 6, 2022Updated 3 years ago
- Repo which holds the materials for the EMR Zero To Hero☆27May 7, 2022Updated 3 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Jun 17, 2025Updated 8 months ago
- A tool to validate data, built around Apache Spark.☆100Feb 19, 2026Updated last week
- The Internals of Spark SQL☆486Jan 25, 2026Updated last month
- This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.☆57Jun 10, 2018Updated 7 years ago
- Rest API for Todobackend on top of Cassandra☆26Feb 22, 2023Updated 3 years ago
- SoundCloud Backend Developer Challenge☆25Jan 29, 2017Updated 9 years ago
- ☆38May 22, 2024Updated last year
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Feb 19, 2024Updated 2 years ago
- Machine Learning Workshop Resources☆12Feb 16, 2019Updated 7 years ago
- DAG-based blockchain☆10Apr 20, 2019Updated 6 years ago
- The Internals of Delta Lake☆188Nov 30, 2025Updated 3 months ago
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.☆70May 8, 2023Updated 2 years ago
- This repository has the code from the text and the videos for "Introduction to Programming and Problem Solving using Scala".☆30Feb 11, 2018Updated 8 years ago
- Spark大型项目实战:电商用户行为分析大数据平台\Spark大型项目实战:电商用户行为分析大数据平台(史上第一套高端大数据项目实战课程)☆34Apr 14, 2023Updated 2 years ago
- Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS☆74Oct 1, 2025Updated 5 months ago
- Apache Spark 3 - Spark Programming in Python for Beginners☆513Jul 25, 2024Updated last year
- ☆203Apr 25, 2023Updated 2 years ago
- Spark with Scala example projects☆34Apr 17, 2019Updated 6 years ago