bartosz25 / spark-scala-playground
Sample processing code using Spark 2.1+ and Scala
☆51 Jun 28, 2020 Updated 5 years ago
Alternatives and similar repositories for spark-scala-playground
Users that are interested in spark-scala-playground are comparing it to the libraries listed below
- Examples of Spark 3.0 ☆45 Nov 11, 2020 Updated 5 years ago
- Code snippets used in demos recorded for the blog. ☆37 Jan 17, 2026 Updated last month
- Spark Custom Stream Source and Sink ☆12 Jan 19, 2019 Updated 7 years ago
- ☆11 Apr 15, 2019 Updated 6 years ago
- Gives TreeLog a GUI, the ScalaJS ReactTreeView ☆10 Jun 23, 2016 Updated 9 years ago
- UDF, GenericUDF, UDTF, UDAF ☆12 Jul 1, 2022 Updated 3 years ago
- Data Exploration Using Spark 2.0 ☆14 Apr 17, 2018 Updated 7 years ago
- Scala API for Apache Spark SQL high-order functions ☆14 Aug 4, 2023 Updated 2 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark ☆29 Nov 4, 2024 Updated last year
- Spark-Radiant is Apache Spark Performance and Cost Optimizer ☆25 Dec 31, 2024 Updated last year
- calcite-arrow-sample(WIP) ☆13 Dec 17, 2017 Updated 8 years ago
- Code that was used as an example during the Data+AI Summit 2020 ☆15 Mar 8, 2021 Updated 4 years ago
- A sample custom Spark Structured Streaming Datasource with Websockets ☆14 May 14, 2020 Updated 5 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in… ☆94 May 9, 2025 Updated 9 months ago
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends ☆15 Oct 3, 2023 Updated 2 years ago
- low-level helpers for Apache Spark libraries and tests ☆16 Dec 29, 2018 Updated 7 years ago
- ☆13 Dec 12, 2020 Updated 5 years ago
- Scala implementation of Histogrammar, with optional front-ends and back-ends as separate Maven projects. ☆15 Dec 29, 2023 Updated 2 years ago
- Cloudera Manager CM API Python end-to-end example ☆15 Aug 29, 2019 Updated 6 years ago
- ☆40 May 16, 2023 Updated 2 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark ☆16 Jan 22, 2024 Updated 2 years ago
- Essential Spark extensions and helper methods ✨😲 ☆765 Sep 14, 2025 Updated 5 months ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa… ☆811 Feb 5, 2026 Updated last week
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆73 Mar 14, 2021 Updated 4 years ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️ ☆18 Feb 10, 2026 Updated last week
- Mastering Apache Spark 2x, published by Packt ☆17 Jan 30, 2023 Updated 3 years ago
- Library for organizing batch processing pipelines in Apache Spark ☆42 Jan 4, 2017 Updated 9 years ago
- ☆45 Apr 27, 2020 Updated 5 years ago
- Qubole Sparklens tool for performance tuning Apache Spark ☆589 Jun 26, 2024 Updated last year
- Route requests based on deployed Funcs ☆21 Apr 20, 2017 Updated 8 years ago
- Data Sketches for Apache Spark ☆22 Dec 22, 2022 Updated 3 years ago
- Edit code in IntelliJ, eval/run in Zeppelin notebook ☆18 Mar 17, 2019 Updated 6 years ago
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit) ☆454 Feb 8, 2026 Updated last week
- Utilities for writing tests that use Apache Spark. ☆24 Dec 29, 2018 Updated 7 years ago
- 🚚 ETL for Spark and Airflow ☆25 Mar 19, 2018 Updated 7 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table ☆23 May 7, 2018 Updated 7 years ago
- Sample Spark Code ☆91 Sep 19, 2018 Updated 7 years ago
- A tool to validate data, built around Apache Spark. ☆100 Feb 9, 2026 Updated last week
- ☆23 Oct 8, 2018 Updated 7 years ago