Examples for High Performance Spark
☆526Feb 24, 2026Updated last week
Alternatives and similar repositories for high-performance-spark-examples
Users who are interested in high-performance-spark-examples are comparing it to the libraries listed below
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆108Feb 1, 2018Updated 8 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆94Apr 24, 2017Updated 8 years ago
- Base classes to use when writing tests with Spark☆1,549Dec 22, 2025Updated 2 months ago
- The Internals of Apache Spark☆1,540Jul 5, 2025Updated 7 months ago
- Scala examples for learning to use Spark☆445Sep 17, 2020Updated 5 years ago
- REST job server for Apache Spark☆2,843Jul 8, 2025Updated 7 months ago
- Notes talking about the design and implementation of Apache Spark☆5,365Apr 2, 2024Updated last year
- ☆31Oct 14, 2019Updated 6 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆186Feb 7, 2023Updated 3 years ago
- The Internals of Spark Structured Streaming☆422Updated this week
- Spark, Spark Streaming and Spark SQL unit testing strategies☆215Oct 12, 2016Updated 9 years ago
- Learning Apache Spark, including code and data. Most parts can run locally.☆598Nov 4, 2021Updated 4 years ago
- Examples for Fast Data Processing with Spark☆59Sep 10, 2013Updated 12 years ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆816Updated this week
- Examples of Spark 2.0☆212Aug 11, 2021Updated 4 years ago
- Interactive and Reactive Data Science using Scala and Spark.☆3,150May 16, 2023Updated 2 years ago
- Elastic Search on Spark☆112Oct 21, 2014Updated 11 years ago
- Code to accompany Advanced Analytics with Spark from O'Reilly Media☆1,526Sep 25, 2024Updated last year
- Examples for learning spark☆332Nov 9, 2015Updated 10 years ago
- Mirror of Apache Toree (Incubating)☆749Feb 21, 2026Updated last week
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Nov 11, 2018Updated 7 years ago
- 酷玩 Spark (Coolplay Spark): Spark source code analysis, Spark libraries, etc.☆3,482May 18, 2022Updated 3 years ago
- Qubole Sparklens tool for performance tuning Apache Spark☆590Jun 26, 2024Updated last year
- Real Time Analytics and Data Pipelines based on Spark Streaming☆531Oct 24, 2019Updated 6 years ago
- Apache Spark and Apache Kafka integration example☆124Dec 21, 2017Updated 8 years ago
- Learning to write Spark examples☆161Aug 20, 2014Updated 11 years ago
- Low level integration of Spark and Kafka☆130Mar 15, 2018Updated 7 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,371Aug 22, 2023Updated 2 years ago
- An efficient updatable key-value store for Apache Spark☆254Mar 11, 2017Updated 8 years ago
- A library for time series analysis on Apache Spark☆1,196Oct 13, 2020Updated 5 years ago
- ☆243Jun 14, 2018Updated 7 years ago
- Serverless proxy for Spark cluster☆324Oct 29, 2020Updated 5 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,608Updated this week
- Apache Spark jobs such as Principal Coordinate Analysis.☆75Jan 30, 2017Updated 9 years ago
- A curated list of awesome Apache Spark packages and resources.☆1,862Updated this week
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 5 months ago
- The missing MatPlotLib for Scala + Spark☆731Jan 30, 2022Updated 4 years ago
- A Time Series Library for Apache Spark☆1,022Jul 3, 2020Updated 5 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,037Nov 21, 2022Updated 3 years ago