japila-books / apache-spark-internalsLinks
The Internals of Apache Spark
β1,521Updated 3 months ago
Alternatives and similar repositories for apache-spark-internals
Users that are interested in apache-spark-internals are comparing it to the libraries listed below
Sorting:
- Essential Spark extensions and helper methods β¨π²β764Updated last month
- The Internals of Spark SQLβ477Updated last week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spaβ¦β793Updated 3 weeks ago
- Qubole Sparklens tool for performance tuning Apache Sparkβ584Updated last year
- Examples for High Performance Sparkβ521Updated last month
- β312Updated 6 years ago
- Base classes to use when writing tests with Sparkβ1,544Updated this week
- The Internals of Spark Structured Streamingβ421Updated 2 years ago
- A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.β676Updated 3 years ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricksβ364Updated 8 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Sparkβ1,366Updated 2 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhereβ1,007Updated 3 years ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.β930Updated this week
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)β452Updated 2 months ago
- A free tutorial for Apache Spark.β991Updated 4 years ago
- A curated list of awesome Apache Spark packages and resources.β1,838Updated last year
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.β551Updated 4 years ago
- Scala examples for learning to use Sparkβ445Updated 5 years ago
- REST job server for Apache Sparkβ2,845Updated 3 months ago
- Apache Sparkβ’ and Scala Workshopsβ263Updated last year
- A connector for Spark that allows reading and writing to/from Redis clusterβ944Updated last year
- Mirror of Apache Toree (Incubating)β746Updated last week
- A Spark plugin for reading and writing Excel filesβ514Updated 2 weeks ago
- Diagrams describing Apache Hadoop internals (2.3.0 or later).β430Updated 5 years ago
- A simplified, lightweight ETL Framework based on Apache Sparkβ588Updated last year
- The Internals of Delta Lakeβ186Updated 9 months ago
- β247Updated 6 years ago
- Data Lineage Tracking And Visualization Solutionβ643Updated last week
- This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala languageβ567Updated last year
- Jupyter magics and kernels for working with remote Spark clustersβ1,363Updated last month