holdenk / spark-testing-base
Base classes to use when writing tests with Spark
β1,528Updated 3 months ago
Alternatives and similar repositories for spark-testing-base:
Users that are interested in spark-testing-base are comparing it to the libraries listed below
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)β444Updated 3 weeks ago
- Essential Spark extensions and helper methods β¨π²β758Updated 6 months ago
- Examples for High Performance Sparkβ508Updated 6 months ago
- The Internals of Apache Sparkβ1,499Updated 7 months ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricksβ363Updated 7 years ago
- Qubole Sparklens tool for performance tuning Apache Sparkβ575Updated 10 months ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spaβ¦β745Updated 3 weeks ago
- Expressive types for Spark.β884Updated 2 weeks ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhereβ1,007Updated 2 years ago
- A tool for monitoring and tuning Spark jobs for efficiency.β358Updated 2 years ago
- REST job server for Apache Sparkβ2,836Updated this week
- The Internals of Spark Structured Streamingβ420Updated 2 years ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.β909Updated last week
- Mirror of Apache Toree (Incubating)β742Updated 2 months ago
- command line options parsing for Scalaβ1,437Updated last year
- A Spark plugin for reading and writing Excel filesβ492Updated this week
- JSON libraryβ1,486Updated last week
- A free tutorial for Apache Spark.β989Updated 4 years ago
- Scala examples for learning to use Sparkβ444Updated 4 years ago
- A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.β676Updated 2 years ago
- The missing MatPlotLib for Scala + Sparkβ729Updated 3 years ago
- A Scala kernel for Jupyterβ1,615Updated last month
- β309Updated 6 years ago
- Avro SerDe for Apache Spark structured APIs.β234Updated 9 months ago
- Deploy ΓΌber-JARs. Restart processes. (port of codahale/assembly-sbt)β1,953Updated last month
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Sparkβ1,355Updated last year
- Docker build for Apache Sparkβ673Updated 3 years ago
- A simplified, lightweight ETL Framework based on Apache Sparkβ585Updated last year
- Spark, Spark Streaming and Spark SQL unit testing strategiesβ218Updated 8 years ago
- Avro schema generation and serialization / deserialization for Scalaβ722Updated last week