holdenk / spark-testing-base
Base classes to use when writing tests with Spark
☆1,525 · Updated 2 weeks ago
Alternatives and similar repositories for spark-testing-base:
Users who are interested in spark-testing-base are comparing it to the libraries listed below.
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit) ☆438 · Updated last month
- Essential Spark extensions and helper methods ✨😲 ☆754 · Updated 3 months ago
- The Internals of Apache Spark ☆1,489 · Updated 4 months ago
- Qubole Sparklens tool for performance tuning Apache Spark ☆569 · Updated 7 months ago
- Examples for High Performance Spark ☆506 · Updated 2 months ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa… ☆722 · Updated last week
- A tool for monitoring and tuning Spark jobs for efficiency. ☆357 · Updated 2 years ago
- Expressive types for Spark. ☆882 · Updated this week
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,007 · Updated 2 years ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆899 · Updated 2 months ago
- Mirror of Apache Toree (Incubating) ☆742 · Updated 2 months ago
- A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs. ☆678 · Updated 2 years ago
- REST job server for Apache Spark ☆2,835 · Updated 3 weeks ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,350 · Updated last year
- command line options parsing for Scala ☆1,434 · Updated 9 months ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks ☆363 · Updated 7 years ago
- The Internals of Spark Structured Streaming ☆416 · Updated 2 years ago
- The missing MatPlotLib for Scala + Spark ☆731 · Updated 2 years ago
- The Internals of Spark SQL ☆459 · Updated 2 weeks ago
- A Scala kernel for Jupyter ☆1,601 · Updated 2 months ago
- ☆306 · Updated 6 years ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink. ☆553 · Updated 3 years ago
- JSON library ☆1,486 · Updated this week
- A simplified, lightweight ETL Framework based on Apache Spark ☆585 · Updated last year
- A free tutorial for Apache Spark. ☆989 · Updated 4 years ago
- A Spark plugin for reading and writing Excel files ☆475 · Updated this week
- Scala examples for learning to use Spark ☆444 · Updated 4 years ago
- Avro SerDe for Apache Spark structured APIs. ☆231 · Updated 6 months ago
- Deploy über-JARs. Restart processes. (port of codahale/assembly-sbt) ☆1,951 · Updated last week
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE ☆291 · Updated 2 years ago