holdenk / spark-testing-base
Base classes to use when writing tests with Spark
☆1,534 Updated 4 months ago
Alternatives and similar repositories for spark-testing-base
Users who are interested in spark-testing-base are comparing it to the libraries listed below.
- Essential Spark extensions and helper methods ☆760 Updated 7 months ago
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit) ☆447 Updated last week
- Examples for High Performance Spark ☆509 Updated 6 months ago
- The Internals of Apache Spark ☆1,501 Updated 8 months ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa… ☆755 Updated 3 weeks ago
- Qubole Sparklens tool for performance tuning Apache Spark ☆577 Updated 11 months ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks ☆363 Updated 7 years ago
- A tool for monitoring and tuning Spark jobs for efficiency. ☆358 Updated 2 years ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆913 Updated last week
- Expressive types for Spark. ☆885 Updated last week
- Mirror of Apache Toree (Incubating) ☆743 Updated 3 weeks ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,007 Updated 2 years ago
- Apache Spark to Apache Cassandra connector ☆1,948 Updated last month
- A free tutorial for Apache Spark. ☆989 Updated 4 years ago
- A Spark plugin for reading and writing Excel files ☆495 Updated last week
- REST job server for Apache Spark ☆2,837 Updated last month
- The Internals of Spark Structured Streaming ☆420 Updated 2 years ago
- command line options parsing for Scala ☆1,438 Updated 2 weeks ago
- ☆309 Updated 6 years ago
- Avro SerDe for Apache Spark structured APIs. ☆235 Updated 10 months ago
- JSON library ☆1,487 Updated this week
- A Scala kernel for Jupyter ☆1,615 Updated last week
- Deploy über-JARs. Restart processes. (port of codahale/assembly-sbt) ☆1,955 Updated 2 months ago
- Data Lineage Tracking And Visualization Solution ☆626 Updated this week
- An Open Source unit test framework for Hive queries based on JUnit 4 and 5 ☆257 Updated 4 months ago
- A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs. ☆676 Updated 2 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,357 Updated last year
- A simplified, lightweight ETL Framework based on Apache Spark ☆585 Updated last year
- ☆247 Updated 5 years ago
- MLeap: Deploy ML Pipelines to Production ☆1,516 Updated 6 months ago