ShopRunner / stork
Make your libraries magically appear in Databricks.
☆47 Updated last year
Alternatives and similar repositories for stork:
Users who are interested in stork are comparing it to the libraries listed below.
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆72 Updated 4 years ago
- Spark package for checking data quality ☆221 Updated 5 years ago
- Type-class based data cleansing library for Apache Spark SQL ☆78 Updated 5 years ago
- Create HTML profiling reports from Apache Spark DataFrames ☆196 Updated 5 years ago
- An example PySpark project with pytest ☆16 Updated 7 years ago
- The iterative broadcast join example code. ☆69 Updated 7 years ago
- Example unit tests for Apache Spark Python scripts using the py.test framework ☆84 Updated 9 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated last year
- These are some code examples ☆55 Updated 5 years ago
- Google Spreadsheets datasource for SparkSQL and DataFrames ☆57 Updated last year
- Examples for High Performance Spark ☆15 Updated 6 months ago
- Complete Pipeline Training at Big Data Scala By the Bay ☆71 Updated 9 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support… ☆109 Updated 7 years ago
- Magic to help Spark pipelines upgrade ☆34 Updated 7 months ago
- File compaction tool that runs on top of the Spark framework. ☆59 Updated 6 years ago
- Real-world Spark pipelines examples