ShopRunner / stork
Make your libraries magically appear in Databricks.
☆47 · Updated last year
Alternatives and similar repositories for stork:
Users who are interested in stork are comparing it to the libraries listed below.
- type-class based data cleansing library for Apache Spark SQL ☆78 · Updated 5 years ago
- Examples for High Performance Spark ☆15 · Updated 4 months ago
- The iterative broadcast join example code. ☆69 · Updated 7 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆72 · Updated 3 years ago
- Create HTML profiling reports from Apache Spark DataFrames ☆195 · Updated 5 years ago
- Real-world Spark pipelines examples ☆83 · Updated 7 years ago
- A tutorial on Apache Spark Unit Testing ☆37 · Updated 9 years ago
- Databricks Migration Tools ☆43 · Updated 3 years ago
- An example PySpark project with pytest ☆17 · Updated 7 years ago
- Spark package for checking data quality ☆221 · Updated 5 years ago
- Google BigQuery support for Spark, SQL, and DataFrames ☆155 · Updated 5 years ago
- Conversion utility from Zeppelin notes to Jupyter notebooks. ☆44 · Updated 5 years ago
- Python API for Deequ ☆41 · Updated 4 years ago
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration. ☆70 · Updated last year
- ☆198 · Updated last year
- Apache (Py)Spark type annotations (stub files). ☆116 · Updated 2 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 · Updated last year
- Example unit tests for Apache Spark Python scripts using the py.test framework ☆84 · Updated 8 years ago
- A simple Scala Based Project Template for Apache Spark ☆22 · Updated 8 years ago
- Complete Pipeline Training at Big Data Scala By the Bay ☆71 · Updated 9 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support… ☆109 · Updated 7 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S… ☆66 · Updated 9 years ago
- ☆37 · Updated 8 years ago
- Kinesis Connector for Structured Streaming ☆136 · Updated 8 months ago
- Repository of sample Databricks notebooks ☆256 · Updated 11 months ago
- A stack overflow for Apache Spark ☆72 · Updated 7 years ago
- Spark connector for SFTP ☆100 · Updated last year
- Oozie Workflow to Airflow DAGs migration tool ☆88 · Updated last week
- Structured Streaming Machine Learning example with Spark 2.0 ☆92 · Updated 7 years ago
- How to manage Slowly Changing Dimensions with Apache Hive ☆55 · Updated 5 years ago