sat28 / zeppelin_notebook_to_script
Converting a zeppelin notebook in single programming language to respective script
☆18Updated 4 years ago
Alternatives and similar repositories for zeppelin_notebook_to_script:
Users that are interested in zeppelin_notebook_to_script are comparing it to the libraries listed below
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆90Updated last year
- ☆16Updated last year
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 4 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Create HTML profiling reports from Apache Spark DataFrames☆195Updated 4 years ago
- type-class based data cleansing library for Apache Spark SQL☆79Updated 5 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 9 years ago
- Examples for High Performance Spark☆15Updated 2 months ago
- Make your libraries magically appear in Databricks.☆47Updated last year
- These are some code examples☆55Updated 5 years ago
- An example PySpark project with pytest☆17Updated 7 years ago
- Magic to help Spark pipelines upgrade☆34Updated 3 months ago
- Fake Pandas / PySpark DataFrame creator☆44Updated 10 months ago
- A simple Spark TDD example☆26Updated 7 years ago
- MLflow samples - deprecated☆22Updated last year
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- The iterative broadcast join example code.☆69Updated 7 years ago
- Installation guide for Apache Spark + Hadoop on Mac/Linux☆59Updated 7 years ago
- Flowchart for debugging Spark applications☆104Updated 3 months ago
- Test suite to document the behavior of Spark☆21Updated 3 years ago
- Cheatsheet for Spark DataFrame☆91Updated 5 years ago
- Spark and Delta Lake Workshop☆22Updated 2 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 6 years ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆38Updated last year
- A pyspark lib to validate data quality☆18Updated 2 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 7 years ago