stream-processing-with-spark / notebooksLinks
Interactive Notebooks that support the book
☆40Updated 5 years ago
Alternatives and similar repositories for notebooks
Users that are interested in notebooks are comparing it to the libraries listed below
Sorting:
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 6 years ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆227Updated 2 years ago
- Apache Spark examples exclusively in Java☆103Updated 2 years ago
- Data Engineering with Spark and Delta Lake☆106Updated 3 years ago
- Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide)☆231Updated 3 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆34Updated 6 years ago
- Spark on Kubernetes using Helm☆33Updated 5 years ago
- These are some code examples☆56Updated 6 years ago
- ☆65Updated last year
- Real-world Spark pipelines examples☆83Updated 7 years ago
- spark on kubernetes☆104Updated 2 years ago
- Examples of Spark 3.0☆45Updated 5 years ago
- Mastering Spark for Data Science, published by Packt☆49Updated 3 years ago
- Spark Examples☆127Updated 4 years ago
- ☆63Updated 6 years ago
- Magic to help Spark pipelines upgrade☆34Updated last year
- Spark in Action, 2nd edition - chapter 1 - Introduction☆107Updated 2 years ago
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo…☆87Updated 2 years ago
- Databricks - Apache Spark™ - 2X Certified Developer☆265Updated 5 years ago
- The Internals of Spark on Kubernetes☆72Updated 3 years ago
- Building Big Data Pipelines with Apache Beam, published by Packt☆89Updated 2 years ago
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆87Updated 7 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆88Updated 6 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 5 years ago
- An example Apache Beam project.☆111Updated 8 years ago
- AWS Big Data Certification☆25Updated last year
- PySpark data-pipeline testing and CICD☆28Updated 5 years ago
- ☆32Updated 4 months ago
- Ambari stack service for installing and managing Apache Airflow on HDP cluster☆58Updated 7 years ago
- ☆20Updated 6 years ago