runawayhorse001 / learning-apache-sparkLinks
☆17Updated 8 years ago
Alternatives and similar repositories for learning-apache-spark
Users that are interested in learning-apache-spark are comparing it to the libraries listed below
Sorting:
- [DEPRECATED] Demo repository implementing an end-to-end MLOps workflow on Databricks. Project derived from dbx basic python template☆114Updated 2 years ago
- Just starting your DE journey or along the way already?. I will be sharing a short list of DATA-ENGINEERING-CENTRED books that covers the…☆34Updated 3 years ago
- The data science project used in my Datacamp course Unit Testing for Data Science in Python☆143Updated 2 years ago
- Hey this is the repo that has all the queries and data for my video game training series!☆153Updated 3 years ago
- Delta Lake examples☆227Updated 11 months ago
- Guide for databricks spark certification☆58Updated 4 years ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆221Updated 2 years ago
- ☆120Updated last month
- ☆88Updated 2 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆58Updated last year
- (project & tutorial) dag pipeline tests + ci/cd setup☆88Updated 4 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆104Updated 4 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆169Updated last year
- PySpark Cheatsheet☆36Updated 2 years ago
- A tutorial for the Great Expectations library.☆71Updated 4 years ago
- This is repository of my YouTube Course on End to End Apache Spark in AIEngineering YouTube Channel☆188Updated 4 years ago
- Example repo to kickstart integration with mlflow pipelines.☆77Updated 2 years ago
- how to unit test your PySpark code☆29Updated 4 years ago
- This repo contains commands that data engineers use in day to day work.☆61Updated 2 years ago
- Projects done in the Data Engineering Nanodegree by Udacity.com☆273Updated 6 years ago
- Spark style guide☆263Updated 11 months ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆478Updated 11 months ago
- Code repository for the "PySpark in Action" book☆206Updated 3 months ago
- ☆143Updated 2 years ago
- I will attempt to create my own spotify wrapped by collecting data from the spotify API, perform transformations and create informative d…☆74Updated 2 years ago
- Udacity Data Engineering Nano Degree (DEND)☆185Updated 5 years ago
- Delta Lake helper methods in PySpark☆325Updated last year
- ☆88Updated 3 years ago
- Data engineering interviews Q&A for data community by data community☆65Updated 5 years ago
- Jupyter notebooks for pyspark tutorials given at University☆110Updated 2 months ago