PacktPublishing / Essential-PySpark-for-Scalable-Data-Analytics
Essential PySpark for Scalable Data Analytics, published by Packt
☆43Updated last year
Alternatives and similar repositories for Essential-PySpark-for-Scalable-Data-Analytics:
Users that are interested in Essential-PySpark-for-Scalable-Data-Analytics are comparing it to the libraries listed below
- ☆87Updated 2 years ago
- ☆28Updated last year
- Data Engineering with Databricks Cookbook, published by Packt☆62Updated 7 months ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 5 months ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt☆21Updated last year
- Source Code for 'Applied Data Science Using PySpark' by Ramcharan Kakarla, Sundar Krishnan, and Sridhar Alla☆45Updated 3 years ago
- Machine Learning Engineering on AWS, published by Packt☆66Updated 10 months ago
- Databricks ML in Action, Published by Packt☆27Updated 8 months ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Updated 2 years ago
- ☆30Updated last month
- Data Engineering with AWS, 2nd edition - Published by Packt☆126Updated last year
- Practical Machine Learning on Databricks, published by packt☆16Updated this week
- Code for "Advanced data transformations in SQL" free live workshop☆71Updated 3 months ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆28Updated 9 months ago
- Snowflake Cookbook, published by Packt☆76Updated last year
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- Data Engineering with Spark and Delta Lake☆94Updated 2 years ago
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub.☆26Updated 2 years ago
- ☆31Updated last year
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆105Updated 2 years ago
- Azure Data Engineering Cookbook 2nd-edition, published by Packt☆31Updated last year
- This repo will guide you step-by-step method to create star schema dimensional model.☆24Updated 3 years ago
- Building ETL Pipelines with Python☆118Updated 6 months ago
- Data engineering with dbt, published by Packt☆66Updated 10 months ago
- ☆37Updated last year
- Course Material Data Engineering on AWS Course☆28Updated 4 months ago
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand☆46Updated last year
- Simple ETL pipeline using Python☆25Updated last year
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆75Updated 5 years ago
- Data Engineering with Google Cloud Platform, published by Packt☆113Updated last year