PacktPublishing / Essential-PySpark-for-Scalable-Data-Analytics
Essential PySpark for Scalable Data Analytics, published by Packt
☆43 · Updated 2 years ago
Alternatives and similar repositories for Essential-PySpark-for-Scalable-Data-Analytics:
Users who are interested in Essential-PySpark-for-Scalable-Data-Analytics are comparing it to the repositories listed below:
- Practical Machine Learning on Databricks, published by Packt ☆17 · Updated 2 months ago
- Source Code for 'Applied Data Science Using PySpark' by Ramcharan Kakarla, Sundar Krishnan, and Sridhar Alla ☆45 · Updated 3 years ago
- Machine Learning Engineering on AWS, published by Packt ☆67 · Updated last year
- Data Engineering with Spark and Delta Lake ☆96 · Updated 2 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development ☆21 · Updated 5 years ago
- ☆87 · Updated 2 years ago
- ☆27 · Updated 2 years ago
- ☆27 · Updated last year
- ☆84 · Updated 2 years ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt ☆13 · Updated 2 years ago
- ☆30 · Updated 3 months ago
- Code for my "Efficient Data Processing in SQL" book. ☆56 · Updated 7 months ago
- Data Engineering with AWS Cookbook, published by Packt ☆18 · Updated 4 months ago
- Azure Data Engineering Cookbook 2nd-edition, published by Packt ☆31 · Updated last year
- Databricks ML in Action, Published by Packt ☆28 · Updated 10 months ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt ☆21 · Updated last year
- Snowflake Cookbook, published by Packt ☆79 · Updated 2 years ago
- Apache Airflow Best Practices, published by Packt ☆40 · Updated 4 months ago
- Data engineering with dbt, published by Packt ☆76 · Updated last year
- Spark Databricks Notebooks ☆14 · Updated 4 years ago
- ☆33 · Updated last year
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub. ☆26 · Updated 2 years ago
- Data Engineering with Google Cloud Platform, published by Packt ☆114 · Updated last year
- This is a repo for building out Github Actions and Tricks ☆44 · Updated 2 months ago
- Data Engineering with Databricks Cookbook, published by Packt ☆77 · Updated 9 months ago
- Data Engineering with Google Cloud Platform - Second Edition, published by Packt ☆33 · Updated 10 months ago
- Data Engineering with AWS, 2nd edition - Published by Packt ☆136 · Updated last year
- Azure Databricks Cookbook, Published by Packt ☆59 · Updated last year
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t… ☆30 · Updated 11 months ago
- Cleaning Data for Effective Data Science, published by Packt ☆97 · Updated 2 years ago