Master Big Data With PySpark and AWS
☆132Jun 27, 2023Updated 2 years ago
Alternatives and similar repositories for course-master-big-data-with-pyspark-and-aws
Users that are interested in course-master-big-data-with-pyspark-and-aws are comparing it to the libraries listed below
Sorting:
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆17Dec 18, 2018Updated 7 years ago
- Materials for the next course☆25Feb 3, 2023Updated 3 years ago
- Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark☆11May 22, 2018Updated 7 years ago
- ☆15Jul 31, 2022Updated 3 years ago
- Repository related to Spark SQL and Pyspark using Python3☆42Jun 12, 2022Updated 3 years ago
- ☆18Nov 9, 2025Updated 4 months ago
- Repository for Spark using Python material. It is popularly known as PySpark.☆20Aug 18, 2021Updated 4 years ago
- ☆20Aug 23, 2020Updated 5 years ago
- Collection of Databricks and Jupyter Notebooks☆22Feb 9, 2026Updated last month
- ☆20Aug 26, 2023Updated 2 years ago
- ☆24Jan 7, 2021Updated 5 years ago
- ☆25Jan 30, 2023Updated 3 years ago
- A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0☆26Aug 5, 2021Updated 4 years ago
- ☆25Apr 18, 2021Updated 4 years ago
- ETL jobs for Firefox Telemetry☆29Nov 7, 2025Updated 4 months ago
- ☆10Jun 21, 2021Updated 4 years ago
- Learn various Algorithms of Machine Learning like SVC, Decision Tree , Random Forest , Logistic Regression, Linear Regression and much Mo…☆11Jul 31, 2019Updated 6 years ago
- Unit testing using databricks connect☆32Nov 3, 2021Updated 4 years ago
- Data sets and ML models versioning example from DVC get started☆10Jun 4, 2024Updated last year
- ☆26May 25, 2020Updated 5 years ago
- SAM application for creating Billing Conductor custom line items to distribute SP/RI benefits purchased outside of billing groups☆17May 22, 2024Updated last year
- Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR☆10Jul 26, 2023Updated 2 years ago
- Hands-On Big Data Analytics with PySpark, Published by Packt☆37Jan 30, 2023Updated 3 years ago
- A part of the course Mobile Application Development☆13Nov 30, 2021Updated 4 years ago
- Learning PySpark video series☆11Mar 5, 2018Updated 8 years ago
- ☆14Sep 14, 2021Updated 4 years ago
- Natural Language Processing☆11Jun 23, 2021Updated 4 years ago
- ☆11Mar 27, 2024Updated last year
- ☆23Jan 31, 2026Updated last month
- GestureX is an OpenCV-based hand motion sensing system for intuitive, efficient user control.This project aims to investigate the potenti…☆16Jun 29, 2024Updated last year
- PredictorFinc is a scalable supervised machine learning model the predicts stock price change through Decision Tree Regressor using data …☆12Sep 5, 2023Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆211Dec 31, 2025Updated 2 months ago
- PySpark Cheatsheet☆36Jan 18, 2023Updated 3 years ago
- ☆34Aug 6, 2020Updated 5 years ago
- Data Engineering on GCP☆41Oct 20, 2022Updated 3 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆34Oct 18, 2020Updated 5 years ago
- Serverless enables you to focus on code, not infrastructure. Deploy a Docker Container to Cloud Run using this series. Cloud Run is a ser…☆43Feb 15, 2023Updated 3 years ago
- ☆42Apr 10, 2024Updated last year
- Sample AutoML notebooks evolving towards MLOps☆11Feb 15, 2022Updated 4 years ago