AISCIENCES / course-master-big-data-with-pyspark-and-aws
Master Big Data With PySpark and AWS
☆132Jun 27, 2023Updated 2 years ago
Alternatives and similar repositories for course-master-big-data-with-pyspark-and-aws
Users that are interested in course-master-big-data-with-pyspark-and-aws are comparing it to the libraries listed below
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆17Dec 18, 2018Updated 7 years ago
- Materials for the next course☆25Feb 3, 2023Updated 3 years ago
- Example project for consuming an AWS Kinesis stream and saving the data to Amazon Redshift using Apache Spark☆11May 22, 2018Updated 7 years ago
- Repository related to Spark SQL and Pyspark using Python3☆42Jun 12, 2022Updated 3 years ago
- ☆18Nov 9, 2025Updated 3 months ago
- ☆20Aug 23, 2020Updated 5 years ago
- Collection of Databricks and Jupyter Notebooks☆22Feb 9, 2026Updated last week
- ☆24Jan 7, 2021Updated 5 years ago
- Spark application for analyzing Apache access logs and detecting anomalies, with an accompanying Medium article.☆21Jan 30, 2019Updated 7 years ago
- Section 3 of the Django + Angular + Ionic Course☆23Jul 22, 2018Updated 7 years ago
- ☆20Aug 26, 2023Updated 2 years ago
- ☆23Jun 3, 2021Updated 4 years ago
- ☆25Apr 18, 2021Updated 4 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆28Jun 13, 2022Updated 3 years ago
- ETL jobs for Firefox Telemetry☆29Nov 7, 2025Updated 3 months ago
- ☆10Jun 21, 2021Updated 4 years ago
- Learn various machine learning algorithms like SVC, Decision Tree, Random Forest, Logistic Regression, Linear Regression and much Mo…☆11Jul 31, 2019Updated 6 years ago
- Unit testing using databricks connect☆32Nov 3, 2021Updated 4 years ago
- SAM application for creating Billing Conductor custom line items to distribute SP/RI benefits purchased outside of billing groups☆17May 22, 2024Updated last year
- GestureX is an OpenCV-based hand motion sensing system for intuitive, efficient user control. This project aims to investigate the potenti…☆16Jun 29, 2024Updated last year
- Natural Language Processing☆11Jun 23, 2021Updated 4 years ago
- Learning PySpark video series☆11Mar 5, 2018Updated 7 years ago
- Bilibili API collection and organization [continuously updated...]☆10Apr 25, 2025Updated 9 months ago
- ☆11Mar 27, 2024Updated last year
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆202Dec 31, 2025Updated last month
- PySpark Cheatsheet☆36Jan 18, 2023Updated 3 years ago
- ☆34Aug 6, 2020Updated 5 years ago
- Python for Text Classification with Machine Learning in Python 3.6.☆36Sep 27, 2018Updated 7 years ago
- Serverless enables you to focus on code, not infrastructure. Deploy a Docker Container to Cloud Run using this series. Cloud Run is a ser…☆43Feb 15, 2023Updated 3 years ago
- Tutorial about discovering and exploring hidden web APIs☆10Mar 13, 2019Updated 6 years ago
- Python package for parsing log lines in the logfmt style.☆20Nov 9, 2018Updated 7 years ago
- Sample AutoML notebooks evolving towards MLOps☆11Feb 15, 2022Updated 4 years ago
- My applied big data analytics project with PySpark.☆10Sep 21, 2022Updated 3 years ago
- ☆12Jan 25, 2018Updated 8 years ago
- Anaconda plugin for StarCluster☆21Aug 14, 2024Updated last year
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 3 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- ☆11Apr 27, 2020Updated 5 years ago
- ☆10Aug 12, 2024Updated last year