osin-vladimir / architect_big_data_solutions_with_spark
code, labs and lectures for the course
☆45 Updated last year
Related projects
Alternatives and complementary repositories for architect_big_data_solutions_with_spark
- ☆19 Updated 6 years ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin… ☆24 Updated last year
- PySpark Cheatsheet ☆35 Updated last year
- Deep Learning with Apache Spark and Deep Cognition ☆58 Updated 6 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee ☆60 Updated 6 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on… ☆25 Updated 2 years ago
- Repository used for Spark Trainings ☆53 Updated last year
- ETL pipeline using pyspark (Spark - Python) ☆108 Updated 4 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u… ☆26 Updated 5 years ago
- ☆37 Updated 8 years ago
- MLFlow Spark Summit 2019 Presentation ☆67 Updated 5 years ago
- My Study guide used to pass the CRT020 Spark Certification exam ☆31 Updated 4 years ago
- Because its never late to start taking notes and 'public' it... ☆60 Updated 3 weeks ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… ☆53 Updated last year
- My Git Repo for Csv Data ☆19 Updated 4 years ago
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark ☆27 Updated 6 years ago
- PySpark-ETL ☆23 Updated 4 years ago
- Python Notes on IPython Notebook files. ☆37 Updated 3 years ago
- ☆148 Updated 6 years ago
- PySpark Tutorial for Beginners on Google Colab: Hands-On Guide ☆16 Updated 4 years ago
- Blog post on ETL pipelines with Airflow ☆23 Updated 4 years ago
- This is repository of my YouTube Course on End to End Apache Spark in AIEngineering YouTube Channel ☆188 Updated 3 years ago
- Guide for databricks spark certification ☆58 Updated 3 years ago
- ☆111 Updated 4 years ago
- PySpark Code for Hands-on Learners ☆114 Updated 5 years ago
- Jupyter notebooks for pyspark tutorials given at University ☆104 Updated 2 months ago
- Fundamentals of Spark with Python (using PySpark), code examples ☆331 Updated 2 years ago
- Managing machine learning life-cycle with MLflow tutorial ☆23 Updated last year
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2 ☆83 Updated 4 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆55 Updated 5 years ago