osin-vladimir / architect_big_data_solutions_with_spark
code, labs and lectures for the course
☆47Updated 2 years ago
Alternatives and similar repositories for architect_big_data_solutions_with_spark
Users that are interested in architect_big_data_solutions_with_spark are comparing it to the libraries listed below
- Because it's never late to start taking notes and 'public' it...☆59Updated last month
- Deep Learning with Apache Spark and Deep Cognition☆59Updated 7 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆59Updated 7 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆34Updated 5 years ago
- Managing machine learning life-cycle with MLflow tutorial☆23Updated 2 years ago
- ☆150Updated 7 years ago
- Repository used for Spark Trainings☆53Updated 2 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym…☆61Updated 2 years ago
- Jupyter notebooks for pyspark tutorials given at University☆108Updated 7 months ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella.☆42Updated 2 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 6 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- ☆86Updated 2 years ago
- This is the repository for my YouTube course on End to End Apache Spark on the AIEngineering YouTube channel☆189Updated 4 years ago
- Best practices for engineering ML pipelines.☆35Updated 3 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…☆39Updated 4 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆86Updated 5 years ago
- Course on Udemy by Jose Portilla☆99Updated 7 years ago
- ETL pipeline using pyspark (Spark - Python)☆117Updated 5 years ago
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple …☆30Updated 4 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…☆35Updated 5 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Updated 9 years ago
- Example of orchestrating dependent Databricks jobs using Airflow☆11Updated 5 years ago
- ☆18Updated 7 years ago
- ☆18Updated 3 years ago
- ☆87Updated 2 years ago
- Spark NLP for Streamlit☆15Updated 3 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University☆158Updated 7 months ago
- PySpark Cheatsheet☆36Updated 2 years ago