osin-vladimir / architect_big_data_solutions_with_sparkLinks
code, labs and lectures for the course
☆47Updated 2 years ago
Alternatives and similar repositories for architect_big_data_solutions_with_spark
Users that are interested in architect_big_data_solutions_with_spark are comparing it to the libraries listed below
Sorting:
- Repository used for Spark Trainings☆53Updated 2 years ago
- My Git Repo for Csv Data☆21Updated 4 years ago
- ☆18Updated 7 years ago
- Because its never late to start taking notes and 'public' it...☆59Updated 3 weeks ago
- Guide for databricks spark certification☆58Updated 4 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆33Updated 5 years ago
- PySpark Cheatsheet☆36Updated 2 years ago
- PySpark Tutorial for Beginners on Google Colab: Hands-On Guide☆16Updated 4 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 5 years ago
- ☆37Updated 3 weeks ago
- ☆87Updated 2 years ago
- MLFlow Spark Summit 2019 Presentation☆67Updated 6 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…☆39Updated 4 years ago
- ☆18Updated 3 years ago
- PySpark-ETL☆23Updated 5 years ago
- Projects submitted as part of working through udacity's data engineering nanodegree.☆9Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- A way for home buyers to know about factors affecting a state☆48Updated 6 years ago
- Data Engineering with Spark and Delta Lake☆101Updated 2 years ago
- ETL pipeline using pyspark (Spark - Python)☆117Updated 5 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆59Updated 6 years ago
- This is part of the Artificial Intelligence live course, hosted by Packtpub. In this repository, you can find information to build your e…☆15Updated 6 years ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Python☆23Updated 5 years ago
- Course on Udemy by Jose Portilla☆99Updated 7 years ago
- ☆40Updated 3 years ago
- Workshop for Spark and Databricks☆54Updated 5 years ago
- ☆86Updated 2 years ago
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple …☆30Updated 4 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago