runawayhorse001 / learning-apache-spark
☆18 · Updated 8 years ago
Alternatives and similar repositories for learning-apache-spark
Users who are interested in learning-apache-spark are comparing it to the repositories listed below
- Hey this is the repo that has all the queries and data for my video game training series! ☆154 · Updated 3 years ago
- [DEPRECATED] Demo repository implementing an end-to-end MLOps workflow on Databricks. Project derived from dbx basic python template ☆114 · Updated 2 years ago
- A list of awesome data podcasts ☆382 · Updated 2 years ago
- The data science project used in my Datacamp course Unit Testing for Data Science in Python ☆144 · Updated 2 years ago
- A Data Engineering & Machine Learning Knowledge Hub ☆1,138 · Updated last year
- This is repository of my YouTube Course on End to End Apache Spark in AIEngineering YouTube Channel ☆189 · Updated 4 years ago
- Guide for databricks spark certification ☆58 · Updated 4 years ago
- ☆120 · Updated 3 months ago
- Just starting your DE journey or along the way already?. I will be sharing a short list of DATA-ENGINEERING-CENTRED books that covers the… ☆34 · Updated 3 years ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian ☆223 · Updated 2 years ago
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub. ☆26 · Updated 2 years ago
- Jupyter notebooks for pyspark tutorials given at University ☆110 · Updated 3 months ago
- ☆360 · Updated 2 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,… ☆90 · Updated 3 years ago
- Udacity Data Engineering Nanodegree Program ☆52 · Updated 4 years ago
- 🧱 A collection of supplementary utilities and helper notebooks to perform admin tasks on Databricks ☆56 · Updated 4 months ago
- Code from the book Fighting Churn With Data ☆300 · Updated 3 months ago
- ☆36 · Updated 2 years ago
- Awesome list of resources for analytics engineers ☆29 · Updated 3 years ago
- Because its never late to start taking notes and 'public' it... ☆61 · Updated 5 months ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster ☆479 · Updated last year
- ML Zoomcamp fall 2021 homework and stuff ☆66 · Updated 3 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for … ☆139 · Updated 5 years ago
- ☆191 · Updated 4 years ago
- Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution ☆67 · Updated 5 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling ☆104 · Updated 4 years ago
- Projects done in the Data Engineering Nanodegree by Udacity.com ☆273 · Updated 6 years ago
- Learning paths for data roles ☆143 · Updated last month
- ☆90 · Updated 2 years ago
- LearningApacheSpark ☆248 · Updated last year