team-data-science / learning-apache-spark
Repository for Apache Spark course at Team Data Science
☆16 Updated 4 years ago
Alternatives and similar repositories for learning-apache-spark
Users who are interested in learning-apache-spark compare it with the repositories listed below
- ☆88 Updated 2 years ago
- Data Engineering on GCP ☆38 Updated 2 years ago
- Data Engineering with Google Cloud Platform, published by Packt ☆118 Updated last year
- PySpark functions and utilities with examples. Assists ETL process of data modeling ☆104 Updated 4 years ago
- ☆86 Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up ☆56 Updated 4 years ago
- Apache Spark 3 - Structured Streaming Course Material ☆122 Updated 2 years ago
- A Series of Notebooks on how to start with Kafka and Python ☆152 Updated 6 months ago
- Snowflake Data Engineering in Action ☆31 Updated 10 months ago
- PySpark Cheatsheet ☆36 Updated 2 years ago
- Deploy Flask Machine Learning Application on Azure App Services ☆112 Updated 7 months ago
- ☆187 Updated 4 years ago
- Mastering Big Data Analytics with PySpark, Published by Packt ☆161 Updated last year
- Building ETL Pipelines with Python ☆159 Updated last year
- Data Engineering with AWS, 2nd edition - Published by Packt ☆150 Updated last year
- All Data Engineering notebooks from Datacamp course ☆115 Updated 5 years ago
- Code Repository for AWS Certified Big Data Specialty 2019 - In Depth and Hands On!, published by Packt ☆42 Updated last year
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging… ☆92 Updated 6 years ago
- Processing TfL data for bike usage with Google Cloud Platform. ☆45 Updated 3 years ago
- Resources for the free AWS Data Engineering course on youtube ☆101 Updated 4 years ago
- Recohut - Learn data engineering, data science ☆99 Updated 2 years ago
- ☆116 Updated 4 years ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t… ☆30 Updated last year
- ☆139 Updated 6 months ago
- Data Engineering with Spark and Delta Lake ☆103 Updated 2 years ago
- Implementing an end-to-end tweets ETL/Analysis pipeline. ☆57 Updated 2 years ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,… ☆57 Updated 2 years ago
- Content related to Mastering Postgresql along with videos. ☆19 Updated 4 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB. ☆42 Updated last year
- Apache Airflow Best Practices, published by Packt ☆46 Updated 10 months ago