tomaztk / Spark-for-data-engineers
Apache Spark for data engineers
☆56Updated 2 years ago
Alternatives and similar repositories for Spark-for-data-engineers:
Users that are interested in Spark-for-data-engineers are comparing it to the libraries listed below
- Azure Databricks - Advent of 2020 Blogposts☆60Updated 2 years ago
- Data engineering with dbt, published by Packt☆77Updated last year
- ☆87Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 8 months ago
- This repo will guide you step-by-step method to create star schema dimensional model.☆25Updated 3 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆73Updated 10 months ago
- End to end data engineering project☆54Updated 2 years ago
- This repo contains all code and data for WWCode Python DE workshop Aug 18 and 25 2022☆24Updated 2 years ago
- ☆23Updated last year
- A tutorial for the Great Expectations library.☆70Updated 4 years ago
- Code for "Advanced data transformations in SQL" free live workshop☆76Updated 5 months ago
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆28Updated 2 years ago
- Azure Databricks Cookbook, Published by Packt☆59Updated last year
- Course notes for the Astronomer Certification DAG Authoring for Apache Airflow☆52Updated last year
- Sample project to demonstrate data engineering best practices☆185Updated last year
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆30Updated last year
- ☆28Updated last year
- Data Modeling with Snowflake, published by Packt☆65Updated last week
- ☆40Updated 9 months ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆98Updated 8 months ago
- Unit testing using databricks connect☆31Updated 3 years ago
- Apache Airflow Best Practices, published by Packt☆40Updated 5 months ago
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh…☆13Updated 2 years ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆56Updated 2 years ago
- Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution☆66Updated 4 years ago
- ☆181Updated 4 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆38Updated 9 months ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt☆21Updated last year
- ☆84Updated 2 years ago
- Source code for 'Building a Data Warehouse' by Vincent Rainardi☆30Updated 8 years ago