KiranGunturu / lakehouse-formation
☆23Updated 7 months ago
Alternatives and similar repositories for lakehouse-formation:
Users that are interested in lakehouse-formation are comparing it to the libraries listed below
- Sample project to demonstrate data engineering best practices☆186Updated last year
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆121Updated 11 months ago
- Hey this is the repo that has all the queries and data for my video game training series!☆142Updated 2 years ago
- ☆136Updated 2 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆101Updated 4 years ago
- ☆34Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆74Updated 10 months ago
- ☆128Updated 2 months ago
- ☆106Updated 3 years ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆98Updated 8 months ago
- Data Engineering examples for Airflow, Prefect; dbt for BigQuery, Redshift, ClickHouse, Postgres, DuckDB; PySpark for Batch processing; K…☆65Updated 2 months ago
- End to end data engineering project☆54Updated 2 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆260Updated 9 months ago
- ☆151Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 8 months ago
- Project for "Data pipeline design patterns" blog.☆45Updated 8 months ago
- This is a template you can use for your next data engineering portfolio project.☆176Updated 3 years ago
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.☆66Updated 8 months ago
- Introduction to performing Machine Learning on Snowflake☆123Updated 7 months ago
- ☆15Updated last year
- Course notes for the Astronomer Certification DAG Authoring for Apache Airflow☆52Updated last year
- ☆69Updated 3 months ago
- A tutorial for the Great Expectations library.☆71Updated 4 years ago
- Code for "Advanced data transformations in SQL" free live workshop☆78Updated 6 months ago
- My notes of the Data Engineering Zoomcamp by DataTalksClub☆38Updated 2 years ago
- This repo contains commands that data engineers use in day to day work.☆60Updated 2 years ago
- Apartments Data Pipeline using Airflow and Spark.☆20Updated 3 years ago
- an end-to-end data pipeline extracting music listening habits and producing an insightful dashboard☆17Updated last year
- ☆23Updated 3 months ago
- This repo is meant to make it really easy to analyze the interplays between health and social media use.☆43Updated 2 years ago