lewagon / data-engineering-setupLinks
☆15Updated last month
Alternatives and similar repositories for data-engineering-setup
Users that are interested in data-engineering-setup are comparing it to the libraries listed below
Sorting:
- ☆24Updated 2 years ago
- Curated templates for data analysis / science☆44Updated 2 years ago
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆208Updated last month
- My notes of the Data Engineering Zoomcamp by DataTalksClub☆38Updated 2 years ago
- This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksCl…☆99Updated 10 months ago
- In this repository we store all materials for dlt workshops, courses, etc.☆193Updated this week
- ☆182Updated 4 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆265Updated 11 months ago
- This is project documentation templates derived from CRISP-DM to be used for Data Engineering projects.☆53Updated 3 years ago
- Slow & local data allows you to move fast and deliver business value for the 99.9% of the data challenges.☆246Updated 2 months ago
- Project for "Data pipeline design patterns" blog.☆45Updated 10 months ago
- Code for the Data Engineering Zoomcamp☆47Updated 2 years ago
- Repo for saving cheat sheets☆56Updated last year
- Polars Cookbook, Published by Packt☆330Updated this week
- ☆132Updated 11 months ago
- Django-based course management platform for Zoomcamps☆67Updated last week
- ☆145Updated last year
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆30Updated 2 years ago
- Code for dbt tutorial☆156Updated 3 weeks ago
- ⚙️ Airflow data pipeline with Terraform, GCP BigQuery, dbt, Soda and Looker Studio.☆21Updated last year
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Updated 2 years ago
- I will attempt to create my own spotify wrapped by collecting data from the spotify API, perform transformations and create informative d…☆74Updated 2 years ago
- Introduction to performing Machine Learning on Snowflake☆124Updated 9 months ago
- ☆30Updated 8 months ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Updated 9 months ago
- This repository provides various demos/examples of using Snowpark for Python.☆276Updated last year
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆142Updated 11 months ago
- Deploy a complete data stack in just a couple of minutes.☆14Updated last year
- Sample project to demonstrate data engineering best practices☆194Updated last year
- A Python package extending pandas with helper functions for simpler exploratory data analysis and data wrangling.☆10Updated 4 months ago