ozkary / data-engineering-mta-turnstile
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
☆28 · Updated 5 months ago
Alternatives and similar repositories for data-engineering-mta-turnstile:
Users who are interested in data-engineering-mta-turnstile are comparing it to the repositories listed below.
- Step by step instructions to create a production-ready data pipeline · ☆50 · Updated 4 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/ · ☆74 · Updated 11 months ago
- Intro to Polars Tutorial · ☆23 · Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book. · ☆56 · Updated 9 months ago
- Repo for CDC with Debezium blog post · ☆28 · Updated 7 months ago
- ☆17 · Updated 9 months ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t… · ☆30 · Updated last year
- This project leverages GCS, Composer, Dataflow, BigQuery, and Looker on Google Cloud Platform (GCP) to build a robust data engineering so… · ☆22 · Updated last year
- ☆44 · Updated this week
- Code for "Advanced data transformations in SQL" free live workshop · ☆79 · Updated this week
- ☆25 · Updated 3 years ago
- This repo is meant to make it really easy to analyze the interplays between health and social media use. · ☆43 · Updated 2 years ago
- Code and materials for Effective Polars book · ☆81 · Updated last year
- Course material for the Data Engineering on AWS course · ☆28 · Updated 8 months ago
- Cost Efficient Data Pipelines with DuckDB · ☆52 · Updated 9 months ago
- Public data and analytics for our open course · ☆32 · Updated last year
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P… · ☆28 · Updated 2 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/ · ☆11 · Updated 11 months ago
- A tutorial for the Great Expectations library. · ☆71 · Updated 4 years ago
- ☆16 · Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset · ☆50 · Updated 6 months ago
- A Postgres data warehouse for processing synthetic data using IAC principles · ☆17 · Updated 2 years ago
- Welcome to the Machine Learning Engineering Repository, a comprehensive collection of resources, code, and insights to guide you through… · ☆21 · Updated 2 months ago
- Syllabus for Artificial Intelligence for Product Innovation Master of Engineering: https://ai.meng.duke.edu/degree · ☆32 · Updated last year
- Data Engineering with Google Cloud Platform, published by Packt · ☆117 · Updated last year
- Full stack data engineering tools and infrastructure set-up · ☆52 · Updated 4 years ago
- Essential PySpark for Scalable Data Analytics, published by Packt · ☆44 · Updated 2 years ago
- This repo guides you step by step through creating a star schema dimensional model. · ☆25 · Updated 3 years ago
- A modern ELT demo using Airbyte, dbt, Snowflake and Dagster · ☆27 · Updated 2 years ago
- The getting started notebook for the DTC Zoomcamp Q&A challenge · ☆29 · Updated last year