ozkary / data-engineering-mta-turnstileLinks
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
☆28Updated 7 months ago
Alternatives and similar repositories for data-engineering-mta-turnstile
Users that are interested in data-engineering-mta-turnstile are comparing it to the libraries listed below
Sorting:
- ☆18Updated 11 months ago
- Code for my "Efficient Data Processing in SQL" book.☆57Updated 11 months ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆12Updated last year
- Step by step instructions to create a production-ready data pipeline☆54Updated 6 months ago
- A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Doc…☆21Updated 7 months ago
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆30Updated 2 years ago
- Repo for CDC with debezium blog post☆28Updated 10 months ago
- ☆16Updated last year
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from…☆34Updated 5 months ago
- End-to-end data engineer project☆20Updated last year
- ☆16Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆80Updated last year
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆58Updated 2 years ago
- Lightweight, open source, locally-hosted Modern Data Stack☆15Updated 3 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆52Updated 8 months ago
- A Postgres data warehouse for processing synthetic data using IAC principles☆18Updated 2 years ago
- This repo is meant to make it really easy to analyze the interplays between health and social media use.☆44Updated 3 years ago
- Template for Data Engineering and Data Pipeline projects☆112Updated 2 years ago
- Official repository of the Manning book - Fight Fraud with Machine Learning - by Ashish Ranjan Jha☆11Updated last month
- This repo will guide you step-by-step method to create star schema dimensional model.☆25Updated 4 years ago
- This is the code repo for the O'Reilly book "Data Science: The Hard Parts"☆15Updated last year
- This project leverages GCS, Composer, Dataflow, BigQuery, and Looker on Google Cloud Platform (GCP) to build a robust data engineering so…☆25Updated last year
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- A demo of the Mito Streamlit Spreadsheet☆18Updated last year
- duckdb-etl-framework☆12Updated 6 months ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆38Updated last year
- Cost Efficient Data Pipelines with DuckDB☆54Updated 2 months ago
- Challenge Data Engineer☆25Updated 3 years ago
- API for distributing Data Lake Data☆11Updated 3 months ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆30Updated last year