InfuseAI / taxi_rides_ny_duckdb
PipeRider dbt workshop for DataTalksClub DE Zoomcamp
☆16Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for taxi_rides_ny_duckdb
- Full stack data engineering tools and infrastructure set-up☆44Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book.☆50Updated 3 months ago
- Cost Efficient Data Pipelines with DuckDB☆46Updated 3 months ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆71Updated last year
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆28Updated 8 months ago
- Code snippets for Data Engineering Design Patterns book☆40Updated this week
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆55Updated last year
- Demo of Streamlit application with Databricks SQL Endpoint☆33Updated 2 years ago
- build dw with dbt☆29Updated last month
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆37Updated 2 weeks ago
- Data-aware orchestration with dagster, dbt, and airbyte☆30Updated last year
- Delta Lake Documentation☆46Updated 5 months ago
- Code for dbt tutorial☆143Updated 5 months ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in …☆21Updated 2 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆24Updated 2 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆72Updated last year
- A write-audit-publish implementation on a data lake without the JVM☆41Updated 3 months ago
- Delta Lake examples☆208Updated last month
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆113Updated 4 months ago
- ☆66Updated last month
- Cloned by the `dbt init` task☆59Updated 6 months ago
- New generation opensource data stack☆61Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆48Updated last year
- ☆15Updated 6 months ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆22Updated 7 months ago
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb☆18Updated last year
- ☆83Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆111Updated 7 months ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆36Updated 4 months ago
- ☆32Updated 6 months ago