josephmachado / cost_effective_data_pipelinesLinks
Cost Efficient Data Pipelines with DuckDB
β61Updated 8 months ago
Alternatives and similar repositories for cost_effective_data_pipelines
Users that are interested in cost_effective_data_pipelines are comparing it to the libraries listed below
Sorting:
- Contribute to dlt verified sources π₯β103Updated last month
- A CLI tool to streamline getting started with Apache Airflowβ’ and managing multiple Airflow projectsβ225Updated 9 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python.β51Updated 2 years ago
- β40Updated 10 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Supersetβ55Updated 3 months ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projectsβ91Updated 2 years ago
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidenceβ232Updated last month
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish marketβ58Updated 3 years ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.β27Updated last year
- Demo Project for Open Source MDSβ170Updated 5 months ago
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.devβ38Updated 8 months ago
- Full stack data engineering tools and infrastructure set-upβ57Updated 4 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Supersetβ46Updated last month
- A modern ELT demo using airbyte, dbt, snowflake and dagsterβ28Updated 3 years ago
- β80Updated last year
- Code for my "Efficient Data Processing in SQL" book.β60Updated last year
- Code for dbt tutorialβ167Updated 4 months ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principleβ¦β124Updated 10 months ago
- csv and flat-file sniffer built in Rust.β45Updated 2 years ago
- Repo for orienting dbt users to the Dagster asset frameworkβ56Updated 3 years ago
- New generation opensource data stackβ76Updated 3 years ago
- Data-aware orchestration with dagster, dbt, and airbyteβ31Updated 3 years ago
- A write-audit-publish implementation on a data lake without the JVMβ45Updated last year
- Dagster University coursesβ121Updated this week
- Step-by-step tutorial on building a Kimball dimensional model with dbtβ163Updated last year
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.β10Updated 2 years ago
- Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!β253Updated 3 weeks ago
- β214Updated last year
- Repo for CDC with debezium blog postβ29Updated last year
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.β126Updated last year