data-catering / insta-infra
Quickstart for any service
☆133Updated this week
Alternatives and similar repositories for insta-infra:
Users that are interested in insta-infra are comparing it to the libraries listed below
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset☆188Updated last week
- ☆105Updated 4 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆113Updated 4 months ago
- Code for "Efficient Data Processing in Spark" Course☆252Updated 2 months ago
- A curated list of awesome public DBT projects☆98Updated 11 months ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆57Updated last year
- Dagster Labs' open-source data platform, built with Dagster.☆288Updated this week
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆165Updated this week
- Demo Project for Open Source MDS☆164Updated 10 months ago
- Code for dbt tutorial☆144Updated 6 months ago
- The smallest DuckDB SQL orchestrator on Earth.☆180Updated 2 months ago
- Sample project to demonstrate data engineering best practices☆169Updated 9 months ago
- Turning PySpark Into a Universal DataFrame API☆327Updated this week
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆62Updated 2 months ago
- ☆194Updated last month
- Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!☆156Updated this week
- an ephemeral project repo for the DU Dagster project☆56Updated 2 weeks ago
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects☆117Updated last week
- Collection of dbt Tips and Tricks☆371Updated 2 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆244Updated 4 months ago
- ☆139Updated last week
- Repo for CDC with debezium blog post☆27Updated 2 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆173Updated last week
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆197Updated this week
- ☆71Updated last month
- Data pipeline with dbt, Airflow, Great Expectations☆158Updated 3 years ago
- 🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.☆115Updated 2 months ago
- Quick Guides from Dremio on Several topics☆66Updated last month
- A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts☆333Updated last month
- Spark fires is a anti-pattern playground where we deliberately break Spark applications in various ways so you can observe what happens a…☆41Updated 2 weeks ago