luatnc87 / open-source-modern-data-stack
This repo demonstrates a comprehensive modern data stack built from popular open-source tools.
☆23 Updated last year
Related projects
Alternatives and complementary repositories for open-source-modern-data-stack
- A map transformer which implements the `Stream Maps` capability from Meltano's tap and target SDK: https://sdk.meltano.com/ ☆18 Updated last week
- This repository is a production dbt pipeline example that models the profitability of an e-commerce business. Data is extracted and loaded… ☆21 Updated 4 months ago
- Airbyte connectors (sources & destinations) + Airbyte CDK for JavaScript/TypeScript ☆108 Updated this week
- Open-source, warehouse-first Customer Data Pipeline and Segment-alternative. Collects and routes clickstream data and builds your custome… ☆83 Updated this week
- Boiling Insights - From raw S3 data to charts in seconds ☆12 Updated last week
- Lambda function to serverlessly repartition parquet files in S3 ☆28 Updated this week
- Example Dagster Cloud code for the Hooli Data Engineering organization. ☆76 Updated this week
- Fivetran's Shopify dbt package ☆55 Updated 3 weeks ago
- Where the Meltano team runs Meltano! Get it??? ☆25 Updated last month
- Real-time events for Postgres ☆37 Updated last year
- A Singer tap that wraps Airbyte sources allowing them to be consumed by Singer targets ☆24 Updated 2 months ago
- Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services ☆18 Updated last week
- Demo of orchestrating Airbyte connections with Prefect ☆11 Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset ☆180 Updated this week
- Droughty helps keep your workflow dry ☆63 Updated this week
- Pipeline to scrape data from Linkedin using Airbyte and Airflow ☆19 Updated 2 years ago
- Singer.io tap for extracting data from BigQuery tables ☆16 Updated 3 months ago
- ⚡ valmi.io reverse ETL (data activation) is the open source (OSS) data activation platform to load data from warehouses into Webhooks a… ☆142 Updated 4 months ago
- Artifacts for applications deployable by plural ☆48 Updated last month
- Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL) ☆145 Updated this week
- Have your first meltano project running within 5 minutes - no setup - no install - no boundaries. All inside GitHub Codespaces. (GitHub a… ☆30 Updated last year
- Singer.io tap for extracting data from Stripe. ☆27 Updated 2 months ago
- A guide for leading a data (engineering) team ☆60 Updated 6 months ago
- ☆29 Updated last month
- This repository contains examples of how to use dbt's metric functionality on the jaffle shop dataset ☆28 Updated this week
- the open-source product analytics tool for the modern data stack ☆28 Updated 2 years ago
- Open-source, warehouse-first Customer Data Pipeline and Segment-alternative. Collects and routes clickstream data and builds your custome… ☆61 Updated 3 months ago
- Connect Metabase with Cube.js. ☆4 Updated 2 years ago
- Using DBT for ID Resolution on RudderStack - an open-source, warehouse-first customer data pipeline and Segment alternative. ☆17 Updated 6 months ago
- A simple Data Engineering solution for testing or education purposes. You only need to know SQL and Python to understand this project. Da… ☆24 Updated 2 years ago