mebaysan / Modern-Data-Architecture
Introduction to Modern Data Analytics Tools Docker, Airbyte, DBT, Apache Superset with Brazilian Ecommerce Data & Applying RFM in DBT
☆12Updated 2 years ago
Alternatives and similar repositories for Modern-Data-Architecture:
Users that are interested in Modern-Data-Architecture are comparing it to the libraries listed below
- AWS ETL Pipleine☆25Updated 8 months ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆73Updated last year
- A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation an…☆23Updated last year
- Delta Lake Documentation☆48Updated 7 months ago
- This is a demo streaming project simulating a music streaming service.☆34Updated 5 months ago
- ☆15Updated 11 months ago
- Dockerizing and Consuming an Apache Livy environment☆11Updated 2 years ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆124Updated 2 years ago
- ☆73Updated 3 months ago
- Sample configuration to deploy a modern data platform.☆87Updated 3 years ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆48Updated 2 months ago
- This repo contains datasets used in trainings.☆51Updated this week
- A simple and easy to use Data Quality (DQ) tool built with Python.☆49Updated last year
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆23Updated 10 months ago
- New generation opensource data stack☆65Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆48Updated 3 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆62Updated last year
- Cloned by the `dbt init` task☆60Updated 9 months ago
- The Open-Source Enterprise Data Platform in a single Portal☆230Updated this week
- ☆34Updated 2 years ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆122Updated 6 months ago
- build dw with dbt☆35Updated 3 months ago
- Bu repo udemy spark kursları için oluşturulmuştur.☆37Updated 2 years ago
- ☆14Updated last week
- preparation guide for aws big data / data analytics – specialty exam☆18Updated 4 years ago
- This repo is for generating data from existing dataset to a file or producing dataset rows as message to kafka in a streaming manner.☆22Updated 7 months ago
- Delta Lake examples☆214Updated 3 months ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆43Updated 2 years ago
- 2024 Yılı Temmuz ve Ağustos aylarında gerçekleştirilen Bootcamp kodları☆13Updated 6 months ago
- Apache Kafka Eğitiminin Kodlarıdır. | https://youtu.be/ZphPT3r6fnU☆41Updated 4 years ago