mebaysan / Modern-Data-ArchitectureLinks
Introduction to Modern Data Analytics Tools Docker, Airbyte, DBT, Apache Superset with Brazilian Ecommerce Data & Applying RFM in DBT
☆12Updated 2 years ago
Alternatives and similar repositories for Modern-Data-Architecture
Users that are interested in Modern-Data-Architecture are comparing it to the libraries listed below
Sorting:
- ☆14Updated 5 months ago
- This repo contains datasets used in trainings.☆53Updated 5 months ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆62Updated last year
- This repo is for generating data from existing dataset to a file or producing dataset rows as message to kafka in a streaming manner.☆22Updated last year
- Bu repo udemy spark kursları için oluşturulmuştur.☆37Updated 2 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆71Updated last year
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆219Updated last month
- ☆11Updated last year
- Delta Lake Documentation☆49Updated last year
- T-SQL programming examples from basic to advanced. This source code belongs to my Advanced Level T-SQL Programming book I published in 20…☆57Updated 5 years ago
- End-to-end ELT data engineering project☆22Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- Hands-On Kubernetes Webinar Series Materials☆96Updated 2 years ago
- Code for dbt tutorial☆156Updated 3 weeks ago
- Building a Data Pipeline with an Open Source Stack☆55Updated 11 months ago
- Example repo to create end to end tests for data pipeline.☆25Updated last year
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 10 months ago
- ☆80Updated 8 months ago
- Apache Airflow Best Practices, published by Packt☆42Updated 7 months ago
- preparation guide for aws big data / data analytics – specialty exam☆18Updated 4 years ago
- Dockerizing and Consuming an Apache Livy environment☆12Updated 2 years ago
- Docker environment that spins up MongoDB replica set, Spark, and Jupyter Lab. Example code uses PySpark and the MongoDB Spark Connector.☆40Updated 2 years ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆127Updated 2 years ago
- ☆16Updated last year
- ☆1Updated 9 months ago
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆57Updated 2 years ago
- Resources for video demonstrations and blog posts related to DataOps on AWS☆178Updated 3 years ago
- ☆41Updated 11 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆142Updated 11 months ago