Stefen-Taime / stream-ingestion-redpanda-minio
In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO, and Apache Spark.
☆11Updated 2 years ago
Alternatives and similar repositories for stream-ingestion-redpanda-minio
Users who are interested in stream-ingestion-redpanda-minio are comparing it to the repositories listed below.
- This project is for demonstrating knowledge of Data Engineering tools and concepts and also learning in the process☆47Updated 2 years ago
- Template for Data Engineering and Data Pipeline projects☆114Updated 2 years ago
- Examples surrounding Databricks.☆60Updated last year
- Creating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Updated 2 years ago
- ☆15Updated 3 years ago
- Collection of Sample Databricks Spark Notebooks (mostly for Azure Databricks)☆92Updated 7 years ago
- 🧱 A collection of supplementary utilities and helper notebooks to perform admin tasks on Databricks☆56Updated 4 months ago
- Unit testing using databricks connect☆32Updated 4 years ago
- Code samples, etc. for Databricks☆71Updated 5 months ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆41Updated last year
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆270Updated last month
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline☆153Updated last year
- This repository will help you learn Databricks concepts with the help of examples. It will include all the important topics which…☆102Updated last month
- Building a Data Pipeline with an Open Source Stack☆54Updated 4 months ago
- build dw with dbt☆47Updated last year
- ☆139Updated 8 months ago
- ☆88Updated 3 years ago
- DBSQL SME Repo contains demos, tutorials, blog code, advanced production helper functions and more!☆74Updated last week
- Sample project to demonstrate data engineering best practices☆198Updated last year
- Data pipeline that scrapes Rust cheater Steam profiles☆54Updated 3 years ago
- Delta-Lake, ETL, Spark, Airflow☆48Updated 3 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆78Updated 2 years ago
- Code for dbt tutorial☆161Updated 2 months ago
- dbt adapter for Azure Synapse Dedicated SQL Pools☆75Updated 2 months ago
- A proof of concept to integrate Python and Microsoft Analysis Services☆80Updated 3 years ago
- Data engineering with dbt, published by Packt☆87Updated 2 months ago
- Delta Lake examples☆230Updated last year
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.☆119Updated last year
- Databricks Platform - Architecture, Security, Automation and much more!!☆51Updated last week
- End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore,…☆46Updated last year