Stefen-Taime / stream-ingestion-redpanda-minio
In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO, and Apache Spark.
☆10Updated last year
Alternatives and similar repositories for stream-ingestion-redpanda-minio:
Users that are interested in stream-ingestion-redpanda-minio are comparing it to the libraries listed below
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆21Updated 2 years ago
- build dw with dbt☆33Updated 2 months ago
- End to end data engineering project☆53Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflow☆45Updated 2 years ago
- This project is for demonstrating knowledge of Data Engineering tools and concepts and also learning in the process☆45Updated 2 years ago
- Unit testing using databricks connect☆30Updated 3 years ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆95Updated 5 months ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆30Updated 10 months ago
- Data engineering with dbt, published by Packt☆66Updated 10 months ago
- ☆28Updated last year
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆18Updated 4 months ago
- Git Repo for EDW Best Practice Assets on the Lakehouse☆15Updated last year
- ☆34Updated last year
- Template for Data Engineering and Data Pipeline projects☆106Updated 2 years ago
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆13Updated last year
- Building a Modern Data Lake with Minio, Spark, Airflow via Docker.☆16Updated 8 months ago
- ☆86Updated 2 years ago
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand☆45Updated last year
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆24Updated last year
- Demonstration of using Files in Repos with Databricks Delta Live Tables☆30Updated 6 months ago
- Ravi Azure ADB ADF Repository☆65Updated last month
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆39Updated last year
- Code for dbt tutorial☆149Updated 7 months ago
- Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped☆32Updated last year
- Examples surrounding Databricks.☆57Updated 6 months ago
- Data pipeline that scrapes Rust cheater Steam profiles☆51Updated 2 years ago
- A course by DataTalks Club that covers Spark, Kafka, Docker, Airflow, Terraform, DBT, Big Query etc☆14Updated 2 years ago
- Delta Lake examples☆214Updated 3 months ago