To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a music streaming platform, let’s delve into the detailed workflow and benefits of each component.
☆46Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for Iceberg-Dbt-Trino-Hive-modern-open-source-data-stack
Users that are interested in Iceberg-Dbt-Trino-Hive-modern-open-source-data-stack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 3 years ago
- On-premises ELT Pipeline☆32Jul 10, 2025Updated 11 months ago
- Building a Data Pipeline with an Open Source Stack☆59Jun 27, 2025Updated last year
- ☆39Apr 25, 2024Updated 2 years ago
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆41Dec 15, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A DataOps framework for building a lakehouse.☆57Updated this week
- An LLM-powered chatbot with the added context of the dbt knowledge base.☆39Dec 4, 2024Updated last year
- A tampermonkey / greasemonkey tool to download Scridb.com content☆14Mar 30, 2022Updated 4 years ago
- Tooling to build a custom Confluent Platform Kafka Connect container with additional connectors from Confluent Hub.☆15Oct 26, 2020Updated 5 years ago
- Asynchronous file handers for Python's logging☆15Jul 22, 2017Updated 8 years ago
- Data Pipeline with Dagster, dlt, and dbt using UV Python☆25Feb 15, 2025Updated last year
- Playbook to provision a Confluent Cluster☆10Oct 22, 2017Updated 8 years ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,…☆29May 19, 2025Updated last year
- Source Code for "Practical OpenTelemetry: Adopting Open Observability Standards Across Your Organization" by Daniel Gomez Blanco☆22Mar 6, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Glue VSCode devcontainer setup☆14Jan 31, 2023Updated 3 years ago
- ☆21Nov 21, 2023Updated 2 years ago
- ☆10Jul 21, 2022Updated 3 years ago
- Python interface to arules for association rule mining☆11Oct 10, 2023Updated 2 years ago
- Python tool to help export Azure DevOps WIKI into a single PDF☆10May 10, 2020Updated 6 years ago
- Metabase Teradata Driver shipped as 3rd party plugin☆14May 28, 2026Updated last month
- Terraform AWS free tier, EC2/ECR/RDS/EFS/DynamoDB/Lambda/S3. Docker running on EC2, Traefik reverse proxy, Lets Encrypt, dynamic DNS, Zer…☆37Jun 19, 2026Updated last week
- Automated basic infrastructure to intall OKD4 on free ESXi☆13Aug 8, 2020Updated 5 years ago
- repo do Diego☆10Nov 7, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Jun 26, 2023Updated 3 years ago
- Objects and Animals detection with Wifi camera and Yolo☆19Apr 28, 2024Updated 2 years ago
- Ansible for Kubernetes by Examples by Luca Berton☆12Dec 25, 2023Updated 2 years ago
- Example Code to Supplement the Label Studio Blog☆33Jan 6, 2026Updated 5 months ago
- A scribd-downloader that actually works☆25Aug 17, 2017Updated 8 years ago
- This repo demonstrate a comprehensive modern data stack using popular open-source tools.☆37Sep 11, 2023Updated 2 years ago
- ☆15Oct 10, 2025Updated 8 months ago
- ☆14Jul 18, 2024Updated last year
- A process manager written in C++ and Rust.☆13Oct 26, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A Practical Machine Learning Library for Arduino and Other Microcontrollers☆22Sep 13, 2020Updated 5 years ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆69Sep 23, 2023Updated 2 years ago
- Trino On K8S Via Helm & Metastore Workshop Querying Delta Tables☆12Jan 27, 2025Updated last year
- ☆11Nov 21, 2023Updated 2 years ago
- Proyecto de juguete para mostrar cómo realizar el setup de un proyecto de data science☆11Nov 24, 2022Updated 3 years ago
- ☆11Jan 17, 2024Updated 2 years ago
- Intelligent Document Processing with AWS AI/ML, published by Packt☆12Apr 22, 2026Updated 2 months ago