To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a music streaming platform, let’s delve into the detailed workflow and benefits of each component.
☆45Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for Iceberg-Dbt-Trino-Hive-modern-open-source-data-stack
Users that are interested in Iceberg-Dbt-Trino-Hive-modern-open-source-data-stack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 2 years ago
- On-premises ELT Pipeline☆31Jul 10, 2025Updated 9 months ago
- ☆13Oct 4, 2023Updated 2 years ago
- DevOpsDays Taipei 2025 Observability Bootcamp - Observability Platform 101☆19Jun 9, 2025Updated 10 months ago
- Building a Data Pipeline with an Open Source Stack☆58Jun 27, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆38Apr 25, 2024Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆83Mar 9, 2026Updated last month
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆40Dec 15, 2025Updated 4 months ago
- A DataOps framework for building a lakehouse.☆56Updated this week
- An LLM-powered chatbot with the added context of the dbt knowledge base.☆39Dec 4, 2024Updated last year
- A tampermonkey / greasemonkey tool to download Scridb.com content☆14Mar 30, 2022Updated 4 years ago
- Build Data Lake using Open Source tools☆126May 27, 2025Updated 10 months ago
- Transporter for integrating OpenLineage with OpenMetadata☆18Sep 10, 2025Updated 7 months ago
- Tooling to build a custom Confluent Platform Kafka Connect container with additional connectors from Confluent Hub.☆15Oct 26, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Asynchronous file handers for Python's logging☆15Jul 22, 2017Updated 8 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Superset☆48Apr 5, 2026Updated last week
- Playbook to provision a Confluent Cluster☆10Oct 22, 2017Updated 8 years ago
- This repo contains DAGs demonstrating a variety of ELT patterns using Airflow along with dbt.☆12Jan 12, 2023Updated 3 years ago
- ☆24Mar 21, 2025Updated last year
- Glue VSCode devcontainer setup☆14Jan 31, 2023Updated 3 years ago
- Python interface to arules for association rule mining☆11Oct 10, 2023Updated 2 years ago
- Objects and Animals detection with Wifi camera and Yolo☆16Apr 28, 2024Updated last year
- ⚠️⚠️⚠️ DEPRECATED☆14Nov 18, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆11Mar 7, 2021Updated 5 years ago
- ☆23Sep 5, 2022Updated 3 years ago
- Guideline to extract table lineage info in OpenLineage format from access history view☆14May 11, 2023Updated 2 years ago
- Terraform AWS free tier, EC2/ECR/RDS/EFS/DynamoDB/Lambda/S3. Docker running on EC2, Traefik reverse proxy, Lets Encrypt, dynamic DNS, Zer…☆38Jun 19, 2024Updated last year
- Automated basic infrastructure to intall OKD4 on free ESXi☆13Aug 8, 2020Updated 5 years ago
- repo do Diego☆10Nov 7, 2023Updated 2 years ago
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Jun 26, 2023Updated 2 years ago
- ☆21Mar 30, 2022Updated 4 years ago
- Ansible for Kubernetes by Examples by Luca Berton☆12Dec 25, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Example Code to Supplement the Label Studio Blog☆33Jan 6, 2026Updated 3 months ago
- Demo Codes will be shared here☆52Nov 19, 2025Updated 5 months ago
- Filter of Pairwise Alignement☆44Jan 31, 2022Updated 4 years ago
- Data Engineering Projects using Mage.ai as orchestrator☆18Jan 20, 2026Updated 2 months ago
- This repo demonstrate a comprehensive modern data stack using popular open-source tools.☆37Sep 11, 2023Updated 2 years ago
- ☆30Dec 4, 2024Updated last year
- How to write test in Golang.☆14Sep 25, 2016Updated 9 years ago