tiagotxm / yt-spark-no-kubernetes
☆13 · Feb 19, 2025 · Updated 11 months ago
Alternatives and similar repositories for yt-spark-no-kubernetes
Users who are interested in yt-spark-no-kubernetes are comparing it to the repositories listed below
- ☆17 · Apr 2, 2024 · Updated last year
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook. ☆17 · Nov 16, 2022 · Updated 3 years ago
- Building a Data Pipeline with Astro Python SDK, dbt & Apache Airflow ☆10 · Mar 20, 2024 · Updated last year
- ☆36 · Jun 8, 2022 · Updated 3 years ago
- ☆10 · May 5, 2022 · Updated 3 years ago
- This project represents a whole process of Anime data collection, preparation, and delivery as a data app, powered by technologies like P… ☆10 · Oct 4, 2022 · Updated 3 years ago
- A curated list of awesome tools for Amazon EKS 🌊 ☆14 · May 30, 2020 · Updated 5 years ago
- Companion repository for the book 'Delta Lake Up and Running' ☆48 · Apr 5, 2025 · Updated 10 months ago
- My (Python) YouTube channel ☆11 · Apr 15, 2024 · Updated last year
- ☆13 · Feb 20, 2025 · Updated 11 months ago
- Visits sessionization pipeline used for the talk ☆13 · May 28, 2024 · Updated last year
- Airbyte deployment and configuration management tool ☆12 · Feb 5, 2022 · Updated 4 years ago
- ☆15 · Jan 16, 2024 · Updated 2 years ago
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools ☆14 · Jun 18, 2022 · Updated 3 years ago
- ☆19 · Oct 21, 2024 · Updated last year
- ☆17 · Jul 10, 2023 · Updated 2 years ago
- ☆14 · Mar 11, 2023 · Updated 2 years ago
- Run Airflow on Kubernetes. This repository contains scripts to 1) run a multinode Kubernetes cluster on a local machine using KinD, 2) prepa… ☆16 · Apr 12, 2023 · Updated 2 years ago
- ☆59 · Mar 3, 2024 · Updated last year
- End to end data pipeline ☆22 · Apr 13, 2025 · Updated 10 months ago
- Demo application to showcase integration of Trino with Apache superset using Minio and Hive metastore ☆14 · Sep 28, 2022 · Updated 3 years ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario… ☆26 · Updated this week
- ☆20 · Jan 16, 2022 · Updated 4 years ago
- Spark on Kubernetes samples ☆20 · Jun 8, 2021 · Updated 4 years ago
- Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs ☆28 · Feb 9, 2026 · Updated last week
- ☆22 · Feb 7, 2024 · Updated 2 years ago
- An exercise running Kafka, Kafka Connect, PostgreSQL, Superset and AWS S3 ☆21 · Apr 29, 2021 · Updated 4 years ago
- Custom kube-scheduler for binpacking targeting Spark on EKS and other jobs workloads ☆26 · Feb 5, 2026 · Updated last week
- ☆21 · Dec 11, 2021 · Updated 4 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring ☆29 · Aug 18, 2024 · Updated last year
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes. ☆25 · Aug 16, 2018 · Updated 7 years ago
- ☆25 · Mar 15, 2024 · Updated last year
- How to Automate SQL: dbt (data build tool) tutorial on BigQuery with extensive NOTES ☆33 · Oct 3, 2023 · Updated 2 years ago
- ☆32 · Jan 30, 2026 · Updated 2 weeks ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta Lake, Postgres + Debezium CDC, MySQL,… ☆28 · May 19, 2025 · Updated 8 months ago
- Don't Panic. This guide will help you when it feels like the end of the world. ☆30 · Feb 7, 2026 · Updated last week
- Pyspark boilerplate for running prod ready data pipeline ☆29 · Mar 17, 2021 · Updated 4 years ago
- Repository for the talk "Além do Docker101" (Beyond Docker101): best practices for building Cloud Native applications. CODECON 2024. ☆35 · Feb 10, 2026 · Updated last week
- Bigdata on Kubernetes, Published by Packt ☆36 · Oct 1, 2024 · Updated last year