Spark development environment for kubernetes, spark-submit and jupyter notebook
☆19Nov 30, 2021Updated 4 years ago
Alternatives and similar repositories for spark-dev-env-docker
Users that are interested in spark-dev-env-docker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Nov 16, 2022Updated 3 years ago
- ☆59Mar 3, 2024Updated 2 years ago
- Exercícios do módulo 4 - Bootcamp EDC - IGTI☆10May 5, 2022Updated 3 years ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,…☆28May 19, 2025Updated 10 months ago
- A data engineering personal project for applying some of my skills☆19Jul 11, 2021Updated 4 years ago
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- ☆15Jul 1, 2021Updated 4 years ago
- This repo contains all the cheatsheets you need to keep handy, I will add more soon.☆42Nov 10, 2022Updated 3 years ago
- Shall your HTTP API pass?☆13May 13, 2019Updated 6 years ago
- The official repository of the Akka Typed Essentials course with Scala☆12May 13, 2024Updated last year
- CLI tool to manage Kafka connectors☆10Mar 2, 2024Updated 2 years ago
- Spark in Action, 2nd edition - chapter 15 - Aggregating your data☆12Sep 8, 2022Updated 3 years ago
- Glue VSCode devcontainer setup☆14Jan 31, 2023Updated 3 years ago
- Cloud-native Trino (prestosql) + Hive + Minio + Superset☆24Nov 29, 2021Updated 4 years ago
- Spending One Hundred days on blogging about cloud computing☆14Jul 12, 2022Updated 3 years ago
- Objects and Animals detection with Wifi camera and Yolo☆16Apr 28, 2024Updated last year
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 2 years ago
- Python tool to help export Azure DevOps WIKI into a single PDF☆10May 10, 2020Updated 5 years ago
- Proof of concept of a big data cluster using open source tools☆11Apr 10, 2024Updated last year
- Repository for Microsoft Databricks Training Events - Hosted by BlueGranite☆15Aug 22, 2019Updated 6 years ago
- Automated basic infrastructure to intall OKD4 on free ESXi☆13Aug 8, 2020Updated 5 years ago
- Instalador autonomo do Apache Spark para Sistemas linux: based(Debian,RHEL)☆13Dec 10, 2024Updated last year
- Python library for generating pix codes with CRC16 validation☆19Jul 9, 2025Updated 8 months ago
- Hands-on examples to integrate GX data validation in a data pipeline.☆18Mar 16, 2026Updated last week
- Airflow Examples: code samples for Medium articles☆14Jan 10, 2021Updated 5 years ago
- A framework to implement the saga pattern in Go☆21Oct 31, 2024Updated last year
- ☆16Jun 11, 2020Updated 5 years ago
- ☆18Jun 16, 2024Updated last year
- Projeto Stack de dados OSS☆12Apr 8, 2025Updated 11 months ago
- This repository contains icebreaker examples for migen.☆12Jan 4, 2019Updated 7 years ago
- ☆18Sep 17, 2021Updated 4 years ago
- Deploying a Kubernetes cluster on EC2 Ubuntu 20.04☆16Jun 6, 2022Updated 3 years ago
- Genetic based optimization prior to standard backpropagation, the accompanying medium article can be found here☆15Jan 3, 2020Updated 6 years ago
- An open synthetic population of Sao Paulo Metropolitan region for agent-based transport simulation☆16Jul 6, 2023Updated 2 years ago
- 💾 WIKIBOOKS: SQL Exercises☆21Jul 2, 2019Updated 6 years ago
- Bíblioteca agnóstica para Integração com o gateway de pagamento da Picpay.☆38May 20, 2024Updated last year
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆26Nov 30, 2019Updated 6 years ago
- ☆43Jul 3, 2022Updated 3 years ago
- ☆23Nov 26, 2020Updated 5 years ago