lmassaoy / spark-on-k8s
Presenting 3 ways to run Spark over containers, this project is recommended to those who seek to explore Big Data out of a Hadoop Cluster.
☆10Updated 3 years ago
Related projects: ⓘ
- ☆15Updated 5 months ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Updated last year
- ☆23Updated 2 years ago
- ☆22Updated last year
- Data Engineering com Apache Spark☆43Updated 3 years ago
- ☆58Updated 6 months ago
- ☆8Updated last month
- Spark development environment for kubernetes, spark-submit and jupyter notebook☆19Updated 2 years ago
- Instalador autonomo do Apache Spark para Sistemas linux: based(Debian,RHEL)☆13Updated last year
- This is an ETL application on AWS with general open sales and customer data that you can find here: https://github.com/camposvinicius/dat…☆17Updated 2 years ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,…☆25Updated 5 months ago
- ☆20Updated this week
- Docker Apache Airflow☆31Updated 2 years ago
- ☆14Updated this week
- ☆8Updated 2 years ago
- ☆44Updated 2 years ago
- Grafana dashboards and StatsD exporter config for Airflow monitoring☆263Updated 6 months ago
- ☆36Updated last month
- ☆22Updated this week
- Deploy of Airflow 2.0 using ECS Fargate and AWS CDK.☆14Updated 2 years ago
- ☆32Updated 3 years ago
- ☆21Updated 9 months ago
- ☆11Updated this week
- Materials for the next course☆22Updated last year
- ☆42Updated 2 months ago
- Sample Airflow DAGs☆60Updated last year
- ☆38Updated this week
- Delta-Lake, ETL, Spark, Airflow☆42Updated last year
- Airflow Deployment on AWS ECS Fargate Using Cloudformation☆204Updated 2 years ago
- ☆34Updated 2 years ago