olxbr / dumping-machine
☆14Updated this week
Related projects:
- ☆15Updated 5 months ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Updated last year
- ☆23Updated 2 years ago
- ☆22Updated last year
- Deploy of Airflow 2.0 using ECS Fargate and AWS CDK.☆14Updated 2 years ago
- This is an ETL application on AWS with general open sales and customer data that you can find here: https://github.com/camposvinicius/dat…☆17Updated 2 years ago
- ☆22Updated this week
- Data Engineering with Apache Spark☆43Updated 3 years ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta Lake, Postgres + Debezium CDC, MySQL,…☆25Updated 5 months ago
- ☆36Updated last month
- ☆20Updated 3 years ago
- ☆21Updated 9 months ago
- Code for Spark workshops with a Docker development environment☆27Updated 2 years ago
- ☆32Updated 3 years ago
- Exploratory analysis of São Paulo subway data☆19Updated 5 years ago
- ☆8Updated 2 years ago
- Spark development environment for Kubernetes, spark-submit, and Jupyter Notebook☆19Updated 2 years ago
- A simple command line tool for fake dataset generation given a specification defined as a JSON DSL☆22Updated last year
- Standalone Apache Spark installer for Linux systems (Debian- and RHEL-based)☆13Updated last year
- A data engineering personal project for applying some of my skills☆19Updated 3 years ago
- Presenting 3 ways to run Spark over containers, this project is recommended to those who seek to explore Big Data out of a Hadoop Cluster…☆10Updated 3 years ago
- ☆58Updated 6 months ago
- Spark env to Glue development☆9Updated 3 years ago
- ☆20Updated this week
- ☆34Updated 2 years ago
- ☆13Updated last year
- ☆44Updated 2 years ago
- Data Engineering made simple - An opinionated Data Engineering framework☆63Updated 6 months ago
- Repository dedicated to a Data Lakehouse workshop with Delta Lake☆18Updated 2 years ago
- Challenge for Data Engineers - VAGAS.com☆23Updated 5 years ago