datasprints / data-engineer-development-plan
☆8Updated 2 years ago
Related projects: ⓘ
- ☆15Updated 5 months ago
- Spark development environment for kubernetes, spark-submit and jupyter notebook☆19Updated 2 years ago
- This is an ETL application on AWS with general open sales and customer data that you can find here: https://github.com/camposvinicius/dat…☆17Updated 2 years ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,…☆25Updated 5 months ago
- ☆58Updated 6 months ago
- ☆23Updated 2 years ago
- Out of the box docker-compose for Apache Ranger☆10Updated last year
- ☆22Updated last year
- Data Engineering com Apache Spark☆43Updated 3 years ago
- Código para workshops Spark com ambiente de desenvolvimento em docker☆27Updated 2 years ago
- ☆42Updated 2 months ago
- ☆32Updated 3 years ago
- ☆14Updated this week
- ☆8Updated last month
- Docker Apache Airflow☆31Updated 2 years ago
- ☆22Updated this week
- Instalador autonomo do Apache Spark para Sistemas linux: based(Debian,RHEL)☆13Updated last year
- Notebooks e dicas sobre Databricks☆18Updated this week
- ☆20Updated 3 years ago
- Great Expectations Airflow operator☆158Updated 2 weeks ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Updated last year
- Presenting 3 ways to run Spark over containers, this project is recommended to those who seek to explore Big Data out of a Hadoop Cluster…☆10Updated 3 years ago
- Repositório dedicado a Workshop de Data Lakehouse com Delta Lake☆18Updated 2 years ago
- ☆20Updated this week
- Demo DAGs that show how to run dbt Core in Airflow using Cosmos