techops-recsys-lateral-hiring / dataengineer-transformations-pythonLinks
☆52Updated 10 months ago
Alternatives and similar repositories for dataengineer-transformations-python
Users that are interested in dataengineer-transformations-python are comparing it to the libraries listed below
Sorting:
- ☆33Updated 4 years ago
- ☆23Updated 4 years ago
- ☆24Updated last year
- Data Engineering com Apache Spark☆42Updated 4 years ago
- ☆71Updated 2 years ago
- Repositório dedicado a Workshop de Data Lakehouse com Delta Lake☆18Updated 3 years ago
- ☆23Updated 2 years ago
- Airflow Deployment on AWS ECS Fargate Using Cloudformation☆204Updated 3 years ago
- Public source code for the Udemy online course Apache Airflow: Complete Hands-On Beginner to Advanced Class.☆63Updated 4 years ago
- Spark development environment for kubernetes, spark-submit and jupyter notebook☆19Updated 3 years ago
- ☆40Updated last year
- Code snippets for Data Engineering Design Patterns book☆191Updated 6 months ago
- ☆17Updated last year
- Portfolio of projects and studies conducted in data engineering.☆34Updated 7 months ago
- ☆61Updated last year
- Airflow training for the crunch conf☆105Updated 6 years ago
- This repository exemplifies a simple ELT process using delta to perform upsert and remove data files that aren't in the latest state of t…☆107Updated 3 years ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Updated 2 years ago
- Resources for video demonstrations and blog posts related to DataOps on AWS☆182Updated 3 years ago
- Docker with Airflow and Spark standalone cluster☆261Updated 2 years ago
- This is an ETL application on AWS with general open sales and customer data that you can find here: https://github.com/camposvinicius/dat…☆18Updated 3 years ago
- Repository to place/show my python apps☆20Updated 3 years ago
- Repositório com as demonstrações e dados compartilhadas durante os webinars do Databricks Journey Brasil☆19Updated 3 years ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆266Updated 2 weeks ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆88Updated 4 years ago
- A data engineering personal project for applying some of my skills☆19Updated 4 years ago
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆136Updated 3 months ago
- Docker Apache Airflow☆31Updated 3 years ago
- Código para workshops Spark com ambiente de desenvolvimento em docker☆27Updated 3 years ago
- ☆179Updated 2 years ago