nickrvieira / docker-spark-datahackers
☆33Updated 4 years ago
Alternatives and similar repositories for docker-spark-datahackers
Users that are interested in docker-spark-datahackers are comparing it to the libraries listed below
- ☆41Updated last year
- ☆74Updated 2 years ago
- This repository exemplifies a simple ELT process using delta to perform upsert and remove data files that aren't in the latest state of t…☆114Updated 3 years ago
- Data Engineering made simple - An opinionated Data Engineering framework☆66Updated last year
- Standalone Apache Spark installer for Linux systems (Debian- and RHEL-based)☆13Updated last year
- Repository to place/show my python apps☆20Updated 3 years ago
- Exercises for module 1 - Bootcamp EDC - IGTI 2021☆49Updated 2 years ago
- ☆24Updated 2 years ago
- ☆43Updated 3 years ago
- Studies and projects.☆62Updated 4 years ago
- Project for building a data lake from scratch☆105Updated last year
- Project simulating the ingestion, processing, and analysis of data from the Ministério da Cultura (Ministry of Culture)☆46Updated last year
- ☆16Updated last year
- ☆36Updated 3 years ago
- ☆17Updated last year
- Data Engineering with Apache Spark☆41Updated 4 years ago
- Repository for Alura Online's introductory course on data pipelines☆34Updated 4 years ago
- ☆59Updated last year
- Personal roadmap to guide my studies.☆81Updated 3 years ago
- ☆145Updated last year
- ☆23Updated 2 years ago
- This repo contains all the cheatsheets you need to keep handy; more will be added soon.☆42Updated 3 years ago
- Spyrk-cluster is a data mini-lab, considering the main technologies used these days. It's useful to either understand how to configure a …☆29Updated 4 years ago
- Creating Lambda Functions to Ingest Data from APIs with AWS CDK☆13Updated 4 years ago
- Code for Spark workshops with a Docker-based development environment☆27Updated 4 years ago
- The One Billion Row Challenge using Python☆85Updated last year
- Apply for a job at Olist's Data Team: https://olist.gupy.io/☆51Updated 3 years ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Updated 3 years ago
- This repository contains a Python script that pulls market sentiment data from the SentiCrypt API and pushes it to the Stitch Import API.☆34Updated 2 years ago
- Class notes from DIO's Aceleração Dev #4 on Data Engineering, taught by Everis.☆13Updated 4 years ago