ROM-mm / spark-setupLinks
Instalador autonomo do Apache Spark para Sistemas linux: based(Debian,RHEL)
☆13Updated last year
Alternatives and similar repositories for spark-setup
Users that are interested in spark-setup are comparing it to the libraries listed below
Sorting:
- ☆74Updated 2 years ago
- ☆33Updated 4 years ago
- ☆41Updated last year
- Projeto de simulação de ingestão, tratamento e analise de dados do Ministério da Cultura☆46Updated last year
- This repository exemplifies a simple ELT process using delta to perform upsert and remove data files that aren't in the latest state of t…☆113Updated 3 years ago
- Exercícios do módulo 1 - Bootcamp EDC - IGTI 2021☆49Updated 2 years ago
- ☆36Updated 3 years ago
- Projeto de construção de datalake do zero☆104Updated last year
- Data Engineering made simple - An opinionated Data Engineering framework☆66Updated last year
- Spark development environment for kubernetes, spark-submit and jupyter notebook☆19Updated 4 years ago
- ☆43Updated 3 years ago
- Big Data Ecosystem Docker☆80Updated 3 years ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Updated 3 years ago
- Repository to place/show my python apps☆20Updated 3 years ago
- Estudos e projetos.☆62Updated 3 years ago
- Criando Lambda Functions para Ingerir Dados de APIs com AWS CDK☆13Updated 4 years ago
- ☆60Updated last year
- Configura containers do Spark (Master, Workers e History Server) + Jupyter☆21Updated last year
- Spyrk-cluster is a data mini-lab, considering the main technologies used these days. It's useful to either understand how to configure a …☆29Updated 4 years ago
- ☆145Updated last year
- Repositório com as demonstrações e dados compartilhadas durante os webinars do Databricks Journey Brasil☆19Updated 3 years ago
- ☆10Updated 4 years ago
- Este é um projeto de exemplo que demonstra um processo de ETL (Extração, Transformação e Carga) de dados usando Python, Polars e AWS Loca…☆15Updated 2 years ago
- ☆17Updated last year
- Estrutura completa para iniciar um projeto de dados com Python, abrangendo ambiente, git, desenvolvimento, testes e documentação.☆115Updated last year
- Repositório do curso de introdução a data pipelines da Alura Online☆34Updated 4 years ago
- ☆16Updated 11 months ago
- Retail data pipeline using Airflow, Dbt, Soda, GCP (GCS and BigQuery) and Metabase☆41Updated last year
- Data Engineering com Apache Spark☆41Updated 4 years ago
- This repo contains all the cheatsheets you need to keep handy, I will add more soon.☆42Updated 3 years ago