ThiagoPanini / sparksnake
Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR
☆12Updated last year
Alternatives and similar repositories for sparksnake:
Users that are interested in sparksnake are comparing it to the libraries listed below
- Conteúdo das aulas da turma 6 do bootcamp de engenharia de dados da How☆12Updated 3 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆12Updated 4 years ago
- This repository contains example patterns for storing large objects with DynamoDB.☆11Updated 7 months ago
- Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows☆19Updated 3 years ago
- ☆15Updated last year
- This Guidance helps customers set up an ecommerce website on WordPress.☆10Updated 3 months ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Updated last year
- Python Projects made for beginner, intermediate and advanced levels. [Portuguese]☆14Updated 2 years ago
- ☆15Updated 2 years ago
- ☆51Updated 9 months ago
- Learn more about Amazon FSx and get hands-on experience.☆15Updated 4 years ago
- Sample code that reads Microsoft Excel workbook/CSV File for the details required to create a DMS task CloudFormation template☆14Updated 4 years ago
- This project allows you to provision a mac1 EC2 instance with Jenkins and EKS.☆10Updated last year
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆26Updated 4 years ago
- ☆11Updated 11 months ago
- Script para ingestão de dados do Mercado Bitcoin☆11Updated last year
- ☆11Updated 2 years ago
- Data Engineering com Apache Spark☆43Updated 3 years ago
- ☆15Updated 9 months ago
- ☆12Updated 5 years ago
- ☆14Updated last year
- ☆29Updated 10 months ago
- This repository provides a Terraform implementation that deploys an Amazon EKS cluster in a private VPC and deploys Windows and Linux wor…☆9Updated 2 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆49Updated last year
- Terraform modules for provisioning and managing AWS Glue resources☆28Updated 3 weeks ago
- ☆23Updated last year
- ☆22Updated 3 years ago
- Build, Test and Deploy ETL solutions using AWS Glue and AWS CDK based CI/CD pipelines☆40Updated 2 years ago
- Ciência de dados☆12Updated 2 years ago
- Build and run Spark Structured Streaming pipelines in Hadoop - project using PySpark.☆12Updated 5 years ago