ThiagoPanini / sparksnake
Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for sparksnake
- app-server-migration helps in discovering the changes required to migrate the code from source server to target server and provides effor…☆15Updated 3 weeks ago
- This Guidance helps customers set up an ecommerce website on WordPress.☆10Updated last month
- ☆15Updated 10 months ago
- Build and run Spark Structured Streaming pipelines in Hadoop - project using PySpark.☆12Updated 5 years ago
- ☆28Updated 8 months ago
- ☆10Updated 9 months ago
- This repository contains example patterns for storing large objects with DynamoDB.☆11Updated 5 months ago
- ☆12Updated 4 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆12Updated 4 years ago
- ☆11Updated last year
- ☆13Updated 2 years ago
- Build, Test and Deploy ETL solutions using AWS Glue and AWS CDK based CI/CD pipelines☆39Updated 2 years ago
- ☆21Updated last year
- Sample code that reads Microsoft Excel workbook/CSV File for the details required to create a DMS task CloudFormation template☆13Updated 3 years ago
- This repository provides the resources required for the Amazon Redshift Streaming workshop☆11Updated last year
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆14Updated 5 years ago
- ☆18Updated 7 months ago
- ☆15Updated last year
- ☆21Updated 3 years ago
- In this pattern, data records are ingested and then modified with simple transformations such as field level substitutions and data enric…☆12Updated 6 years ago
- This is a collecton of CDK projects to show how to load data from streaming services into Amazon Redshift.☆12Updated 2 months ago
- Python Projects made for beginner, intermediate and advanced levels. [Portuguese]☆14Updated last year
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆26Updated 4 years ago
- A data engineering personal project for applying some of my skills☆19Updated 3 years ago
- Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows☆19Updated 3 years ago
- Ciência de dados☆12Updated 2 years ago
- ☆12Updated 2 years ago
- Conteúdo das aulas da turma 6 do bootcamp de engenharia de dados da How☆12Updated 3 years ago
- Integrating Amazon API Gateway private endpoints with on-premises networks☆12Updated 3 years ago
- Spy Cli☆16Updated 2 years ago