RealKinetic / aws-glue-pipeline-example
An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.
β12Updated 4 years ago
Alternatives and similar repositories for aws-glue-pipeline-example:
Users that are interested in aws-glue-pipeline-example are comparing it to the libraries listed below
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3β26Updated 4 years ago
- Snowflake Cookbook, published by Packtβ76Updated 2 years ago
- πComplete End to End ETL Pipeline with Spark, Airflow, & AWSβ43Updated 5 years ago
- β34Updated 2 years ago
- code snippet for analytics sessionsβ33Updated 2 years ago
- Udacity Data Engineer Nano Degree - Project-3 (Data Warehouse)β22Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatioβ¦β53Updated last year
- Unit testing using databricks connectβ30Updated 3 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)β15Updated 6 years ago
- Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projectsβ42Updated last month
- Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflowsβ19Updated 3 years ago
- Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Sparkβ11Updated 6 years ago
- Repository for AWS Glue Workshopβ31Updated 2 years ago
- Serverless ETL and Analytics with AWS Glue, published by Packtβ46Updated last year
- Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and moβ¦β25Updated 3 years ago
- β17Updated 4 years ago
- Data Engineering with AWS Cookbook, published by Packtβ13Updated last month
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.β49Updated last year
- β53Updated 4 years ago
- β117Updated 3 months ago
- β51Updated 9 months ago
- All the Snowflake Virtual Warehouse - Exampleβ11Updated 4 years ago
- β25Updated last year
- Quickstart: Getting Started with Snowpark Pythonβ32Updated 2 years ago
- build dw with dbtβ35Updated 3 months ago
- Azure Data Engineering Cookbook 2nd-edition, published by Packtβ31Updated last year
- Materials for the next courseβ24Updated last year
- Ravi Azure ADB ADF Repositoryβ65Updated this week
- Build, Test and Deploy ETL solutions using AWS Glue and AWS CDK based CI/CD pipelinesβ40Updated 2 years ago