johnny-chivers / sql-for-athena
☆12Updated 2 years ago
Related projects: ⓘ
- Resources for the Glue 101 video series on youtube☆9Updated 3 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆12Updated 3 years ago
- Repo which holds the materials for the EMR Zero To Hero☆27Updated 2 years ago
- code snippet for analytics sessions☆31Updated 2 years ago
- ☆19Updated 5 years ago
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆26Updated 4 years ago
- ☆26Updated 6 months ago
- Serverless ETL and Analytics with AWS Glue, published by Packt☆45Updated 11 months ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆14Updated last year
- AWS Glue tutorial for data developers.☆23Updated 5 years ago
- ☆14Updated 2 years ago
- install external python packages on serverless☆38Updated last year
- ☆10Updated 7 months ago
- Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR☆12Updated last year
- ☆46Updated 5 months ago
- Extract, transform, and load data for analytic processing using AWS Glue☆17Updated 3 years ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆22Updated 2 years ago
- Machine Learning Engineering on AWS, published by Packt☆66Updated 5 months ago
- Build, Test and Deploy ETL solutions using AWS Glue and AWS CDK based CI/CD pipelines☆36Updated last year
- This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS …☆17Updated 2 years ago
- Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projects☆38Updated 3 months ago
- Learn Apache Airflow in easy way☆29Updated 2 years ago
- Learn AWS Automation with boto3, Python, and Lambda Functions, by Packt Publishing☆13Updated last year
- Amazon SageMaker Best Practices, published by Packt☆27Updated last year
- Data Engineering with Scala, published by Packt☆16Updated 7 months ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt☆20Updated last year
- ☆24Updated last year
- The open source version of the Amazon Redshift Getting Started Guide.☆15Updated last year
- Some recipes for data engineering with Python☆22Updated 3 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆13Updated 5 years ago