anthonywong611 / Batch-ETL-with-AWS-EMR-and-MWAA

Create a data pipeline on AWS to execute batch processing in a Spark cluster provisioned by Amazon EMR. ETL using managed airflow: extracts data from S3, transform data using spark, load transformed data back to S3.
9Updated 3 years ago

Related projects

Alternatives and complementary repositories for Batch-ETL-with-AWS-EMR-and-MWAA