AdeboyeML / UK_Accident_Traffic_ETL_Pipeline

This is a capstone project that entails building an end-to-end ETL (Extract-Transform-Load) Data pipeline which extracts UK accident and traffic datasets from Amazon S3, clean and transform with Pyspark, transfer it back to S3 and finally load to Amazon Redshift (Distributed Database), from where the data can be queried for ad-hoc analyses.
18Updated 4 years ago

Alternatives and similar repositories for UK_Accident_Traffic_ETL_Pipeline:

Users that are interested in UK_Accident_Traffic_ETL_Pipeline are comparing it to the libraries listed below