dogukannulu / glue_etl_job_data_catalog_s3
Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog
☆11Updated last year
Alternatives and similar repositories for glue_etl_job_data_catalog_s3:
Users that are interested in glue_etl_job_data_catalog_s3 are comparing it to the libraries listed below
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- YouTube tutorial project☆101Updated last year
- ☆28Updated last year
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆60Updated last year
- ☆21Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆36Updated last year
- An AWS Data Engineering End-to-End Project (Glue, Lambda, Kinesis, Redshift, QuickSight, Athena, EC2, S3)☆12Updated last year
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆45Updated 5 years ago
- ☆136Updated 2 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆41Updated last year
- ☆16Updated last year
- ☆40Updated 9 months ago
- ☆23Updated 2 years ago
- Simple ETL pipeline using Python☆26Updated last year
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆91Updated last month
- ☆87Updated 2 years ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog…☆12Updated 2 years ago
- ☆71Updated 7 months ago
- Writes the CSV file to Postgres, read table and modify it. Write more tables to Postgres with Airflow.☆35Updated last year
- Resources for the free AWS Data Engineering course on youtube☆99Updated 3 years ago
- An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS ap…☆25Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆121Updated 11 months ago
- This is an all-in-one repository for Data Engineers, ideal for beginners & interview preparation, which includes Python as the main Progr…☆29Updated 2 years ago
- This project contain build end-to-end e-commerce data from data source into data warehouse and visualization.☆12Updated 7 months ago
- ☆195Updated last year
- ☆64Updated last week
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆25Updated 2 years ago
- Repository related to Spark SQL and Pyspark using Python3☆37Updated 2 years ago
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand☆49Updated last year
- End to end data engineering project☆54Updated 2 years ago