dogukannulu / glue_etl_job_data_catalog_s3Links
Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog
☆11Updated last year
Alternatives and similar repositories for glue_etl_job_data_catalog_s3
Users that are interested in glue_etl_job_data_catalog_s3 are comparing it to the libraries listed below
Sorting:
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆22Updated 2 years ago
- ☆23Updated 2 years ago
- ☆28Updated last year
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆60Updated last year
- YouTube tutorial project☆103Updated last year
- Simple ETL pipeline using Python☆26Updated 2 years ago
- ☆139Updated 2 years ago
- An AWS Data Engineering End-to-End Project (Glue, Lambda, Kinesis, Redshift, QuickSight, Athena, EC2, S3)☆12Updated last year
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆46Updated 5 years ago
- ☆64Updated last week
- ☆150Updated 3 years ago
- ☆40Updated 11 months ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆22Updated 3 years ago
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.☆74Updated 9 months ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆144Updated last year
- ☆197Updated last year
- ☆51Updated last year
- Repository related to Spark SQL and Pyspark using Python3☆38Updated 2 years ago
- ☆12Updated 4 years ago
- data-warehouse-snowflake-for-data-engineering☆17Updated last year
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆22Updated last year
- ☆87Updated 2 years ago
- Cool DE Projects☆28Updated last week
- Ravi Azure ADB ADF Repository☆66Updated 4 months ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆42Updated last year
- ☆19Updated 2 years ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆150Updated 9 months ago
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh…☆14Updated 3 years ago
- An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS ap…☆25Updated 2 years ago
- Data Engineering YouTube Analysis Project by Darshil Parmar☆195Updated last year