dogukannulu / aws_end_to_end_streaming_pipeline
An AWS Data Engineering End-to-End Project (Glue, Lambda, Kinesis, Redshift, QuickSight, Athena, EC2, S3)
☆12Updated last year
Alternatives and similar repositories for aws_end_to_end_streaming_pipeline:
Users that are interested in aws_end_to_end_streaming_pipeline are comparing it to the libraries listed below
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆11Updated last year
- YouTube tutorial project☆97Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆34Updated last year
- ☆28Updated last year
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆39Updated last year
- ☆129Updated last year
- Data Engineering with AWS, 2nd edition - Published by Packt☆125Updated last year
- ☆40Updated 6 months ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆131Updated last year
- Sample repo for startdataengineering DE 101 free course☆43Updated 6 months ago
- Writes the CSV file to Postgres, read table and modify it. Write more tables to Postgres with Airflow.☆36Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆61Updated 7 months ago
- ☆31Updated last year
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆76Updated 5 months ago
- Building ETL Pipelines with Python☆116Updated 6 months ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆97Updated 8 months ago
- ☆18Updated 2 months ago
- Data Engineering YouTube Analysis Project by Darshil Parmar☆168Updated last year
- Data Engineering with Google Cloud Platform, published by Packt☆111Updated last year
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆57Updated last year
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- Resources for the free AWS Data Engineering course on youtube☆99Updated 3 years ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆16Updated last year
- ☆86Updated 2 years ago
- ☆19Updated last year
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆222Updated last year
- ☆31Updated 2 years ago
- Data Engineering Project with Hadoop HDFS and Kafka☆42Updated last year
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆94Updated last month
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆106Updated 5 months ago