faizeraza / dataengineering-github-data-pipelinelineLinks
In this project I have built etl pipline which scraps the trending repository based on month,week and day LIVE extract other related information using API and transform it into star schema and load to postgresql then analyzed it over PowerBI . Deployment repo link:
☆12Updated 2 years ago
Alternatives and similar repositories for dataengineering-github-data-pipelineline
Users that are interested in dataengineering-github-data-pipelineline are comparing it to the libraries listed below
Sorting:
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆24Updated 2 years ago
- This is the final project that after participated the Data Engineering Zoomcamp☆11Updated 3 years ago
- Data Engineering Project in GCP☆21Updated 2 years ago
- Price Crawler - Tracking Price Inflation☆188Updated 5 years ago
- ☆29Updated last year
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆156Updated 5 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆41Updated last year
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆103Updated 6 months ago
- Sample project to demonstrate data engineering best practices☆198Updated last year
- End to end data engineering project☆57Updated 2 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆27Updated 2 years ago
- ☆18Updated 2 years ago
- Capstone Project for the IBM Data Engineering Professional Certification.☆12Updated 3 years ago
- ☆160Updated 3 years ago
- Simple ETL pipeline using Python☆28Updated 2 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Updated 4 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆279Updated last year
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆282Updated 8 months ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆170Updated last month
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 3 years ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆158Updated last year
- ☆142Updated 2 years ago
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.☆16Updated 2 years ago
- Sample repo for startdataengineering DE 101 free course☆69Updated last year
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆365Updated last year
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆165Updated 2 years ago
- This is a template you can use for your next data engineering portfolio project.☆182Updated 4 years ago
- YouTube tutorial project☆104Updated 2 years ago
- Near real time ETL to populate a dashboard.☆72Updated last month
- Simple stream processing pipeline☆110Updated last year