faizeraza / dataengineering-github-data-pipelinelineLinks
In this project I have built etl pipline which scraps the trending repository based on month,week and day LIVE extract other related information using API and transform it into star schema and load to postgresql then analyzed it over PowerBI . Deployment repo link:
☆12Updated 2 years ago
Alternatives and similar repositories for dataengineering-github-data-pipelineline
Users that are interested in dataengineering-github-data-pipelineline are comparing it to the libraries listed below
Sorting:
- This is the final project that after participated the Data Engineering Zoomcamp☆11Updated 3 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆28Updated 2 years ago
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆25Updated 2 years ago
- ☆30Updated 2 years ago
- End to end data engineering project☆58Updated 3 years ago
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit☆20Updated 3 years ago
- ☆18Updated 3 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆199Updated last month
- ☆163Updated 3 years ago
- ☆18Updated 3 years ago
- Capstone Project for the IBM Data Engineering Professional Certification.☆13Updated 3 years ago
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.☆17Updated 2 years ago
- ☆21Updated 2 years ago
- Code Repository for my 3rd Data Project.☆16Updated 2 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆12Updated 2 years ago
- Sample project to demonstrate data engineering best practices☆202Updated last year
- Simple ETL pipeline using Python☆29Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆164Updated 3 years ago
- Data Engineering Project in GCP☆22Updated 2 years ago
- ☆41Updated 2 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Updated 3 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆160Updated 5 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆244Updated 3 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 3 years ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆57Updated 3 years ago
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆18Updated 5 months ago
- YouTube tutorial project☆107Updated 2 years ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆107Updated 3 weeks ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆373Updated 2 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Updated 4 years ago