faizeraza / dataengineering-github-data-pipelineline
In this project I built an ETL pipeline that scrapes the trending repositories by month, week, and day live, extracts other related information using the API, transforms it into a star schema, loads it into PostgreSQL, and analyzes it in Power BI. Deployment repo link:
☆12 Updated last year
Alternatives and similar repositories for dataengineering-github-data-pipelineline:
Users that are interested in dataengineering-github-data-pipelineline are comparing it to the libraries listed below
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro… ☆21 Updated last year
- ☆19 Updated last year
- This is the final project completed after participating in the Data Engineering Zoomcamp ☆11 Updated 2 years ago
- This data project can be used as a take-home assignment to learn PySpark and Data Engineering. ☆13 Updated 2 years ago
- Data Engineering Project in GCP ☆18 Updated last year
- Data Engineering & Analysis Project- San Francisco Eviction Data ETL Pipeline An end-to-end batch data pipeline for performing ETL on San… ☆8 Updated 5 months ago
- ☆27 Updated last year
- Resources and projects from Udacity Data Engineering with AWS nano degree programme ☆25 Updated last year
- Capstone Project for the IBM Data Engineering Professional Certification. ☆10 Updated 3 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin… ☆15 Updated 3 years ago
- End to end data engineering project ☆53 Updated 2 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project ☆23 Updated 2 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet ☆27 Updated 2 years ago
- Simple ETL pipeline using Python ☆25 Updated last year
- A data engineering project with Airflow, dbt, Terraform, GCP and much more! ☆24 Updated 2 years ago
- data-warehouse-snowflake-for-data-engineering ☆17 Updated last year
- This project introduces PySpark, a powerful open-source framework for distributed data processing. We explore its architecture, component… ☆27 Updated 5 months ago
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour… ☆16 Updated 2 years ago
- ☆16 Updated 2 years ago
- Business challenge that requires building a data platform for retailer data analytics. ☆12 Updated 2 years ago
- A project portfolio to accompany my resume ☆27 Updated last year
- Code Repository for my 3rd Data Project. ☆14 Updated last year
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit ☆19 Updated 2 years ago
- Cool DE Projects ☆25 Updated 3 weeks ago
- Code for "Advanced data transformations in SQL" free live workshop ☆74 Updated 5 months ago
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform ☆14 Updated 2 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke… ☆18 Updated 6 months ago
- End-to-end ELT data engineering project ☆20 Updated 2 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr… ☆35 Updated last year
- ☆151 Updated 2 years ago