faizeraza / dataengineering-github-data-pipelineline
In this project I have built etl pipline which scraps the trending repository based on month,week and day LIVE extract other related information using API and transform it into star schema and load to postgresql then analyzed it over PowerBI . Deployment repo link:
☆12Updated last year
Alternatives and similar repositories for dataengineering-github-data-pipelineline:
Users that are interested in dataengineering-github-data-pipelineline are comparing it to the libraries listed below
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Updated 3 years ago
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.☆14Updated 2 years ago
- This is the final project that after participated the Data Engineering Zoomcamp☆11Updated 2 years ago
- ☆27Updated last year
- ☆19Updated last year
- Simple ETL pipeline using Python☆25Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆35Updated last year
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit☆19Updated 2 years ago
- ☆16Updated 2 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 2 years ago
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆27Updated last year
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Updated 2 years ago
- Data Engineering Project in GCP☆20Updated 2 years ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆21Updated 2 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆25Updated last year
- End-to-end ELT data engineering project☆21Updated 2 years ago
- Code Repository for my 3rd Data Project.☆14Updated last year
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆18Updated 6 months ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆24Updated 2 years ago
- Capstone Project for the IBM Data Engineering Professional Certification.☆10Updated 3 years ago
- ☆40Updated 8 months ago
- End to end data engineering project☆53Updated 2 years ago
- This project shows how to capture changes from postgres database and stream them into kafka☆36Updated 10 months ago
- This repo will guide you step-by-step method to create star schema dimensional model.☆25Updated 3 years ago
- ☆64Updated this week
- YouTube tutorial project☆102Updated last year
- Near real time ETL to populate a dashboard.☆73Updated 9 months ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆45Updated 5 years ago
- data-warehouse-snowflake-for-data-engineering☆17Updated last year