faizeraza / dataengineering-github-data-pipelineline
In this project I have built etl pipline which scraps the trending repository based on month,week and day LIVE extract other related information using API and transform it into star schema and load to postgresql then analyzed it over PowerBI . Deployment repo link:
☆12Updated last year
Alternatives and similar repositories for dataengineering-github-data-pipelineline:
Users that are interested in dataengineering-github-data-pipelineline are comparing it to the libraries listed below
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆24Updated last year
- ☆19Updated last year
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- Data Engineering Project in GCP☆18Updated last year
- ☆17Updated last year
- ☆28Updated last year
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.☆13Updated last year
- DataTalks.Club's Data Engineering Zoomcamp Project☆22Updated 2 years ago
- This is the final project that after participated the Data Engineering Zoomcamp☆10Updated 2 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆14Updated 3 years ago
- Code Repository for my 3rd Data Project.☆14Updated last year
- data-warehouse-snowflake-for-data-engineering☆14Updated last year
- Simple ETL pipeline using Python☆24Updated last year
- End to end data engineering project☆53Updated 2 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆26Updated 2 years ago
- A project portfolio to accompany my resume☆23Updated last year
- Data Engineering Project to Extract and Process Solana Reddit Data☆24Updated 11 months ago
- Airflow & DBT Cloud Integrated Project Presented at Lagos DBT Community Meetup & DataFestAfrica 23☆13Updated last year
- End-to-end ELT data engineering project☆20Updated 2 years ago
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆16Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆97Updated 8 months ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆18Updated 4 months ago
- This repo will guide you step-by-step method to create star schema dimensional model.☆24Updated 3 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆34Updated last year
- Example repo to create end to end tests for data pipeline.☆21Updated 7 months ago
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit☆19Updated 2 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆22Updated 2 years ago
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆10Updated last year
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆43Updated 5 years ago
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆24Updated last year