faizeraza / dataengineering-github-data-pipelineline
In this project I have built etl pipline which scraps the trending repository based on month,week and day LIVE extract other related information using API and transform it into star schema and load to postgresql then analyzed it over PowerBI . Deployment repo link:
☆12Updated last year
Alternatives and similar repositories for dataengineering-github-data-pipelineline:
Users that are interested in dataengineering-github-data-pipelineline are comparing it to the libraries listed below
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 2 years ago
- End-to-end ELT data engineering project☆21Updated 2 years ago
- Simple ETL pipeline using Python☆25Updated last year
- Capstone Project for the IBM Data Engineering Professional Certification.☆10Updated 3 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆25Updated last year
- ☆27Updated last year
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Updated 2 years ago
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆16Updated 2 years ago
- End to end data engineering project☆53Updated 2 years ago
- ☆19Updated last year
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Updated 3 years ago
- Code Repository for my 3rd Data Project.☆14Updated last year
- This is the final project that after participated the Data Engineering Zoomcamp☆11Updated 2 years ago
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh…☆12Updated 2 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆35Updated last year
- This repo will guide you step-by-step method to create star schema dimensional model.☆25Updated 3 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆18Updated 6 months ago
- Data Engineering Project in GCP☆20Updated 2 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆44Updated 5 years ago
- Repository for Data Engineering Interview Series☆29Updated 5 months ago
- data-warehouse-snowflake-for-data-engineering☆17Updated last year
- Fully dockerized Data Warehouse (DWH) using Airflow, dbt, PostgreSQL and dashboard using redash☆24Updated 2 years ago
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.☆14Updated 2 years ago
- Example repo to create end to end tests for data pipeline.☆22Updated 9 months ago
- ☆11Updated 4 years ago
- Code Repository for my 1st Data Project.☆23Updated 2 years ago
- Cool DE Projects☆25Updated last month
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆24Updated 2 years ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆63Updated last year