faizeraza / dataengineering-github-data-pipelineline
In this project I have built etl pipline which scraps the trending repository based on month,week and day LIVE extract other related information using API and transform it into star schema and load to postgresql then analyzed it over PowerBI . Deployment repo link:
☆12Updated last year
Alternatives and similar repositories for dataengineering-github-data-pipelineline:
Users that are interested in dataengineering-github-data-pipelineline are comparing it to the libraries listed below
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.☆13Updated last year
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆24Updated last year
- ☆19Updated last year
- End to end data engineering project☆53Updated 2 years ago
- Capstone Project for the IBM Data Engineering Professional Certification.☆10Updated 2 years ago
- Data Engineering Project in GCP☆18Updated last year
- This is the final project that after participated the Data Engineering Zoomcamp☆10Updated 2 years ago
- ☆28Updated last year
- Simple ETL pipeline using Python☆25Updated last year
- DataTalks.Club's Data Engineering Zoomcamp Project☆22Updated 2 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆14Updated 3 years ago
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆16Updated 2 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆22Updated 2 years ago
- Business challenge that requires building a data platform for retailer data analytics.☆12Updated last year
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Updated 2 years ago
- Data Engineering Project with Hadoop HDFS and Kafka☆46Updated last year
- ☆17Updated 2 years ago
- Data Engineering & Analysis Project- San Francisco Eviction Data ETL Pipeline An end-to-end batch data pipeline for performing ETL on San…☆8Updated 4 months ago
- Airflow & DBT Cloud Integrated Project Presented at Lagos DBT Community Meetup & DataFestAfrica 23☆13Updated last year
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆10Updated last year
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆18Updated 5 months ago
- End-to-end ELT data engineering project☆20Updated 2 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆34Updated last year
- Repository for Data Engineering Interview Series☆28Updated 3 months ago
- Code Repository for my 3rd Data Project.☆14Updated last year
- Sample project to demonstrate data engineering best practices☆177Updated 11 months ago
- ☆40Updated 7 months ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆26Updated 2 years ago