faizeraza / dataengineering-github-data-pipelineline
In this project I built an ETL pipeline that scrapes trending repositories (monthly, weekly, and daily) live, extracts related information using an API, transforms it into a star schema, loads it into PostgreSQL, and analyzes it in Power BI. Deployment repo link:
☆12 Updated 2 years ago
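The "transform it into star schema" step in the description above could look roughly like the following. This is only a minimal sketch, not the repository's actual code: the record fields (`repo`, `owner`, `language`, `stars`, `period`), the sample rows, and the helper names `build_dimension` and `to_star_schema` are all hypothetical, and the scraping and PostgreSQL loading steps are left out.

```python
# Hypothetical flat records, shaped like rows a scraper plus an API lookup might yield.
RAW_REPOS = [
    {"repo": "alice/etl-demo", "owner": "alice", "language": "Python", "stars": 120, "period": "daily"},
    {"repo": "bob/stream-kit", "owner": "bob", "language": "Go", "stars": 85, "period": "weekly"},
    {"repo": "alice/sql-tools", "owner": "alice", "language": "Python", "stars": 40, "period": "monthly"},
]


def build_dimension(rows, column):
    """Map each distinct value of `column` to a surrogate key, in first-seen order."""
    keys = {}
    for row in rows:
        # setdefault only assigns a new key the first time a value is seen.
        keys.setdefault(row[column], len(keys) + 1)
    return keys


def to_star_schema(rows):
    """Split flat rows into dimension lookups and a fact table that references them."""
    dims = {name: build_dimension(rows, name) for name in ("owner", "language", "period")}
    facts = [
        {
            "repo": row["repo"],
            "owner_id": dims["owner"][row["owner"]],
            "language_id": dims["language"][row["language"]],
            "period_id": dims["period"][row["period"]],
            "stars": row["stars"],  # the measure stored in the fact table
        }
        for row in rows
    ]
    return dims, facts


dims, facts = to_star_schema(RAW_REPOS)
```

Each entry in `dims` would become a dimension table and `facts` the central fact table; repeated values such as the owner `alice` are stored once and referenced by key.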
Alternatives and similar repositories for dataengineering-github-data-pipelineline
Users that are interested in dataengineering-github-data-pipelineline are comparing it to the libraries listed below
- Final project completed after participating in the Data Engineering Zoomcamp ☆11 Updated 3 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme ☆27 Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews ☆170 Updated last month
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro… ☆24 Updated 2 years ago
- ☆161 Updated 3 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR ☆88 Updated 6 years ago
- Sample project to demonstrate data engineering best practices ☆197 Updated last year
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow ☆157 Updated 5 years ago
- ☆29 Updated last year
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit ☆19 Updated 3 years ago
- ☆21 Updated last year
- Capstone Project for the IBM Data Engineering Professional Certification. ☆13 Updated 3 years ago
- Data Engineering Project in GCP ☆21 Updated 2 years ago
- Simple stream processing pipeline ☆110 Updated last year
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more ☆365 Updated last year
- YouTube tutorial project ☆105 Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com ☆165 Updated 2 years ago
- End-to-end data engineering project with Kafka, Airflow, Spark, Postgres and Docker. ☆102 Updated 7 months ago
- With everything I learned from DEZoomcamp from datatalks.club, this project performs batch processing on AWS for the cycling dataset wh… ☆14 Updated 3 years ago
- ☆142 Updated 2 years ago
- Near real-time ETL to populate a dashboard. ☆72 Updated last month
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa… ☆242 Updated 2 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet ☆27 Updated 2 years ago
- End-to-end data engineering project ☆57 Updated 3 years ago
- This data project can be used as a take-home assignment to learn PySpark and Data Engineering. ☆16 Updated 2 years ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,… ☆57 Updated 3 years ago
- Price Crawler - Tracking Price Inflation ☆188 Updated 5 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS ☆49 Updated 6 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing ☆279 Updated last year
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour… ☆18 Updated 2 months ago