faizeraza / dataengineering-github-data-pipelinelineLinks
In this project I have built etl pipline which scraps the trending repository based on month,week and day LIVE extract other related information using API and transform it into star schema and load to postgresql then analyzed it over PowerBI . Deployment repo link:
☆12Updated 2 years ago
Alternatives and similar repositories for dataengineering-github-data-pipelineline
Users that are interested in dataengineering-github-data-pipelineline are comparing it to the libraries listed below
Sorting:
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆24Updated 2 years ago
- End to end data engineering project☆57Updated 2 years ago
- ☆29Updated last year
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆17Updated last month
- Data Engineering Project in GCP☆21Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆164Updated last week
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆155Updated 5 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆28Updated 2 years ago
- Price Crawler - Tracking Price Inflation☆187Updated 5 years ago
- This is the final project that after participated the Data Engineering Zoomcamp☆11Updated 3 years ago
- ☆142Updated 2 years ago
- ☆154Updated 3 years ago
- YouTube tutorial project☆106Updated last year
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆163Updated 2 years ago
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.☆16Updated 2 years ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆357Updated last year
- Capstone Project for the IBM Data Engineering Professional Certification.☆12Updated 3 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆237Updated 2 years ago
- ☆21Updated last year
- Sample project to demonstrate data engineering best practices☆196Updated last year
- ☆18Updated 2 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆25Updated 2 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 3 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆276Updated last year
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆92Updated 6 years ago
- Simple ETL pipeline using Python☆27Updated 2 years ago
- ☆88Updated 3 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Updated 2 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Updated last month
- Contains spark dataframe solutions of leetcode questions☆26Updated 2 years ago