ultranet1 / APACHE_AIRFLOW_DATA_PIPELINES
Project Description: A music streaming company wants to introduce more automation and monitoring to their data warehouse ETL pipelines and they have come to the conclusion that the best tool to achieve this is Apache Airflow. As their Data Engineer, I was tasked to create a reusable production-grade data pipeline that incorporates data quality c…
☆14Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for APACHE_AIRFLOW_DATA_PIPELINES
- Use Multiple Linear Regression, Python, Pandas, and Matplotlib to analyze the lifetime value and the key factors of the ‘Telco Customer C…☆10Updated 4 years ago
- This repository houses all the resources, contents, source codes, files, jupyter notebooks etc. related to Natural Language Processing re…☆10Updated 5 years ago
- Monolithic model-view-controller full-stack web application built with Python, Flask, SQL Alchemy, MySQL, Jinja, and Bootstrap. Applicati…☆15Updated last year
- The repository of the book: Deep Learning with Python by Francois Chollet☆16Updated 5 years ago
- Create Interactive Dashboards with Streamlit and Python Coursera☆10Updated 4 years ago
- Automated Machine Learning on AWS, published by Packt☆44Updated 10 months ago
- Amazon Redshift Cookbook, Published by Packt☆15Updated last year
- ETL process which downloads, transforms, and loads Freddie Mac/Fannie Mae mortgage data☆18Updated 6 years ago
- Demonstrating the efficiency of pmdarima’s auto_arima() function compared to implementing a traditional ARIMA model.☆9Updated 3 years ago
- Streamlit example showing Scikit Learn & Pyspark ML over Healthcare data ! Its simple !!☆30Updated 3 years ago
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple …☆28Updated 4 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Updated 8 years ago
- Customer-base segmentation over e-commerce sales data☆24Updated 4 years ago
- The repository for the course in Udemy☆17Updated 5 years ago
- Customer analytics has been one of hottest buzzwords for years. Few years back it was only marketing department’s monopoly carried out wi…☆21Updated 6 years ago
- Public Repo of my machine learning project to predict home prices☆11Updated 4 years ago
- Detecting car parking slot on Open car park space☆13Updated 5 years ago
- This MVP data web app uses the Streamlit framework and Facebook's Prophet forecasting package to generate a dynamic forecast from your ow…☆65Updated 10 months ago
- Building Real Time Data Pipeline using Apache Kafka, Apache Spark, Hadoop, PostgreSQL, Django and Flexomonster on Docker to track status …☆20Updated 3 years ago
- ☆13Updated last year
- Creating a Gradio user interface to predict the sentiment of a tweet☆12Updated 2 years ago
- A few end to end examples that use data-describe☆16Updated last year
- Learning and buiding API using Fast API☆12Updated 3 years ago
- Live stream tweets based on keywords to database using SQLAlchemy. Tweets are assigned a sentiment score and data is presented via stream…☆43Updated 3 years ago
- Automate PowerPoint Slides Creation with Python☆29Updated last month
- A study and comparison of Risk Modeling algorithms (Capstone Project)☆30Updated 6 years ago
- ☆15Updated 3 years ago
- Code and notebooks containing my experiments in data science, EDA, visualization, and machine learning☆27Updated last year
- Using Python to conduct EDA, perform statistical analysis, visualize insights, and present data-driven solutions to Chief Marketing Offic…☆16Updated 3 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆16Updated 5 years ago