gfelot / DEND-Data_Pipeline_Airflow
Learn to build a data pipeline with Airflow to automate wrangling data - An Udacity Data Engineer Nano Degree Project
☆8Updated 5 years ago
Alternatives and similar repositories for DEND-Data_Pipeline_Airflow:
Users that are interested in DEND-Data_Pipeline_Airflow are comparing it to the libraries listed below
- My solutions for the Udacity Data Engineering Nanodegree☆33Updated 5 years ago
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- Source code for 'Building a Data Warehouse' by Vincent Rainardi☆29Updated 7 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Updated 3 years ago
- Analyzing and calculating key marketing metrics with SQL and Python☆14Updated 5 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 5 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 5 years ago
- Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.☆20Updated 5 years ago
- Udacity Data Engineer Nano Degree - Project-3 (Data Warehouse)☆22Updated 5 years ago
- notebooks produced throughout the Udacity's Nanodegree Data Engineering Course☆73Updated 4 years ago
- Data analysis using numpy, pandas, matplotlib, seaborn, sqlite3, data wrangling☆31Updated 5 years ago
- A repo to track data engineering projects☆13Updated 2 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆16Updated 5 years ago
- Data engineering interviews Q&A for data community by data community☆62Updated 4 years ago
- Customer life time analysis (CLV analysis). We are using Gamma-Gamma model to estimate average transaction value for each customer.☆45Updated 6 years ago
- Learn how to auto-ingest streaming data into Snowflake using Snowpipe.☆23Updated 2 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆28Updated 9 months ago
- Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA mode…☆15Updated 6 years ago
- Code repo for Packt course I developed, "Beginning Data Wrangling with Python"☆29Updated 4 years ago
- Predictive Analytics for Busines co-created by Alteryx and Tableau☆13Updated 7 years ago
- ☆29Updated 4 years ago
- ☆19Updated 6 years ago
- Machine Learning Solutions, published by Packt☆16Updated last year
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆25Updated 5 years ago
- AWS Big Data Certification☆25Updated last week
- Apache Spark using SQL☆14Updated 3 years ago
- Python Notes on IPython Notebook files.☆37Updated 4 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago