gfelot / DEND-Data_Pipeline_AirflowLinks
Learn to build a data pipeline with Airflow to automate wrangling data - An Udacity Data Engineer Nano Degree Project
☆8Updated 5 years ago
Alternatives and similar repositories for DEND-Data_Pipeline_Airflow
Users that are interested in DEND-Data_Pipeline_Airflow are comparing it to the libraries listed below
Sorting:
- My solutions for the Udacity Data Engineering Nanodegree☆34Updated 5 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- Analyzing and calculating key marketing metrics with SQL and Python☆14Updated 6 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 6 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- A repo to track data engineering projects☆13Updated 2 years ago
- Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA mode…☆15Updated 7 years ago
- Python Notes on IPython Notebook files.☆37Updated 4 years ago
- ☆150Updated 7 years ago
- AWS Big Data Certification☆25Updated 6 months ago
- ETL pipeline using pyspark (Spark - Python)☆117Updated 5 years ago
- ☆33Updated last year
- ☆86Updated 2 years ago
- Quick EDA on a data set to determine what segments there are.☆31Updated 6 years ago
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella.☆42Updated 2 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Updated 3 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 6 years ago
- ☆63Updated 6 years ago
- Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution☆69Updated 4 years ago
- Data analysis using numpy, pandas, matplotlib, seaborn, sqlite3, data wrangling☆32Updated 5 years ago
- Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt☆123Updated 2 years ago
- A curated list of repositories for my book Machine Learning Solutions.☆78Updated 7 years ago
- Udacity Data Pipeline Exercises☆15Updated 5 years ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆30Updated last year
- Data engineering interviews Q&A for data community by data community☆63Updated 5 years ago
- [Video]AWS Certified Machine Learning-Specialty (ML-S) Guide☆121Updated 6 months ago
- ☆18Updated 3 years ago
- Jupyter notebooks for pyspark tutorials given at University☆108Updated this week