Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development
☆21Jul 9, 2019Updated 6 years ago
Alternatives and similar repositories for data-engineering-capstone
Users that are interested in data-engineering-capstone are comparing it to the libraries listed below
- This repository is a working ETL framework which utilizes user data from Spotify API using ➲Python for Extraction and Transformation ➲SQL…☆12Apr 16, 2023Updated 2 years ago
- Data Engineering Capstone☆17Oct 10, 2019Updated 6 years ago
- Udacity Data Engineering Nanodegree Capstone Project☆37May 9, 2020Updated 5 years ago
- Spark, Airflow, Kafka☆24Apr 30, 2023Updated 2 years ago
- This trading strategy deploys the copula model to define the divergence of two correlated assets. The backtesting system is built on backtr…☆22May 31, 2022Updated 3 years ago
- ☆11Nov 30, 2022Updated 3 years ago
- A podcast transcription service built on Azure that transcribes any new episode of your podcast and displays synchronized transcripts alo…☆10Dec 10, 2022Updated 3 years ago
- Different ways to connect to storage in Azure Databricks☆11Jul 19, 2019Updated 6 years ago
- ☆10May 24, 2021Updated 4 years ago
- A simple tool for monitoring the progress of OpenFOAM simulations☆12Nov 9, 2018Updated 7 years ago
- All the Snowflake Virtual Warehouse - Example☆13May 21, 2020Updated 5 years ago
- Chrome Extension for Development/Testing/Exploring GraphQL Servers☆14Oct 1, 2018Updated 7 years ago
- Data Engineering Hours With Experts Coding Challenge☆12Updated this week
- Project based learning for Data Engineering fundamentals.☆13Jan 15, 2021Updated 5 years ago
- A complete media downloader for smule.☆11Mar 18, 2017Updated 9 years ago
- Data Science for Good links.☆14Nov 10, 2021Updated 4 years ago
- Streamlit OpenAI app to chat with custom text documents of all kinds☆13Sep 26, 2024Updated last year
- Code for my blogs on Data Engineering☆15Nov 9, 2020Updated 5 years ago
- using Redis for data science and data engineering☆16Jan 14, 2020Updated 6 years ago
- B19415 - The Definitive Guide to Data Integration☆11Apr 15, 2024Updated last year
- This repo contains all code and data for WWCode Python DE workshop Aug 18 and 25 2022☆25Sep 17, 2022Updated 3 years ago
- Based on our paper "Pneumonia Detection from Lung X-ray Images using Local Search Aided Sine Cosine Algorithm based Deep Feature Selectio…☆11Jun 26, 2022Updated 3 years ago
- Reference and learning notebooks on the use of Spark for ML and analytical applications☆11Mar 1, 2019Updated 7 years ago
- Deliver Pinpoint Campaigns Driven by Machine Learning on AWS SageMaker☆18Feb 10, 2019Updated 7 years ago
- Udacity Data Engineering Nanodegree Project 3☆12Jul 14, 2019Updated 6 years ago
- ☆30Jun 23, 2022Updated 3 years ago
- Notes and code for Programming Massively Parallel Processors☆13Mar 29, 2025Updated 11 months ago
- This repo contains my projects from the Udacity Data Engineering Nanodegree☆13Apr 26, 2023Updated 2 years ago
- Post-processing tool for OpenFOAM that transforms OpenFOAM fields into a single file, by columns☆17May 11, 2021Updated 4 years ago
- Introduction to Modern Data Analytics Tools Docker, Airbyte, DBT, Apache Superset with Brazilian Ecommerce Data & Applying RFM in DBT☆13Sep 8, 2022Updated 3 years ago
- Curated List of NLP tutorials☆30Feb 27, 2025Updated last year
- COMP9321 Data Services Engineering 2019T1☆10Jun 4, 2019Updated 6 years ago
- CLV prediction with pareto-NBD model☆12Jul 1, 2016Updated 9 years ago
- A reinforcement learning based tennis game - Discrete mathematics approach☆12Nov 6, 2020Updated 5 years ago
- Use the Google Cloud Speech API to transcribe audio files from a podcast.☆20May 17, 2017Updated 8 years ago
- This guide will demonstrate how to deploy a minimal Apache Kafka cluster on Docker and set up producers and consumers using Python. We wi…☆18Nov 15, 2020Updated 5 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Apr 29, 2021Updated 4 years ago
- Abnormal Activity Detection using Deep Learning LRCN is a model that combines CNN and RNN to identify abnormal behavior in videos. With r…☆10Sep 22, 2023Updated 2 years ago
- Data-aRT is a no-code environment built in React and Danfojs for handling data like an artist.☆16Jun 28, 2021Updated 4 years ago