Resources and projects from Udacity Data Engineering with AWS nano degree programme
☆29Apr 12, 2023Updated 3 years ago
Alternatives and similar repositories for Data-Engineering-With-AWS
Users that are interested in Data-Engineering-With-AWS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Machine Learning DevOps Engineer Nanodegree☆11Jan 27, 2022Updated 4 years ago
- Design data models, build data warehouses, data lakes & lakehouse, automate data pipelines - SQL | NoSQL | AWS | Spark | Airflow☆15Aug 19, 2023Updated 2 years ago
- ☆21Oct 21, 2024Updated last year
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 2 years ago
- Simple, easily customizable and powerful database load testing tool. Provides real-time in-browser aggregate stats. Supports MySQL, Postg…☆13Jan 10, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A demo Streamlit dashboard exploring the classic Seattle Weather dataset.☆20Oct 7, 2025Updated 6 months ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 4 years ago
- Data Engineering Project in GCP☆22Mar 29, 2023Updated 3 years ago
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Oct 12, 2022Updated 3 years ago
- This repo is mostly created for pyspark and hive related interview questions.☆63Jan 6, 2026Updated 3 months ago
- Code Repository for my 3rd Data Project.☆16Jun 13, 2023Updated 2 years ago
- This repo contains my projects from the Udacity Data Engineering Nano degree☆13Apr 26, 2023Updated 3 years ago
- ☆16Jul 15, 2023Updated 2 years ago
- Talks from the UW Python for Geosciences Seminar☆12Mar 1, 2016Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆77Sep 2, 2023Updated 2 years ago
- Artifacts required for running the Labs included in Denodo Tutorials, Training courses, etc.☆19Apr 8, 2026Updated 3 weeks ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Dec 20, 2022Updated 3 years ago
- Simple ETL pipeline using Python☆29May 22, 2023Updated 2 years ago
- Docktor is a Web App that deploys an easy-to-use kit of analysis and scanning tools.☆13Nov 1, 2023Updated 2 years ago
- this repository contains opportunities for you to apply to more than 140 product base companies(NOT JUST FAANG ) & good start-up's☆17Feb 26, 2023Updated 3 years ago
- Final Project for Data Engineering Zoomcamp Course 2024 🧙🔥☆11Apr 17, 2024Updated 2 years ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10May 24, 2021Updated 4 years ago
- Code Repository for my 1st Data Project.☆25Mar 31, 2023Updated 3 years ago
- ☆30Nov 16, 2023Updated 2 years ago
- End to end data pipeline to extract and analyze submissions from any subreddit using Pushshift, python, dbt and BigQuery.☆12Jul 17, 2023Updated 2 years ago
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆11Sep 4, 2025Updated 7 months ago
- Master's thesis on Big Data☆36Aug 14, 2022Updated 3 years ago
- ☆23Sep 25, 2024Updated last year
- Data Engineering Project to Extract and Process Solana Reddit Data☆39Feb 3, 2024Updated 2 years ago
- Case Study's from Danny Ma's Serious SQL Course☆19Aug 4, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Deployed an kafka instance in AWS EC2 Instance to streamline the data into Databricks☆10Aug 15, 2023Updated 2 years ago
- ☆11Dec 28, 2020Updated 5 years ago
- ☆14May 1, 2024Updated last year
- ☆11Jul 13, 2020Updated 5 years ago
- AI-powered fraud detection and prevention system using GANs and Random Forest for secure digital transactions.☆38Dec 22, 2024Updated last year
- ☆35Aug 11, 2024Updated last year
- AlvinToh Learning Repository for The Ultimate Hands-On Hadoop - Tame your Big Data!☆10May 23, 2018Updated 7 years ago