Resources and projects from Udacity Data Engineering with AWS nano degree programme
☆29Apr 12, 2023Updated 3 years ago
Alternatives and similar repositories for Data-Engineering-With-AWS
Users that are interested in Data-Engineering-With-AWS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Machine Learning DevOps Engineer Nanodegree☆11Jan 27, 2022Updated 4 years ago
- Design data models, build data warehouses, data lakes & lakehouse, automate data pipelines - SQL | NoSQL | AWS | Spark | Airflow☆16Aug 19, 2023Updated 2 years ago
- ☆22Oct 21, 2024Updated last year
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 3 years ago
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Jun 26, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A demo Streamlit dashboard exploring the classic Seattle Weather dataset.☆22Oct 7, 2025Updated 8 months ago
- Data Engineering Project in GCP☆22Mar 29, 2023Updated 3 years ago
- This repo is mostly created for pyspark and hive related interview questions.☆63Jan 6, 2026Updated 5 months ago
- Code Repository for my 3rd Data Project.☆16Jun 13, 2023Updated 2 years ago
- This repo contains my projects from the Udacity Data Engineering Nano degree☆13Apr 26, 2023Updated 3 years ago
- ☆16Jul 15, 2023Updated 2 years ago
- Talks from the UW Python for Geosciences Seminar☆12Mar 1, 2016Updated 10 years ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆78Sep 2, 2023Updated 2 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Live Training Session: Cleaning Data with Pyspark☆17Jun 18, 2020Updated 5 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Dec 20, 2022Updated 3 years ago
- A python library to generate competition brackets☆12Mar 23, 2021Updated 5 years ago
- End-to-End examples that show how to solve business problems using Amazon SageMaker and it's ML/DL algorithm.☆17Jun 12, 2023Updated 2 years ago
- Easily put your django site behind a layer of Basic Authentication, for protecting the staging/testing servers☆13Apr 26, 2022Updated 4 years ago
- API/Data Platform for Ingesting, Storing, and Serving Data through Postgres, and Litestar☆11Apr 25, 2026Updated last month
- The typed graph between your code and whichever warehouse, table format, or query engine you've chosen — typed compiler, branches, replay…☆265Updated this week
- Docktor is a Web App that deploys an easy-to-use kit of analysis and scanning tools.☆14Nov 1, 2023Updated 2 years ago
- Final Project for Data Engineering Zoomcamp Course 2024 🧙🔥☆11Apr 17, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- ☆10May 24, 2021Updated 5 years ago
- Code Repository for my 1st Data Project.☆25Mar 31, 2023Updated 3 years ago
- ☆30Nov 16, 2023Updated 2 years ago
- End to end data pipeline to extract and analyze submissions from any subreddit using Pushshift, python, dbt and BigQuery.☆12Jul 17, 2023Updated 2 years ago
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆11Sep 4, 2025Updated 9 months ago
- Data Engineering Project to Extract and Process Solana Reddit Data☆40Feb 3, 2024Updated 2 years ago
- Case Study's from Danny Ma's Serious SQL Course☆19Aug 4, 2022Updated 3 years ago
- Deployed an kafka instance in AWS EC2 Instance to streamline the data into Databricks☆10Aug 15, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15May 1, 2024Updated 2 years ago
- ☆39Feb 26, 2026Updated 3 months ago
- ☆11Jul 13, 2020Updated 5 years ago
- AI-powered fraud detection and prevention system using GANs and Random Forest for secure digital transactions.☆38Dec 22, 2024Updated last year
- AlvinToh Learning Repository for The Ultimate Hands-On Hadoop - Tame your Big Data!☆10May 23, 2018Updated 8 years ago
- files created in ardan labs golang training☆12Nov 8, 2023Updated 2 years ago
- A fully serverless, event-driven data pipeline that ingests, enriches, validates, and visualizes real-time news data using AWS services. …☆25Aug 10, 2025Updated 9 months ago