A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
☆140Apr 18, 2020Updated 5 years ago
Alternatives and similar repositories for Skytrax-Data-Warehouse
Users that are interested in Skytrax-Data-Warehouse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A way for home buyers to know about factors affecting a state☆48Mar 2, 2019Updated 7 years ago
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco.☆140Jul 14, 2020Updated 5 years ago
- Airflow ETL for Meetup API☆45Dec 27, 2018Updated 7 years ago
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Jun 26, 2023Updated 2 years ago
- An open source enterprise data warehousing and analysis platform.☆22Nov 8, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Pyspark Spotify ETL☆17Aug 19, 2021Updated 4 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆166Jun 16, 2020Updated 5 years ago
- RedditR for Content Engagement and Recommendation☆18Dec 21, 2017Updated 8 years ago
- Udacity Data Engineering Nanodegree Project 3☆12Jul 14, 2019Updated 6 years ago
- ☆10May 24, 2021Updated 4 years ago
- Beginner data engineering project - batch edition☆571Mar 12, 2026Updated 2 weeks ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆89Nov 22, 2021Updated 4 years ago
- Tough and flexible tools for data analysis, transformation, validation and movement.☆142Jan 26, 2024Updated 2 years ago
- Use AWS Lambda to Pull E-Scooter and E-Bike Location Data, store in S3 & Redshift using Data Vault Data Model, Server to Google Data Stud…☆16Jun 12, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆51Aug 23, 2019Updated 6 years ago
- This checklist aims to be an exhaustive list of all elements you should consider when using Amazon Redshift.☆15Sep 21, 2020Updated 5 years ago
- Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme…☆1,880Aug 26, 2022Updated 3 years ago
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit☆20Aug 5, 2022Updated 3 years ago
- A template for Python projects that need to use a relational database, including tooling for managing schema migrations and testing again…☆13Dec 13, 2024Updated last year
- Example end to end data engineering project.☆1,398Dec 8, 2022Updated 3 years ago
- A platform-agnostic index of Singer.io taps and targets.☆11Jan 29, 2021Updated 5 years ago
- A template for dockerized dbt-Core projects with VS Code Dev Containers.☆21Nov 14, 2022Updated 3 years ago
- Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark☆11May 22, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Runnable e-commerce mini data warehouse based on Python, PostgreSQL & Metabase, template for new projects☆29Mar 31, 2021Updated 4 years ago
- A set of coding challenge for various engineering roles at Isentia☆21Sep 14, 2021Updated 4 years ago
- My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture, that aggrega…☆509Aug 24, 2022Updated 3 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Jul 9, 2019Updated 6 years ago
- pyspark methods to enhance developer productivity 📣 👯 🎉☆685Mar 6, 2025Updated last year
- Different ways to connect to storage in Azure Databricks☆11Jul 19, 2019Updated 6 years ago
- Personal Data Engineering Projects☆1,001Feb 8, 2023Updated 3 years ago
- ☆16Sep 22, 2020Updated 5 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆267Jan 1, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- python-practice☆12Jul 12, 2019Updated 6 years ago
- Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.☆347Jan 12, 2022Updated 4 years ago
- An (abridged) time series of Aave wallet health factors (and associated token counts, prices, liquidation thresholds)☆11Jul 14, 2022Updated 3 years ago
- R package for tracking Covid19 cases in San Francisco☆12Apr 2, 2023Updated 2 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆898May 8, 2022Updated 3 years ago
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da…☆114Sep 21, 2023Updated 2 years ago
- ☆13Jan 7, 2022Updated 4 years ago