Fully dockerized Data Warehouse (DWH) using Airflow, dbt, PostgreSQL and dashboard using redash
☆26Nov 12, 2022Updated 3 years ago
Alternatives and similar repositories for Datawarehouse
Users that are interested in Datawarehouse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆18Aug 14, 2025Updated 9 months ago
- ☆26Jul 25, 2018Updated 7 years ago
- Data warehouse tech stack with PostgreSQL, DBT and Airflow☆20Dec 29, 2025Updated 4 months ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆48Dec 11, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Feb 19, 2024Updated 2 years ago
- The best resources I've found related to data analytics.☆168Mar 4, 2025Updated last year
- Scan and monitor your network effortlessly! Nmap Prometheus Exporter provides insights into network health and security with Prometheus-c…☆15Oct 2, 2023Updated 2 years ago
- Demonstration for crawling Laptop products on Tiki ecomercial website☆12Jan 30, 2021Updated 5 years ago
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh…☆15Jan 4, 2026Updated 4 months ago
- These projects use pandas, matplotlib, numpy, scipy and scikitlearn☆12Jun 12, 2022Updated 3 years ago
- This repo contains a spark standalone cluster on docker for anyone who wants to play with PySpark by submitting their applications.☆37Jun 9, 2023Updated 2 years ago
- Data Engineering Project in GCP☆22Mar 29, 2023Updated 3 years ago
- End-to-end ELT data engineering project☆23Dec 24, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆22Mar 9, 2026Updated 2 months ago
- Code for the Data Engineering Zoomcamp☆20Dec 12, 2022Updated 3 years ago
- ☆20Mar 9, 2026Updated 2 months ago
- ☆15Feb 15, 2023Updated 3 years ago
- ☆30Nov 16, 2023Updated 2 years ago
- Data Engineering Project to Extract and Process Solana Reddit Data☆40Feb 3, 2024Updated 2 years ago
- A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away th…☆25May 8, 2026Updated 2 weeks ago
- A Benchmark Dataset for Multimodal Scientific Fact Checking☆27Sep 17, 2024Updated last year
- Implemented Artificial Bee Colony Algorithm coupled with fuzzy C means Algorithm using OpenCV and Python. • Combined watershed algorit…☆16Feb 25, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- It is a assemble to include all Practice Projects about Big Data Topic, includes Hadoop, Spark, Spark Streaming and Kafka☆11Mar 7, 2019Updated 7 years ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆33Nov 9, 2023Updated 2 years ago
- Highly efficient GLCM/X-GLCM feature extractor for python.☆20Aug 8, 2017Updated 8 years ago
- TTS utility☆12Aug 2, 2020Updated 5 years ago
- A project portfolio to accompany my resume☆30Sep 5, 2023Updated 2 years ago
- Spark Notebook docker image☆10Dec 29, 2017Updated 8 years ago
- Multimodal sentiment analysis☆26Jul 17, 2023Updated 2 years ago
- used Airflow, Postgres, Kafka, Spark, and Cassandra, and GitHub Actions to establish an end-to-end data pipeline☆31Oct 25, 2023Updated 2 years ago
- A small set of sources and tools for the Gameboy Development Kit by Michael Hope☆13Aug 7, 2013Updated 12 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 3 years ago
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆12Nov 18, 2023Updated 2 years ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆65Jul 21, 2023Updated 2 years ago
- ☆22Jul 28, 2021Updated 4 years ago
- Fan made game finishing Stefan Butler's Bandersnatch game, as seen in Black Mirror CYOA film.☆10Apr 16, 2019Updated 7 years ago
- Modern Data Engineering Project☆12Jun 3, 2022Updated 3 years ago
- ☆13Mar 14, 2023Updated 3 years ago