A minimal docker compose setup for experimenting with cloud agnostic Lakehouse Architectures Apache Spark with Hive Metastore + Delta Lake + MinIO
☆34Apr 17, 2024Updated last year
Alternatives and similar repositories for spark-minio-delta-lakehouse-docker
Users that are interested in spark-minio-delta-lakehouse-docker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple project using pyflink, kafka and postgre containerized using Docker☆11Aug 26, 2024Updated last year
- Query Iceberg in Trino, Nessie as Catalog, and use minio to replace AWS S3☆26Aug 7, 2025Updated 8 months ago
- trino + hive + minio with postgres in docker compose☆27Aug 18, 2023Updated 2 years ago
- IceDB S3 Proxy to trick S3 clients into only seeing alive files☆13Dec 24, 2023Updated 2 years ago
- ☆19Jun 12, 2025Updated 10 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Google Cloud Platform solution that provides an event driven process that flattens (unnests) Google Analytics 360 data that has been expo…☆16Sep 9, 2021Updated 4 years ago
- ☆14Feb 3, 2020Updated 6 years ago
- O'Neil et al.'s Star Schema Benchmark: curated code☆20May 19, 2025Updated 10 months ago
- ☆14Mar 25, 2026Updated 2 weeks ago
- Useful generic types for Go☆25Apr 3, 2026Updated last week
- Angular 14 - Role Based Authorization Tutorial with Example☆13Dec 22, 2022Updated 3 years ago
- Apache arrow examples in golang☆15Apr 27, 2021Updated 4 years ago
- Documentation of Hologres☆13Aug 18, 2020Updated 5 years ago
- ☆16Mar 9, 2026Updated last month
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Add accent for Vietnamese. N-Grams + Beam search, LSTM, Transformer, Evolved Transformer☆18Feb 3, 2021Updated 5 years ago
- ☆18Sep 24, 2024Updated last year
- A super-minimal Docker Compose template for Apache Superset.☆18Jan 31, 2024Updated 2 years ago
- High Performance Go Driver for Bytehouse☆14Jun 11, 2025Updated 10 months ago
- Scheduler of events for near real-time systems☆31Aug 21, 2025Updated 7 months ago
- dbt + Trino demo project, using TPC-H sample data☆19Mar 27, 2024Updated 2 years ago
- TAU Vehicle Type Recognition Competition☆19Dec 18, 2019Updated 6 years ago
- universal-datalakehouse-postgres-ingestion-deltastreamer☆11Apr 7, 2024Updated 2 years ago
- Crawlyx is an open-source command-line interface (CLI) based web crawler built using Node.js. It is designed to crawl websites and extrac…☆13Apr 12, 2025Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This repository contains examples for my article published on Medium☆11Oct 29, 2017Updated 8 years ago
- A Golang DuckDB library that doesn't require CGO☆20Jan 24, 2025Updated last year
- streaming data pipeline platform☆30Jan 4, 2026Updated 3 months ago
- Production Grade Nifi & Nifi Registry. Deploy for VM (Virtual Machine) with Terraform + Ansible, Helm & Helmfile for Kubernetes (EKS)☆13Updated this week
- Learning management system created with Sveltekit and Pocketbase☆11Oct 9, 2023Updated 2 years ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆76Sep 2, 2023Updated 2 years ago
- AI-powered code review assistant for GitHub Pull Requests using OpenAI GPT-4 and Claude with automated feedback and analytics dashboard.☆23Dec 13, 2025Updated 3 months ago
- Create Greenplum docker files☆11Aug 8, 2023Updated 2 years ago
- Building a highly scalable Machine Learning System☆27Dec 3, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Training and evaluating phase☆13Apr 1, 2021Updated 5 years ago
- Flink, Presto, Trino TPC-DS benchmark☆16Feb 20, 2023Updated 3 years ago
- This repository provides basic ansible scripts to deploy a kubernetes cluster☆13Jul 17, 2019Updated 6 years ago
- Implement D*Lite and A* Algorithm on Processing environment☆11Apr 7, 2017Updated 9 years ago
- Xiaomi Yi remote on ESP8266☆13Mar 26, 2016Updated 10 years ago
- Multi-agent chatbot with integrated search engine, utilizing a plan-execute-replan approach for complex queries, built with LangGraph and…☆24Mar 20, 2024Updated 2 years ago
- Atlassian notifications on your menu bar. Available on macOS, Windows & Linux.☆35Updated this week