Example for the article "Running Spark 3 with standalone Hive Metastore 3.0"
☆103 Jan 31, 2023 Updated 3 years ago
Alternatives and similar repositories for hive-metastore-docker
Users who are interested in hive-metastore-docker are comparing it to the repositories listed below
- ☆25 Mar 15, 2024 Updated last year
- PostgreSQL configured to work as a metastore for Hive. ☆32 Dec 16, 2022 Updated 3 years ago
- ☆270 Oct 23, 2024 Updated last year
- Spark and Hive docker containers sharing a common MySQL metastore ☆26 Apr 17, 2020 Updated 5 years ago
- Cloud-native Trino (prestosql) + Hive + Minio + Superset ☆24 Nov 29, 2021 Updated 4 years ago
- Hadoop, Hive and PrestoDB for deployment using Docker ☆27 Oct 21, 2025 Updated 4 months ago
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store ☆17 Oct 20, 2022 Updated 3 years ago
- CLI tool to bulk migrate tables from one catalog to another without a data copy ☆83 Apr 12, 2025 Updated 10 months ago
- This project demonstrates real-time streaming of CDC data from MySQL to Apache Iceberg using Flink SQL Client for faster data analytics a… ☆23 Jan 16, 2024 Updated 2 years ago
- Minimal example to run Trino, Minio, and Hive standalone metastore on docker ☆53 Jun 4, 2022 Updated 3 years ago
- Apache Iceberg Spark S3 examples ☆21 Mar 1, 2024 Updated 2 years ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs ☆10 Aug 21, 2023 Updated 2 years ago
- Iceberg Playground in a Box ☆67 Jun 27, 2025 Updated 8 months ago
- Presto and Minio on Docker Infrastructure ☆43 Jul 11, 2018 Updated 7 years ago
- dlt-dagster-demo ☆13 Nov 6, 2023 Updated 2 years ago
- A Docker setup for running Airflow with the Hadoop ecosystem (Hive, Spark, and Sqoop) ☆13 May 2, 2021 Updated 4 years ago
- Presto Gateway routes queries based on policy. ☆12 Sep 15, 2020 Updated 5 years ago
- ☆13 Oct 4, 2023 Updated 2 years ago
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆86 Sep 30, 2024 Updated last year
- Web UI for Amazon Athena ☆58 Aug 29, 2022 Updated 3 years ago
- A complete data engineering project demonstrating modern data stack practices with Apache Flink, Iceberg, Trino and Superset ☆20 Sep 29, 2025 Updated 5 months ago
- A tool to create Airflow RBAC roles with dag-level permissions from cli. ☆13 Sep 7, 2023 Updated 2 years ago
- Presto & Alluxio Dockers for blazing fast analytics ☆13 Nov 6, 2019 Updated 6 years ago
- Multiple node presto cluster on docker container ☆126 Jul 8, 2022 Updated 3 years ago
- Tools for building, packaging, and OAP public cloud integrations such as AWS EMR, Google Dataproc and K8S. ☆18 Mar 27, 2024 Updated last year
- Demo of DuckDB with the dashboarding tools Evidence, Streamlit, and Rill ☆21 Dec 18, 2023 Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆30 Updated this week
- [易车] Demos for Spark, Flink, HBase, Hive, and Flume that integrate some of Hadoop's native APIs (currently only HDFS and MapReduce); also tests some exception scenarios ☆16 Apr 4, 2019 Updated 6 years ago
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL). ☆20 Feb 10, 2025 Updated last year
- A cli for spinning up and managing Ray clusters for the Daft Query Engine. ☆15 Feb 15, 2025 Updated last year
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier! ☆42 Jan 19, 2026 Updated last month
- Spark on Kubernetes using Helm ☆33 Jun 9, 2020 Updated 5 years ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake ☆302 Feb 23, 2026 Updated 2 weeks ago
- Framework for running macro benchmarks in a clustered environment ☆37 Mar 5, 2025 Updated last year
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics ☆1,430 Updated this week
- ☆41 Jul 4, 2022 Updated 3 years ago
- Spark on Kubernetes samples ☆20 Jun 8, 2021 Updated 4 years ago
- A repository of blogs/videos that presents how Apache Iceberg is being used in Production by various orgs ☆18 Jul 31, 2023 Updated 2 years ago
- A Table format agnostic data sharing framework ☆42 Feb 4, 2024 Updated 2 years ago