Example for article Running Spark 3 with standalone Hive Metastore 3.0
☆101Jan 31, 2023Updated 3 years ago
Alternatives and similar repositories for hive-metastore-docker
Users that are interested in hive-metastore-docker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store☆17Oct 20, 2022Updated 3 years ago
- ☆25Mar 15, 2024Updated 2 years ago
- Postgresql configured to work as metastore for Hive.☆32Dec 16, 2022Updated 3 years ago
- Apache Hive Metastore as a Standalone server in Docker☆80Aug 22, 2024Updated last year
- ☆270Oct 23, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Hadoop, Hive and PrestoDB for deployment using Docker☆27Oct 21, 2025Updated 7 months ago
- This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics a…☆24Jan 16, 2024Updated 2 years ago
- Docker image for Apache Hive Metastore☆72Apr 18, 2023Updated 3 years ago
- ☆40Jan 14, 2021Updated 5 years ago
- Setup for running Trino with Hive Metastore on Kubernetes☆103Aug 10, 2022Updated 3 years ago
- Spark on Kubernetes using Helm☆33Jun 9, 2020Updated 5 years ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆85Apr 12, 2025Updated last year
- Spark on Kubernetes samples☆20Jun 8, 2021Updated 4 years ago
- Presto and Minio on Docker Infrastructure☆43Jul 11, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆13Oct 4, 2023Updated 2 years ago
- An example showing how to integrate Apache Kafka with Akka Streams and Akka HTTP.☆15Sep 28, 2016Updated 9 years ago
- Cloud-native Trino (prestosql) + Hive + Minio + Superset☆23Nov 29, 2021Updated 4 years ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆84Sep 30, 2024Updated last year
- Minimal example to run Trino, Minio, and Hive standalone metastore on docker☆53Jun 4, 2022Updated 3 years ago
- A docker using the airflow with Hadoop ecosystem (hive, spark, and sqoop)☆12May 2, 2021Updated 5 years ago
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆42Jan 19, 2026Updated 4 months ago
- Iceberg Playground in a Box☆69Apr 8, 2026Updated last month
- A complete data engineering project demonstrating modern data stack practices with Apache Flink, Iceberg, Trino and Superset☆25Sep 29, 2025Updated 8 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆15Feb 15, 2025Updated last year
- dlt-dagster-demo☆14Nov 6, 2023Updated 2 years ago
- Multiple node presto cluster on docker container☆127Jul 8, 2022Updated 3 years ago
- A docker image with a pre-configured Hive Metastore and a Spark ThriftServer☆19Jan 20, 2020Updated 6 years ago
- A dbt package for easily using production data in a development environment.☆51May 21, 2026Updated last week
- POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot)☆12Dec 16, 2022Updated 3 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆31May 21, 2026Updated last week
- A tool to create Airflow RBAC roles with dag-level permissions from cli.☆13Sep 7, 2023Updated 2 years ago
- A playground to experience Gravitino☆77May 15, 2026Updated 2 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- A Table format agnostic data sharing framework☆42Feb 4, 2024Updated 2 years ago
- ☆10Jun 3, 2023Updated 2 years ago
- Operator for managing the Spark clusters on Kubernetes and OpenShift.☆159Nov 18, 2021Updated 4 years ago
- A repository of blogs/videos that presents how Apache Iceberg is being used in Production by various orgs☆20Jul 31, 2023Updated 2 years ago
- Deploy a simple Multi-Node Clickhouse Cluster with docker-compose in minutes.☆17Feb 11, 2022Updated 4 years ago
- how to unit test your PySpark code☆29Mar 26, 2021Updated 5 years ago