Docker image for Apache Hive Metastore
☆73Apr 18, 2023Updated 2 years ago
Alternatives and similar repositories for docker-hive
Users that are interested in docker-hive are comparing it to the libraries listed below
Sorting:
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆103Jan 31, 2023Updated 3 years ago
- A foreign data wrapper for PostgreSQL allowing easy accessing of Apache ORC formatted data files.☆11Sep 21, 2020Updated 5 years ago
- Example implementation of Zeebe workflows using pyzeebe.☆12Jun 1, 2021Updated 4 years ago
- Repository for building Apache Ozone Docker images☆20Feb 10, 2026Updated 3 weeks ago
- ☆1,080Jun 2, 2024Updated last year
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆20Feb 10, 2025Updated last year
- C++ template library for computing a longest common subsequence (diff)☆13Feb 8, 2014Updated 12 years ago
- dbt + Trino demo project, using TPC-H sample data☆19Mar 27, 2024Updated last year
- Apache Airflow CI pipeline☆19Jun 12, 2019Updated 6 years ago
- Helm charts for Trino and Trino Gateway☆193Updated this week
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆69Feb 3, 2021Updated 5 years ago
- ☆24Oct 3, 2023Updated 2 years ago
- Gitbook Repo for Practical Data Pipeline☆25Feb 4, 2022Updated 4 years ago
- ☆25Mar 15, 2024Updated last year
- Plugin offering views, operators, sensors, and more developed at Pandora Media.☆26May 3, 2018Updated 7 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆172Feb 4, 2021Updated 5 years ago
- PySpark data-pipeline testing and CICD☆28Oct 28, 2020Updated 5 years ago
- A tool to install, configure and manage Trino installations☆27Mar 29, 2022Updated 3 years ago
- Monitoring and insights on your data lakehouse tables☆32Updated this week
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆29Nov 4, 2024Updated last year
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆304Oct 30, 2025Updated 4 months ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆136Oct 25, 2023Updated 2 years ago
- Apache Arrow Ballista Python bindings☆41Feb 10, 2024Updated 2 years ago
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆335Sep 29, 2023Updated 2 years ago
- Common utilities for Apache Kafka☆36Aug 7, 2023Updated 2 years ago
- SPBench: A Framework for Benchmarking Stream Processing Applications☆11Dec 16, 2025Updated 2 months ago
- Wiki and snippets in web stack architecture (Especially for Django and AWS)☆11Feb 18, 2019Updated 7 years ago
- A Los Angeles Times analysis of helicopter accident rates☆11Dec 21, 2020Updated 5 years ago
- Token management for redux saga☆11Aug 19, 2020Updated 5 years ago
- Hadoop/Hive/Spark container to perform CI tests☆10Dec 26, 2020Updated 5 years ago
- A very basic app written in Javascript and packaged as a Docker image to be used as a demo when testing clustered deployments in ECS/EKS.☆11Jun 30, 2023Updated 2 years ago
- ☆10May 28, 2025Updated 9 months ago
- ☆36Nov 11, 2022Updated 3 years ago
- Scripts to work with IRS 990 XML data☆10Jan 11, 2019Updated 7 years ago
- A terraform module that deploys Dagster to Azure.☆11May 10, 2021Updated 4 years ago
- Gradle plugin to include build information such as Git commit ID to your JAR. It can be used to show Git commit information with Spring B…☆38Jan 2, 2017Updated 9 years ago
- A tool that makes it easy to run modular Trino environments locally.☆45Dec 4, 2025Updated 3 months ago
- Readme☆12Mar 2, 2023Updated 3 years ago
- An AWS Lambda function that purges EC2 snapshots according to the rules you specify☆12Apr 3, 2018Updated 7 years ago