Lewuathe / docker-trino-cluster
Multiple node presto cluster on docker container
☆124Updated 2 years ago
Alternatives and similar repositories for docker-trino-cluster:
Users that are interested in docker-trino-cluster are comparing it to the libraries listed below
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- ☆79Updated last year
- Storage connector for Trino☆101Updated 3 weeks ago
- A load balancer / proxy / gateway for prestodb☆357Updated 5 months ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆215Updated this week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆97Updated last year
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆118Updated last month
- Setup for running Trino with Hive Metastore on Kubernetes☆99Updated 2 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆86Updated 10 months ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine☆291Updated last year
- Spline agent for Apache Spark☆191Updated last week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆86Updated 9 months ago
- A library for Spark DataFrame using MinIO Select API☆97Updated 5 years ago
- A tool to install, configure and manage Trino installations☆27Updated 2 years ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆219Updated 3 weeks ago
- Presto and Minio on Docker Infrastructure☆41Updated 6 years ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆186Updated last year
- Plugin for Presto to allow addition of user functions easily☆116Updated 3 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆176Updated 2 years ago
- A library that provides useful extensions to Apache Spark and PySpark.☆205Updated last month
- Spark SQL index for Parquet tables☆134Updated 3 years ago
- Tutorial on how to setup Trino and Apache Ranger using docker☆41Updated 5 months ago
- REST API for Apache Spark on K8S or YARN☆93Updated this week
- Data ingestion library for Amundsen to build graph and search index☆205Updated 10 months ago
- Cache File System optimized for columnar formats and object stores☆182Updated 2 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆232Updated 2 years ago
- Kinesis Connector for Structured Streaming☆136Updated 6 months ago