njanakiev / trino-minio-dockerLinks
Minimal example to run Trino, Minio, and Hive standalone metastore on docker
☆53Updated 3 years ago
Alternatives and similar repositories for trino-minio-docker
Users that are interested in trino-minio-docker are comparing it to the libraries listed below
Sorting:
- ☆267Updated 11 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆100Updated 2 years ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆248Updated 3 weeks ago
- A Table format agnostic data sharing framework☆39Updated last year
- ☆70Updated 9 months ago
- Adapter for dbt that executes dbt pipelines on Apache Flink☆95Updated last year
- Apache Hive Metastore as a Standalone server in Docker☆80Updated last year
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆414Updated 5 months ago
- dbt (data build tool) adapter for the Dremio☆52Updated last month
- Python client for Trino☆397Updated 3 weeks ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆81Updated 5 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆82Updated last year
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆271Updated last year
- Data product portal created by Dataminded☆190Updated this week
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆76Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data.☆259Updated last year
- REST API for Apache Spark on K8S or YARN☆104Updated last month
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆286Updated this week
- ☆323Updated last week
- ☆80Updated 5 months ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆76Updated 4 years ago
- A Micosoft Power BI Custom Connector allowing you to import Trino data into Power BI.☆75Updated 8 months ago
- Enables Python developers to leverage Debezium's CDC capabilities with custom event handlers and seamless integration.☆35Updated 3 weeks ago
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆194Updated last week
- Generate and Visualize Data Lineage from query history☆327Updated 2 years ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆166Updated 3 weeks ago
- dbt-starrocks contains all of the code enabling dbt to work with StarRocks☆45Updated 3 weeks ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆96Updated 3 weeks ago
- The Data Product Descriptor Specification (DPDS) Repository☆80Updated 8 months ago
- Turning PySpark Into a Universal DataFrame API☆432Updated last week