starburstdata / starburst-terraform
A complete set of Terraform scripts to deploy Starburst to AWS, GCP and Azure managed Kubernetes services
☆16 Updated 2 years ago
Alternatives and similar repositories for starburst-terraform:
Users who are interested in starburst-terraform are comparing it to the repositories listed below
- A Table format agnostic data sharing framework ☆38 Updated last year
- ☆79 Updated last year
- Spark-Radiant is Apache Spark Performance and Cost Optimizer ☆25 Updated 3 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi… ☆94 Updated 3 weeks ago
- Unity Catalog UI ☆40 Updated 6 months ago
- ☆27 Updated last week
- The Internals of Delta Lake ☆184 Updated 2 months ago
- The Internals of Spark on Kubernetes ☆71 Updated 2 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 Updated last year
- CLI tool to bulk migrate the tables from one catalog to another without a data copy ☆76 Updated last month
- A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs ☆41 Updated 10 months ago
- Snowflake Data Source for Apache Spark. ☆222 Updated 4 months ago
- Trino website ☆50 Updated this week
- ☆24 Updated last year
- ☆28 Updated 3 months ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆231 Updated this week
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a… ☆213 Updated 2 weeks ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in… ☆88 Updated last year
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools ☆14 Updated 2 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring ☆25 Updated 7 months ago
- Magic to help Spark pipelines upgrade ☆34 Updated 6 months ago
- ☆24 Updated last year
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆92 Updated last year
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR ☆18 Updated 7 months ago
- Examples for High Performance Spark ☆15 Updated 5 months ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆88 Updated last year
- A library that brings useful functions from various modern database management systems to Apache Spark ☆58 Updated last year
- Examples and custom spark images for working with the spark-on-k8s operator on AWS ☆27 Updated 4 years ago
- Kafka Connector for Iceberg tables ☆16 Updated last year
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive ☆185 Updated 2 years ago