starburstdata / trinoLinks
Distributed SQL query engine for big data
☆49Updated last week
Alternatives and similar repositories for trino
Users that are interested in trino are comparing it to the libraries listed below
Sorting:
- ACID Data Source for Apache Spark based on Hive ACID☆97Updated 4 years ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- Storage connector for Trino☆112Updated last week
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated last year
- Starburst Enterprise Distribution of Presto☆45Updated 3 years ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆80Updated 3 months ago
- Port of TPC-DS dsdgen to Java☆50Updated 11 months ago
- ☆80Updated 3 months ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆186Updated 2 years ago
- Snowflake Data Source for Apache Spark.☆226Updated last month
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆90Updated 2 months ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆272Updated this week
- Spline agent for Apache Spark☆196Updated this week
- A testing framework for Trino☆26Updated 4 months ago
- A load balancer / proxy / gateway for prestodb☆358Updated last year
- The Internals of Delta Lake☆184Updated 6 months ago
- Avro SerDe for Apache Spark structured APIs.☆235Updated last month
- Cache File System optimized for columnar formats and object stores☆183Updated 2 years ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆344Updated last year
- ☆213Updated this week
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- A library that brings useful functions from various modern database management systems to Apache Spark☆60Updated last year
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 7 months ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆300Updated last year
- Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide)☆226Updated 2 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆124Updated last week
- ☆106Updated 2 years ago
- Framework for running macro benchmarks in a clustered environment☆35Updated 4 months ago
- Kafka Connector for Iceberg tables☆16Updated 2 years ago
- A simple Spark-powered ETL framework that just works 🍺☆182Updated this week