atlanhq / presto-metricsLinks
☆1Updated 4 years ago
Alternatives and similar repositories for presto-metrics
Users that are interested in presto-metrics are comparing it to the libraries listed below
Sorting:
- Cloudformation template for deploying Presto on AWS☆13Updated 5 years ago
- Data ingestion library for Amundsen to build graph and search index☆204Updated last year
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 8 months ago
- Multiple node presto cluster on docker container☆124Updated 3 years ago
- A dbt adapter for Decodable☆12Updated 5 months ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated 2 years ago
- Cache File System optimized for columnar formats and object stores☆183Updated 3 years ago
- A load balancer / proxy / gateway for prestodb☆356Updated last year
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆158Updated 2 years ago
- A library for Spark DataFrame using MinIO Select API☆98Updated 5 years ago
- Change Data Capture (CDC) service☆444Updated last year
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- Quark is a data virtualization engine over analytic databases.☆98Updated 8 years ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆344Updated last year
- Big Data Processing Framework - Unified Data API or SQL on Any Storage☆246Updated last month
- ☆62Updated 6 years ago
- Use SQL to build ELT pipelines on a data lakehouse.☆288Updated 3 years ago
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…☆254Updated 3 weeks ago
- ACID Data Source for Apache Spark based on Hive ACID☆97Updated 4 years ago
- Storage connector for Trino☆113Updated last week
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆300Updated last year
- Spline agent for Apache Spark☆196Updated this week
- ☆80Updated 3 months ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆275Updated this week
- Search service library for Amundsen☆54Updated 3 weeks ago
- Iceberg is a table format for large, slow-moving tabular data☆481Updated 2 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Updated 7 months ago
- Multi-hop declarative data pipelines☆117Updated this week
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆260Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆89Updated last year