atlanhq / presto-metricsLinks

☆1

Alternatives and similar repositories for presto-metrics

Users that are interested in presto-metrics are comparing it to the libraries listed below

Sorting:

atlanhq / presto-on-aws
Cloudformation template for deploying Presto on AWS
☆13Updated 5 years ago
amundsen-io / amundsendatabuilder
Data ingestion library for Amundsen to build graph and search index
☆204Updated last year
linkedin / iceberg
A temporary home for LinkedIn's changes to Apache Iceberg (incubating)
☆61Updated 8 months ago
Lewuathe / docker-trino-cluster
Multiple node presto cluster on docker container
☆124Updated 3 years ago
decodableco / dbt-decodable
A dbt adapter for Decodable
☆12Updated 5 months ago
Wikia / discreETLy
ETLy is an add-on dashboard service on top of Apache Airflow.
☆69Updated 2 years ago
qubole / rubix
Cache File System optimized for columnar formats and object stores
☆183Updated 3 years ago
lyft / presto-gateway
A load balancer / proxy / gateway for prestodb
☆356Updated last year
intuit / superglue
Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …
☆158Updated 2 years ago
minio / spark-select
A library for Spark DataFrame using MinIO Select API
☆98Updated 5 years ago
airbnb / SpinalTap
Change Data Capture (CDC) service
☆444Updated last year
varadaio / presto-workload-analyzer
The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them
☆135Updated last year
qubole / quark
Quark is a data virtualization engine over analytic databases.
☆98Updated 8 years ago
datamechanics / delight
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
☆344Updated last year
paypal / gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
☆246Updated last month
semantalytics / awesome-druid
☆62Updated 6 years ago
cuebook / cuelake
Use SQL to build ELT pipelines on a data lakehouse.
☆288Updated 3 years ago
airbnb / omniduct
A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…
☆254Updated 3 weeks ago
qubole / spark-acid
ACID Data Source for Apache Spark based on Hive ACID
☆97Updated 4 years ago
snowlift / trino-storage
Storage connector for Trino
☆113Updated last week
linkedin / transport
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…
☆300Updated last year
AbsaOSS / spline-spark-agent
Spline agent for Apache Spark
☆196Updated this week
getindata / kafka-connect-iceberg-sink
☆80Updated 3 months ago
memiiso / debezium-server-iceberg
Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake
☆275Updated this week
amundsen-io / amundsensearchlibrary
Search service library for Amundsen
☆54Updated 3 weeks ago
Netflix / iceberg
Iceberg is a table format for large, slow-moving tabular data
☆481Updated 2 years ago
SaurabhChawla100 / spark-radiant
Spark-Radiant is Apache Spark Performance and Cost Optimizer
☆25Updated 7 months ago
linkedin / Hoptimator
Multi-hop declarative data pipelines
☆117Updated this week
etsy / boundary-layer
Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform
☆260Updated 2 years ago
ExpediaGroup / circus-train
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
☆89Updated last year