onehouseinc / LakeViewLinks
Monitoring and insights on your data lakehouse tables
☆31Updated this week
Alternatives and similar repositories for LakeView
Users that are interested in LakeView are comparing it to the libraries listed below
Sorting:
- Apache Spark Kubernetes Operator☆206Updated 2 weeks ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆282Updated last week
- a curated list of awesome lakehouse frameworks, applications, etc☆35Updated 6 months ago
- Storage connector for Trino☆114Updated this week
- ☆80Updated 4 months ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- ☆220Updated this week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆80Updated 4 months ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆127Updated last month
- Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!☆233Updated 7 months ago
- Drop-in replacement for Apache Spark UI☆293Updated 2 weeks ago
- A library that provides useful extensions to Apache Spark and PySpark.☆229Updated last month
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆301Updated last year
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆91Updated 3 months ago
- REST API for Apache Spark on K8S or YARN☆99Updated 2 months ago
- Apache Iceberg Documentation Site☆42Updated last year
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆42Updated 11 months ago
- Multi-hop declarative data pipelines☆118Updated last week
- Tutorial on how to setup Trino and Apache Ranger using docker☆41Updated last year
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆329Updated last year
- ☆89Updated this week
- Framework for running macro benchmarks in a clustered environment☆35Updated 5 months ago
- Open Control Plane for Tables in Data Lakehouse☆367Updated this week
- A load balancer / proxy / gateway for prestodb☆356Updated last year
- Helm charts for Trino and Trino Gateway☆176Updated last week
- The Internals of Spark on Kubernetes☆71Updated 3 years ago
- pulsar lakehouse connector☆34Updated 5 months ago
- A simple Spark-powered ETL framework that just works 🍺☆182Updated 3 weeks ago
- Spline agent for Apache Spark☆196Updated last week
- An Extensible Data Skipping Framework☆47Updated last month