onehouseinc / lake-loaderLinks
A tool to benchmark L (loading) workloads within ETL workloads
☆24Updated last month
Alternatives and similar repositories for lake-loader
Users that are interested in lake-loader are comparing it to the libraries listed below
Sorting:
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆79Updated 2 months ago
- Multi-hop declarative data pipelines☆117Updated 3 weeks ago
- a curated list of awesome lakehouse frameworks, applications, etc☆33Updated 4 months ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated 10 months ago
- A Table format agnostic data sharing framework☆38Updated last year
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture☆89Updated 2 weeks ago
- ☆90Updated 5 months ago
- Presto Trino with Apache Hive Postgres metastore☆42Updated 9 months ago
- Monitoring and insights on your data lakehouse tables☆30Updated this week
- Docker envinroment to stream data from Kafka to Iceberg tables☆29Updated last year
- Yet Another (Spark) ETL Framework☆21Updated last year
- ☆14Updated last month
- MCP Server for Trino developed via MCP Python SDK☆18Updated 2 months ago
- ☆25Updated last year
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆75Updated 3 years ago
- ☆40Updated 2 years ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆127Updated last month
- A tool that makes it easy to run modular Trino environments locally.☆39Updated last week
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆77Updated this week
- Java implementation for performing operations on Apache Iceberg and Hive tables☆19Updated last month
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 7 months ago
- Use dbt to manage real-time data transformations in RisingWave.☆27Updated this week
- Apache Hive Metastore as a Standalone server in Docker☆79Updated 10 months ago
- ☆80Updated 2 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆99Updated 2 years ago
- ☆58Updated 11 months ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated last year
- Quick Guides from Dremio on Several topics☆71Updated last week
- Iceberg Playground in a Box☆55Updated last week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆269Updated this week