onehouseinc / lake-loader
A tool to benchmark L (loading) workloads within ETL workloads
☆26Updated 3 months ago
Alternatives and similar repositories for lake-loader
Users that are interested in lake-loader are comparing it to the libraries listed below
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated last year
- Multi-hop declarative data pipelines☆118Updated this week
- Presto Trino with Apache Hive Postgres metastore☆43Updated 11 months ago
- ☆58Updated this week
- ☆91Updated 7 months ago
- A curated list of awesome lakehouse frameworks, applications, etc.☆34Updated 6 months ago
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po…☆80Updated 2 months ago
- Yet Another (Spark) ETL Framework☆21Updated last year
- MemQ is an efficient, scalable cloud native PubSub system☆138Updated last week
- ☆40Updated 2 years ago
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆78Updated 5 months ago
- Management and automation platform for Stateful Distributed Systems☆109Updated this week
- ☆59Updated last year
- Distributed SQL query engine for big data☆50Updated this week
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated this week
- CLI tool to bulk migrate the tables from one catalog to another without a data copy☆80Updated 4 months ago
- A Table format agnostic data sharing framework☆38Updated last year
- Compaction runtime for Apache Iceberg.☆67Updated this week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆133Updated last week
- Stackable Operator for Apache Airflow☆30Updated this week
- Lakehouse storage system benchmark☆75Updated 2 years ago
- A testing framework for Trino☆26Updated 5 months ago
- ☆39Updated last month
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆79Updated this week
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆164Updated 8 months ago
- Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide)☆228Updated 2 years ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆279Updated this week
- In-Memory Analytics for Kafka using DuckDB☆133Updated last week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated last week
- A BYOC option for Snowflake workloads☆88Updated this week