onehouseinc / lake-loaderLinks
A tool to benchmark L (loading) workloads within ETL workloads
☆30Updated last week
Alternatives and similar repositories for lake-loader
Users that are interested in lake-loader are comparing it to the libraries listed below
Sorting:
- Multi-hop declarative data pipelines☆122Updated last week
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆65Updated 2 years ago
- ☆107Updated 11 months ago
- ☆64Updated last year
- ☆40Updated 2 years ago
- Compaction runtime for Apache Iceberg.☆113Updated this week
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po…☆87Updated 6 months ago
- Presto Trino with Apache Hive Postgres metastore☆43Updated last year
- A Table format agnostic data sharing framework☆42Updated last year
- Management and automation platform for Stateful Distributed Systems☆110Updated last week
- In-Memory Analytics for Kafka using DuckDB☆146Updated last week
- Yet Another (Spark) ETL Framework☆21Updated 2 years ago
- ☆60Updated 2 weeks ago
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆100Updated 3 months ago
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …☆12Updated 3 weeks ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆103Updated 2 years ago
- The observability platform for Iceberg lakehouses.☆410Updated 2 weeks ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆145Updated 4 months ago
- Iceberg Playground in a Box☆67Updated 6 months ago
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆88Updated 2 months ago
- ☆40Updated 3 weeks ago
- a curated list of awesome lakehouse frameworks, applications, etc☆37Updated last month
- A testing framework for Trino☆26Updated 9 months ago
- A home for LinkedIn's changes to Apache Iceberg☆63Updated 3 weeks ago
- ☆81Updated 8 months ago
- MemQ is an efficient, scalable cloud native PubSub system☆140Updated 2 months ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆83Updated 8 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated 2 weeks ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆169Updated 3 months ago
- ACID Data Source for Apache Spark based on Hive ACID☆97Updated 4 years ago