onehouseinc / lake-loader
A tool to benchmark L (loading) workloads within ETL workloads
☆27 Updated 4 months ago
Alternatives and similar repositories for lake-loader
Users who are interested in lake-loader are comparing it to the libraries listed below
- Multi-hop declarative data pipelines ☆120 Updated 2 weeks ago
- Stackable Operator for Apache Airflow ☆31 Updated this week
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka. ☆64 Updated 2 years ago
- ☆39 Updated last week
- Yet Another (Spark) ETL Framework ☆21 Updated last year
- ☆58 Updated this week
- ☆99 Updated 8 months ago
- ☆40 Updated 2 years ago
- A curated list of awesome lakehouse frameworks, applications, etc. ☆35 Updated 7 months ago
- Presto Trino with Apache Hive Postgres metastore ☆43 Updated last year
- A table-format-agnostic data sharing framework ☆39 Updated last year
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa… ☆89 Updated 6 months ago
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po… ☆84 Updated 3 months ago
- Iceberg Playground in a Box ☆67 Updated 3 months ago
- CLI tool to bulk migrate the tables from one catalog to another without a data copy ☆81 Updated 5 months ago
- MemQ is an efficient, scalable cloud native PubSub system ☆138 Updated this week
- Management and automation platform for Stateful Distributed Systems ☆110 Updated this week
- In-Memory Analytics for Kafka using DuckDB ☆138 Updated this week
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆76 Updated last week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆30 Updated 2 weeks ago
- ☆59 Updated last year
- Mock streaming data generator ☆17 Updated last year
- Documentation for Hyper, the blazingly fast SQL engine powering analytics at Tableau and Salesforce ☆32 Updated 2 weeks ago
- Compaction runtime for Apache Iceberg. ☆87 Updated last week
- Docker environment to stream data from Kafka to Iceberg tables ☆30 Updated last year
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 ☆100 Updated 2 years ago
- ☆10 Updated 2 years ago
- Generated Kafka protocol implementations ☆33 Updated last month
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier! ☆38 Updated 2 weeks ago
- ☆30 Updated 4 months ago