onehouseinc / lake-loaderLinks
A tool to benchmark L (loading) workloads within ETL workloads
☆24Updated 3 weeks ago
Alternatives and similar repositories for lake-loader
Users that are interested in lake-loader are comparing it to the libraries listed below
Sorting:
- A Table format agnostic data sharing framework☆38Updated last year
- ☆85Updated 4 months ago
- Presto Trino with Apache Hive Postgres metastore☆41Updated 8 months ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture☆74Updated last month
- Iceberg Playground in a Box☆52Updated last week
- Yet Another (Spark) ETL Framework☆21Updated last year
- Unity Catalog UI☆40Updated 9 months ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆27Updated 9 months ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆29Updated last year
- Multi-hop declarative data pipelines☆115Updated this week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- ☆56Updated this week
- a curated list of awesome lakehouse frameworks, applications, etc☆29Updated 3 months ago
- ☆57Updated 10 months ago
- Monitoring and insights on your data lakehouse tables☆29Updated last month
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆78Updated last month
- ☆14Updated this week
- ☆40Updated 2 years ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated last year
- ☆80Updated last month
- Quick Guides from Dremio on Several topics☆71Updated last week
- A tool that makes it easy to run modular Trino environments locally.☆38Updated this week
- Apache Hive Metastore as a Standalone server in Docker☆76Updated 9 months ago
- MCP Server for Trino developed via MCP Python SDK☆15Updated last month
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆259Updated this week
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po…☆76Updated last month
- ☆27Updated 2 months ago
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆173Updated 4 months ago
- The Internals of Spark on Kubernetes☆71Updated 3 years ago
- Storage connector for Trino☆110Updated last month