IBM / java-iceberg-toolkitLinks
Java implementation for performing operations on Apache Iceberg and Hive tables
☆19Updated 2 months ago
Alternatives and similar repositories for java-iceberg-toolkit
Users that are interested in java-iceberg-toolkit are comparing it to the libraries listed below
Sorting:
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆57Updated last week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆142Updated 3 months ago
- Experimental version. A BYOC option for Snowflake workloads☆102Updated this week
- Compaction runtime for Apache Iceberg.☆109Updated last week
- Fully Managed, Streaming Ingestion (CDC) into your Lakehouse☆261Updated this week
- TPC-H benchmark data generation in pure Rust☆206Updated 2 weeks ago
- ☆30Updated 11 months ago
- Apache DataFusion Ray☆222Updated last month
- Multi-hop declarative data pipelines☆122Updated last week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆227Updated this week
- A native Delta implementation for integration with any query engine☆289Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆84Updated last year
- ☆33Updated 6 months ago
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆261Updated last week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆83Updated 7 months ago
- The observability platform for Iceberg lakehouses.☆381Updated last week
- Traffic routing for Trino Clusters☆27Updated 2 months ago
- Lakevision is a tool which provides insights into your Apache Iceberg based Data Lakehouse.☆45Updated last week
- In-Memory Analytics for Kafka using DuckDB☆143Updated last week
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆93Updated last month
- ☆339Updated last week
- Iceberg Playground in a Box☆67Updated 4 months ago
- Apache Parquet Testing☆75Updated 3 months ago
- Lance Namespace is an open specification on top of the storage-based Lance table and file format to standardize access to a collection of…☆35Updated last week
- Pure Rust Iceberg Implementation☆161Updated last year
- Arrow Flight SQL Server☆114Updated 5 months ago
- Apache DataFusion Benchmarks☆22Updated last month
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆167Updated 2 months ago
- A Table format agnostic data sharing framework☆42Updated last year
- Firebolt Core is a free, self-hosted edition of Firebolt's distributed query engine (https://www.firebolt.io/); it provides high-performa…☆183Updated last week