IBM / java-iceberg-toolkitLinks
Java implementation for performing operations on Apache Iceberg and Hive tables
☆19Updated last month
Alternatives and similar repositories for java-iceberg-toolkit
Users that are interested in java-iceberg-toolkit are comparing it to the libraries listed below
Sorting:
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆56Updated this week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆139Updated 2 months ago
- A BYOC option for Snowflake workloads☆101Updated this week
- Compaction runtime for Apache Iceberg.☆104Updated this week
- Multi-hop declarative data pipelines☆122Updated last week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆81Updated 6 months ago
- Lakevision is a tool which provides insights into your Apache Iceberg based Data Lakehouse.☆44Updated this week
- In-Memory Analytics for Kafka using DuckDB☆141Updated last week
- ☆30Updated 10 months ago
- ☆332Updated last week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆221Updated this week
- Apache DataFusion Ray☆221Updated 3 weeks ago
- The observability platform for Iceberg lakehouses.☆369Updated this week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆255Updated last week
- A native Delta implementation for integration with any query engine☆273Updated this week
- Iceberg Playground in a Box☆67Updated 4 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆82Updated last year
- TPC-H benchmark data generation in pure Rust☆202Updated last month
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆329Updated 2 years ago
- A leightweight UI for Lakekeeper☆15Updated last week
- ☆58Updated last week
- Fully Managed, Streaming Ingestion (CDC) into your Lakehouse☆231Updated this week
- ☆230Updated last week
- Run, mock and test fake Snowflake databases locally.☆153Updated 2 weeks ago
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆92Updated 3 weeks ago
- ☆33Updated 5 months ago
- Apache Parquet Testing☆73Updated 2 months ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆288Updated last week
- A Table format agnostic data sharing framework☆41Updated last year
- Open Control Plane for Tables in Data Lakehouse☆370Updated last week