IBM / java-iceberg-toolkitLinks
Java implementation for performing operations on Apache Iceberg and Hive tables
☆19Updated 4 months ago
Alternatives and similar repositories for java-iceberg-toolkit
Users that are interested in java-iceberg-toolkit are comparing it to the libraries listed below
Sorting:
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆65Updated this week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆147Updated last week
- Experimental version. A BYOC option for Snowflake workloads☆100Updated this week
- Compaction runtime for Apache Iceberg.☆115Updated this week
- Fully Managed, Streaming Ingestion (CDC) into your Lakehouse☆301Updated this week
- Apache DataFusion Ray☆229Updated 4 months ago
- Apache Parquet Testing☆80Updated 2 months ago
- Multi-hop declarative data pipelines☆124Updated 2 weeks ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆83Updated 9 months ago
- The observability platform for Iceberg lakehouses.☆437Updated 3 weeks ago
- TPC-H benchmark data generation in pure Rust☆226Updated this week
- Lakevision is a tool which provides insights into your Apache Iceberg based Data Lakehouse.☆47Updated 3 weeks ago
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆268Updated last week
- ☆30Updated last year
- In-Memory Analytics for Kafka using DuckDB☆147Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆86Updated last year
- ☆374Updated last week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆232Updated this week
- ☆33Updated 8 months ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆298Updated this week
- Pure Rust Iceberg Implementation☆162Updated last year
- A tool to benchmark L (loading) workloads within ETL workloads☆31Updated last week
- A native Delta implementation for integration with any query engine☆311Updated last week
- Iceberg Playground in a Box☆67Updated 7 months ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆169Updated 4 months ago
- Arrow Flight SQL Server☆125Updated 7 months ago
- Database connectivity API standard and libraries for Apache Arrow☆545Updated this week
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆101Updated 4 months ago
- Apache DataFusion Benchmarks☆24Updated last month
- OpenData is a collection of open source databases built on a common, object-native storage and infrastructure foundation.☆72Updated this week