IBM / java-iceberg-toolkitLinks
Java implementation for performing operations on Apache Iceberg and Hive tables
☆19Updated 3 weeks ago
Alternatives and similar repositories for java-iceberg-toolkit
Users that are interested in java-iceberg-toolkit are comparing it to the libraries listed below
Sorting:
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆55Updated this week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆139Updated last month
- A BYOC option for Snowflake workloads☆101Updated this week
- Compaction runtime for Apache Iceberg.☆90Updated last week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆82Updated last year
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆220Updated this week
- Lakevision is a tool which provides insights into your Apache Iceberg based Data Lakehouse.☆44Updated 2 weeks ago
- Apache DataFusion Ray☆221Updated 2 months ago
- Multi-hop declarative data pipelines☆120Updated 2 weeks ago
- ☆326Updated last week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆254Updated last week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆81Updated 5 months ago
- In-Memory Analytics for Kafka using DuckDB☆138Updated last week
- Arrow Flight SQL Server☆111Updated 3 months ago
- ☆30Updated 10 months ago
- TPC-H benchmark data generation in pure Rust☆188Updated last month
- A native Delta implementation for integration with any query engine☆272Updated this week
- ☆33Updated 5 months ago
- A leightweight UI for Lakekeeper☆15Updated this week
- DataFusion TableProviders for reading data from other systems☆149Updated last week
- Fully Managed, Streaming Ingestion (CDC) into your Lakehouse☆206Updated this week
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆91Updated this week
- Database connectivity API standard and libraries for Apache Arrow☆489Updated this week
- A Table format agnostic data sharing framework☆39Updated last year
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆165Updated 3 weeks ago
- Apache Parquet Testing☆73Updated last month
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆330Updated 2 years ago
- ☆229Updated last week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆286Updated last week
- Apache Iceberg Documentation Site☆42Updated last year