IBM / java-iceberg-toolkit
Java implementation for performing operations on Apache Iceberg and Hive tables
☆20Updated 5 months ago
Alternatives and similar repositories for java-iceberg-toolkit:
Users that are interested in java-iceberg-toolkit are comparing it to the libraries listed below
- A testing framework for Trino☆26Updated 4 months ago
- ☆10Updated last year
- Trino connectors for accessing APIs with an OpenAPI spec☆31Updated this week
- BigQuery connector for Apache Flink☆29Updated 3 weeks ago
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆29Updated 2 weeks ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆76Updated 3 weeks ago
- Apache iceberg Spark s3 examples☆20Updated last year
- Presto Trino with Apache Hive Postgres metastore☆40Updated 6 months ago
- ☆56Updated this week
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆27Updated this week
- Unity Catalog UI☆40Updated 6 months ago
- In-Memory Analytics for Kafka using DuckDB☆104Updated this week
- pulsar lakehouse connector☆32Updated this week
- An open-source, community-driven REST catalog for Apache Iceberg!☆26Updated 8 months ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆25Updated last year
- a curated list of awesome lakehouse frameworks, applications, etc☆23Updated 3 weeks ago
- A tool that makes it easy to run modular Trino environments locally.☆34Updated last week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆104Updated last month
- Open, Multi-modal Catalog for Data & AI, written in Rust☆78Updated 5 months ago
- Multi-hop declarative data pipelines☆111Updated this week
- LinkedIn's version of Apache Calcite☆22Updated 4 months ago
- minio as local storage and DynamoDB as catalog☆13Updated 10 months ago
- ☆33Updated 2 years ago
- ☆15Updated this week
- A Table format agnostic data sharing framework☆38Updated last year
- Trino load balancer with support for routing, queueing and auto-scaling☆26Updated this week
- ☆25Updated last year
- Pinterest's simplified and efficient Tiered Storage implementation for Kafka☆19Updated 2 weeks ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated this week
- Auxiliary testing files for Apache Arrow☆15Updated 2 months ago