awslabs / analytics-accelerator-s3
Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.
☆46Updated this week
Alternatives and similar repositories for analytics-accelerator-s3
Users who are interested in analytics-accelerator-s3 are comparing it to the libraries listed below.
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆127Updated last month
- Multi-hop declarative data pipelines☆117Updated this week
- CLI tool to bulk migrate the tables from one catalog to another without a data copy☆79Updated 3 months ago
- A lightweight UI for Lakekeeper☆13Updated last week
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated 10 months ago
- Java implementation for performing operations on Apache Iceberg and Hive tables☆19Updated 2 months ago
- A dbt adapter for Decodable☆12Updated 4 months ago
- A BYOC option for Snowflake workloads☆81Updated this week
- Apache DataFusion Benchmarks☆20Updated 3 months ago
- Compaction runtime for Apache Iceberg.☆47Updated this week
- Firebolt Core is a free, self-hosted edition of Firebolt's distributed query engine (https://www.firebolt.io/); it provides high-performa…☆160Updated this week
- ☆30Updated 7 months ago
- Pythonic Iceberg REST Catalog☆2Updated 3 weeks ago
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆77Updated this week
- Unity Catalog UI☆41Updated 10 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆81Updated 9 months ago
- ☆27Updated last month
- ☆45Updated 2 weeks ago
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆76Updated 4 months ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆230Updated last week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆271Updated last week
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆25Updated 2 weeks ago
- Apache DataFusion Ray☆209Updated 3 months ago
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆163Updated 7 months ago
- ☆295Updated this week
- ☆70Updated 6 months ago
- The Control Plane for Apache Iceberg.☆282Updated this week
- Apache Parquet Testing☆64Updated last week
- Lakehouse storage system benchmark☆75Updated 2 years ago