awslabs / analytics-accelerator-s3Links
Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.
☆41Updated this week
Alternatives and similar repositories for analytics-accelerator-s3
Users that are interested in analytics-accelerator-s3 are comparing it to the libraries listed below
Sorting:
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆123Updated 2 weeks ago
- Java implementation for performing operations on Apache Iceberg and Hive tables☆19Updated 3 weeks ago
- Apache DataFusion Benchmarks☆19Updated 2 months ago
- A leightweight UI for Lakekeeper☆12Updated this week
- Multi-hop declarative data pipelines☆115Updated this week
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆76Updated this week
- The Control Plane for Apache Iceberg☆59Updated this week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆78Updated last month
- ☆30Updated 6 months ago
- ☆24Updated last year
- Sample code to collect Apache Iceberg metrics for table monitoring☆27Updated 9 months ago
- ☆278Updated last week
- Lakehouse storage system benchmark☆74Updated 2 years ago
- ☆42Updated last month
- Apache Parquet Testing☆58Updated last week
- Apache DataFusion Ray☆194Updated 2 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆79Updated 8 months ago
- ☆22Updated 3 months ago
- Rust implementation of Apache Iceberg with integration for Datafusion☆191Updated this week
- ☆52Updated last week
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆19Updated 3 months ago
- Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshif…☆59Updated 8 months ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆218Updated this week
- A Pub/Sub for Tables based data integration platform, to discover, publish, modify and consume data effortlessly.☆33Updated last month
- A dbt adapter for Decodable☆12Updated 3 months ago
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆40Updated 8 months ago
- Unity Catalog UI☆40Updated 8 months ago
- Lance Namespace Specification is an open specification on top of the storage-based Lance data format to standardize access to a collectio…☆14Updated last week
- ☆33Updated 3 weeks ago
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆197Updated this week