awslabs / analytics-accelerator-s3
Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.
☆29Updated last week
Alternatives and similar repositories for analytics-accelerator-s3:
Users that are interested in analytics-accelerator-s3 are comparing it to the libraries listed below
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆105Updated last month
- Java implementation for performing operations on Apache Iceberg and Hive tables☆20Updated 5 months ago
- Lakehouse storage system benchmark☆72Updated 2 years ago
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆73Updated last week
- Sample code to collect Apache Iceberg metrics for table monitoring☆25Updated 7 months ago
- Pinterest's simplified and efficient Tiered Storage implementation for Kafka☆20Updated 3 weeks ago
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆38Updated last week
- ☆24Updated last year
- ☆11Updated 4 months ago
- Apache DataFusion Benchmarks☆17Updated 4 months ago
- Best practices and recommendations for getting started with Amazon EMR on EKS.☆63Updated 3 weeks ago
- Unity Catalog UI☆40Updated 6 months ago
- ☆27Updated 7 months ago
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆39Updated 6 months ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- ☆34Updated last week
- ☆14Updated 2 months ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆31Updated last month
- Apache DataFusion Ray☆171Updated 2 weeks ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆78Updated 5 months ago
- Multi-hop declarative data pipelines☆112Updated this week
- Amazon EMR on EKS Custom Image CLI☆28Updated 6 months ago
- A dbt adapter for Decodable☆12Updated last month
- Amundsen Gremlin☆21Updated 2 years ago
- Kafka sink connector for Amazon EventBridge to send events (records) from Kafka topic(s) to the specified EventBridge event bus☆66Updated this week
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆27Updated last week
- Run the open-source online analytics database ClickHouse in an AWS Lambda function☆66Updated 4 months ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆197Updated this week
- Spark Structured Streaming Kinesis Data Streams connector supports both GetRecords and SubscribeToShard (Enhanced Fan-Out, EFO)☆31Updated 3 months ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆76Updated last month