awslabs / analytics-accelerator-s3
Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.
☆65Updated last week
Alternatives and similar repositories for analytics-accelerator-s3
Users who are interested in analytics-accelerator-s3 are comparing it to the libraries listed below:
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆149Updated 2 weeks ago
- Compaction runtime for Apache Iceberg.☆116Updated last week
- Java implementation for performing operations on Apache Iceberg and Hive tables☆19Updated 4 months ago
- Multi-hop declarative data pipelines☆124Updated this week
- A lightweight UI for Lakekeeper☆16Updated this week
- CLI tool to bulk migrate the tables from one catalog to another without a data copy☆83Updated 10 months ago
- Apache DataFusion Benchmarks☆24Updated last month
- ☆30Updated last year
- Experimental version. A BYOC option for Snowflake workloads☆100Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆86Updated last year
- The observability platform for Iceberg lakehouses.☆437Updated last month
- Apache DataFusion Ray☆229Updated 4 months ago
- ☆376Updated this week
- A dbt adapter for Decodable☆12Updated 5 months ago
- TPC-H benchmark data generation in pure Rust☆227Updated this week
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆267Updated 2 weeks ago
- A one-afternoon implementation of redis-like primitives with S3 Express☆33Updated last year
- Collection of AWS Lambdas for creating and managing Delta tables☆57Updated last month
- ☆61Updated this week
- Fully Managed, Streaming Ingestion (CDC) into your Lakehouse☆302Updated this week
- Unofficial Rust implementation of Apache Iceberg with integration for DataFusion☆233Updated this week
- Apache Parquet Testing☆81Updated 2 months ago
- Unity Catalog UI☆43Updated last year
- A table-format-agnostic data sharing framework☆42Updated 2 years ago
- A native Delta implementation for integration with any query engine☆312Updated this week
- Lance Namespace is an open specification for describing access and operations against a collection of tables in a multimodal lakehouse☆47Updated last month
- Sample code to collect Apache Iceberg metrics for table monitoring☆29Updated last year
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆46Updated last month
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆20Updated last year
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆298Updated last week