awslabs / analytics-accelerator-s3Links
Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.
☆55Updated this week
Alternatives and similar repositories for analytics-accelerator-s3
Users that are interested in analytics-accelerator-s3 are comparing it to the libraries listed below
Sorting:
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆139Updated last month
- Compaction runtime for Apache Iceberg.☆90Updated last week
- A BYOC option for Snowflake workloads☆101Updated this week
- A leightweight UI for Lakekeeper☆15Updated this week
- Multi-hop declarative data pipelines☆120Updated 2 weeks ago
- Java implementation for performing operations on Apache Iceberg and Hive tables☆19Updated 3 weeks ago
- A one-afternoon implementation of redis-like primitives with S3 Express☆33Updated 11 months ago
- ☆30Updated 10 months ago
- A dbt adapter for Decodable☆12Updated last month
- The Control Plane for Apache Iceberg.☆359Updated 2 weeks ago
- Apache Parquet Testing☆73Updated last month
- Apache DataFusion Ray☆221Updated 2 months ago
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆35Updated 2 years ago
- ☆33Updated 5 months ago
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆245Updated last week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆220Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆82Updated last year
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated last year
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆81Updated 5 months ago
- Lakevision is a tool which provides insights into your Apache Iceberg based Data Lakehouse.☆44Updated 2 weeks ago
- Firebolt Core is a free, self-hosted edition of Firebolt's distributed query engine (https://www.firebolt.io/); it provides high-performa…☆176Updated this week
- ☆326Updated last week
- Run query engines in Cloud Functions☆26Updated 2 years ago
- TPC-H benchmark data generation in pure Rust☆188Updated last month
- ☆48Updated 3 months ago
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆91Updated this week
- Fully Managed, Streaming Ingestion (CDC) into your Lakehouse☆206Updated this week
- Helm Charts for RisingWave☆22Updated last week
- In-Memory Analytics for Kafka using DuckDB☆138Updated last week
- Arrow Flight SQL Server☆111Updated 3 months ago