PacktPublishing / In-Memory-Analytics-with-Apache-Arrow-Second-EditionLinks
In-Memory Analytics with Apache Arrow, Published by Packt
☆34Updated last week
Alternatives and similar repositories for In-Memory-Analytics-with-Apache-Arrow-Second-Edition
Users that are interested in In-Memory-Analytics-with-Apache-Arrow-Second-Edition are comparing it to the libraries listed below
Sorting:
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆266Updated 11 months ago
- Apache DataFusion Python Bindings☆497Updated this week
- ☆94Updated 7 months ago
- Database connectivity API standard and libraries for Apache Arrow☆481Updated last week
- ☆309Updated this week
- Turning PySpark Into a Universal DataFrame API☆426Updated this week
- ☆59Updated 4 months ago
- Iceberg Playground in a Box☆62Updated 2 months ago
- A write-audit-publish implementation on a data lake without the JVM☆46Updated last year
- ☆155Updated 3 months ago
- Proof-of-concept extension combining the delta extension with Unity Catalog☆89Updated 2 months ago
- Quick Guides from Dremio on Several topics☆74Updated 3 weeks ago
- ☆268Updated 10 months ago
- TPC-H_SF10☆53Updated 7 months ago
- ☆38Updated 5 months ago
- ☆139Updated last month
- Distributed SQL Engine in Python using Dask☆407Updated last year
- DuckDB extension for Delta Lake☆200Updated 2 weeks ago
- Delta Lake helper methods. No Spark dependency.☆23Updated last year
- ☆80Updated 6 months ago
- Apache DataFusion Ray☆219Updated last month
- Open, Multi-modal Catalog for Data & AI, written in Rust☆81Updated 11 months ago
- Apache Hive Metastore as a Standalone server in Docker☆79Updated last year
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture☆93Updated 2 months ago
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆249Updated last week
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆115Updated 5 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆188Updated this week
- ☆70Updated 8 months ago
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆330Updated 2 years ago
- The smallest DuckDB SQL orchestrator on Earth.☆320Updated 4 months ago