apache / arrowLinks
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
☆16,459Updated last week
Alternatives and similar repositories for arrow
Users that are interested in arrow are comparing it to the libraries listed below
Sorting:
- Apache DataFusion SQL Query Engine☆8,344Updated this week
- A composable and fully extensible C++ execution engine library for data management systems.☆4,037Updated last week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,569Updated this week
- Apache Iceberg☆8,485Updated last week
- DuckDB is an analytical in-process SQL database management system☆35,793Updated last week
- Parallel computing with task scheduling☆13,727Updated this week
- Apache Parquet Java☆3,019Updated 2 weeks ago
- the portable Python dataframe library☆6,371Updated last week
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,465Updated this week
- High-performance runtime for data analytics applications☆3,006Updated 3 years ago
- Apache Parquet Format☆2,224Updated this week
- HeavyDB (formerly MapD/OmniSciDB)☆3,056Updated last month
- ClickHouse® is a real-time analytics database management system☆45,658Updated this week
- Extremely fast Query Engine for DataFrames, written in Rust☆37,283Updated this week
- Apache Pinot - A realtime distributed OLAP datastore☆6,020Updated this week
- Event streaming platform for agents, apps, and analytics. Continuously ingest, transform, and serve event data in real time, at scale.☆8,753Updated last week
- A library that provides an embeddable, persistent key-value store for fast storage.☆31,471Updated last week
- The live data layer for apps and AI agents Create up-to-the-second views into your business, just using SQL☆6,220Updated last week
- Apache Druid: a high performance real-time analytics database.☆13,928Updated last week
- The official home of the Presto distributed SQL query engine for big data☆16,646Updated this week
- Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.☆9,129Updated this week
- Alluxio, data orchestration for analytics and machine learning in the cloud☆7,153Updated 9 months ago
- NoSQL data store using the Seastar framework, compatible with Apache Cassandra and Amazon DynamoDB☆15,314Updated this week
- A fast compressor/decompressor☆6,518Updated 2 weeks ago
- Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)☆12,485Updated last week
- Upserts, Deletes And Incremental Processing on Big Data.☆6,084Updated this week
- Distributed transactional key-value database, originally created to complement TiDB☆16,505Updated this week
- Apache Flink☆25,764Updated this week
- Machine Learning Toolkit for Kubernetes☆15,431Updated last month
- FoundationDB - the open source, distributed, transactional key-value store☆16,118Updated this week