apache / arrowLinks
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
☆16,241Updated this week
Alternatives and similar repositories for arrow
Users that are interested in arrow are comparing it to the libraries listed below
Sorting:
- Apache DataFusion SQL Query Engine☆8,130Updated this week
- Parallel computing with task scheduling☆13,648Updated this week
- Extremely fast Query Engine for DataFrames, written in Rust☆36,504Updated this week
- Apache Parquet Java☆3,002Updated this week
- DuckDB is an analytical in-process SQL database management system☆34,614Updated last week
- the portable Python dataframe library☆6,262Updated last week
- Apache Parquet Format☆2,143Updated this week
- Apache Iceberg☆8,292Updated last week
- The live data layer for apps and AI agents Create up-to-the-second views into your business, just using SQL☆6,184Updated last week
- 𝗔𝗜-𝗡𝗮𝘁𝗶𝘃𝗲 𝗗𝗮𝘁𝗮 𝗪𝗮𝗿𝗲𝗵𝗼𝘂𝘀𝗲. Blazing analytics, fast search, geo insights, vector AI. Built for multimodal analytics, O…☆9,031Updated this week
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆5,816Updated this week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆43,457Updated last week
- NoSQL data store using the Seastar framework, compatible with Apache Cassandra and Amazon DynamoDB☆15,128Updated this week
- HeavyDB (formerly MapD/OmniSciDB)☆3,043Updated last month
- Apache Pinot - A realtime distributed OLAP datastore☆5,978Updated this week
- Distributed transactional key-value database, originally created to complement TiDB☆16,370Updated this week
- High-performance runtime for data analytics applications☆3,004Updated 3 years ago
- Streaming data platform. Real-time stream processing, low-latency serving, and Iceberg table management.☆8,580Updated last week
- cuDF - GPU DataFrame Library☆9,375Updated this week
- Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!☆11,423Updated this week
- The Universal Storage Engine☆2,003Updated this week
- An orchestration platform for the development, production, and observation of data assets.☆14,582Updated this week
- YugabyteDB - the cloud native distributed SQL database for mission-critical applications.☆9,956Updated this week
- Apache Pulsar - distributed pub-sub messaging system☆15,005Updated this week
- Data-Centric Pipelines and Data Versioning☆6,274Updated 10 months ago
- ClickHouse® is a real-time analytics database management system☆44,621Updated this week
- BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.☆1,994Updated 3 years ago
- Official Rust implementation of Apache Arrow☆3,261Updated this week
- FoundationDB - the open source, distributed, transactional key-value store☆15,989Updated this week
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,395Updated this week