apache / arrowLinks
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
☆16,179Updated this week
Alternatives and similar repositories for arrow
Users that are interested in arrow are comparing it to the libraries listed below
Sorting:
- Apache DataFusion SQL Query Engine☆8,061Updated this week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,398Updated last week
- A composable and fully extensible C++ execution engine library for data management systems.☆3,956Updated this week
- Parallel computing with task scheduling☆13,603Updated last week
- The live data layer for apps and AI agents Create up-to-the-second views into your business, just using SQL☆6,172Updated this week
- DuckDB is an analytical in-process SQL database management system☆34,344Updated this week
- Apache Parquet Java☆2,994Updated last week
- the portable Python dataframe library☆6,226Updated this week
- 𝗔𝗜-𝗡𝗮𝘁𝗶𝘃𝗲 𝗗𝗮𝘁𝗮 𝗪𝗮𝗿𝗲𝗵𝗼𝘂𝘀𝗲. Blazing analytics, fast search, geo insights, vector AI. Built for multimodal analytics, O…☆8,984Updated last week
- An orchestration platform for the development, production, and observation of data assets.☆14,455Updated last week
- Streaming data platform. Real-time stream processing, low-latency serving, and Iceberg table management.☆8,534Updated this week
- Apache Parquet Format☆2,115Updated last month
- High-performance runtime for data analytics applications☆3,003Updated 3 years ago
- Apache Iceberg☆8,246Updated this week
- NoSQL data store using the Seastar framework, compatible with Apache Cassandra and Amazon DynamoDB☆15,065Updated this week
- Build, Manage and Deploy AI/ML Systems☆9,640Updated this week
- Extremely fast Query Engine for DataFrames, written in Rust☆36,147Updated last week
- Apache Spark - A unified analytics engine for large-scale data processing☆42,357Updated this week
- Apache Druid: a high performance real-time analytics database.☆13,877Updated this week
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,376Updated this week
- AI + Data, online. https://vespa.ai☆6,602Updated last week
- FoundationDB - the open source, distributed, transactional key-value store☆15,854Updated this week
- ClickHouse® is a real-time analytics database management system☆44,176Updated this week
- The official home of the Presto distributed SQL query engine for big data☆16,568Updated this week
- A native Rust library for Delta Lake, with bindings into Python☆3,040Updated last week
- HeavyDB (formerly MapD/OmniSciDB)☆3,040Updated last month
- ZetaSQL - Analyzer Framework for SQL☆2,431Updated 2 weeks ago
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆5,723Updated last week
- Data-Centric Pipelines and Data Versioning☆6,267Updated 9 months ago
- Apache Pinot - A realtime distributed OLAP datastore☆5,959Updated last week