facebookincubator / veloxLinks
A composable and fully extensible C++ execution engine library for data management systems.
☆3,906Updated this week
Alternatives and similar repositories for velox
Users that are interested in velox are comparing it to the libraries listed below
Sorting:
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,453Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,391Updated this week
- Apache DataFusion SQL Query Engine☆7,844Updated this week
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,609Updated last week
- Apache DataFusion Ballista Distributed Query Engine☆1,858Updated this week
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v…☆5,462Updated this week
- Real-time event streaming platform. Streaming CDC, stream processing, low-latency serving, and Iceberg management.☆8,409Updated this week
- Apache Iceberg☆8,043Updated this week
- Upserts, Deletes And Incremental Processing on Big Data.☆5,978Updated this week
- Apache Impala☆1,243Updated this week
- Apache Parquet Format☆2,055Updated 2 weeks ago
- Apache DataFusion Comet Spark Accelerator☆1,048Updated this week
- Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch …☆3,012Updated last week
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆746Updated this week
- Mirror of Apache Kudu☆1,888Updated last week
- Apache Calcite☆4,957Updated last week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,310Updated this week
- 𝗔𝗜-𝗡𝗮𝘁𝗶𝘃𝗲 𝗗𝗮𝘁𝗮 𝗪𝗮𝗿𝗲𝗵𝗼𝘂𝘀𝗲. Open-source Snowflake alternative. Proven at petabyte scale with enterprise performance. B…☆8,917Updated this week
- Fastest SQL pipeline engine in a single C++ binary, for stream processing, analytics, observability and AI.☆1,911Updated last month
- Apache Parquet Java☆2,952Updated last week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,339Updated this week
- Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.☆1,029Updated last week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,253Updated this week
- Apache HoraeDB (incubating) is a high-performance, distributed, cloud native time-series database.☆2,789Updated last month
- Apache Iceberg☆1,098Updated last week
- Apache Pinot - A realtime distributed OLAP datastore☆5,916Updated this week
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆992Updated last week
- ClickBench: a Benchmark For Analytical Databases☆890Updated this week
- Pluggable in-process caching engine to build and scale high performance services☆1,438Updated this week
- An educational OLAP database system.☆1,781Updated 2 months ago