[SIGMOD 2026] F3: The Open-Source Data File Format for the Future
☆412Nov 3, 2025Updated 3 months ago
Alternatives and similar repositories for F3
Users that are interested in F3 are comparing it to the libraries listed below
Sorting:
- An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Li…☆2,742Updated this week
- This repo demonstrates an Apache Arrow Flight server implementation in Kubernetes.☆12Oct 25, 2024Updated last year
- Next-Gen Big Data File Format☆659Oct 11, 2025Updated 4 months ago
- Compaction runtime for Apache Iceberg.☆119Feb 13, 2026Updated 2 weeks ago
- Protobuf to Arrow, using Rust☆24Feb 20, 2026Updated last week
- ☆30Dec 4, 2024Updated last year
- [VLDB 2023 Vol 17] "An Empirical Evaluation of Columnar Storage Formats"☆69Oct 15, 2025Updated 4 months ago
- Arrow Flight SQL Server☆129Jun 21, 2025Updated 8 months ago
- DuckDB extension to read files within zip archives.☆56Updated this week
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆15Feb 15, 2025Updated last year
- continuously update cloud database papers☆83May 22, 2024Updated last year
- Tonbo is an embedded database for serverless and edge runtimes.☆1,497Feb 20, 2026Updated last week
- 🚀 GizmoSQL — High-Performance SQL Server☆290Updated this week
- Apache DataFusion Benchmarks☆23Dec 31, 2025Updated 2 months ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,476Updated this week
- Apache DataFusion Ray☆228Oct 5, 2025Updated 4 months ago
- Open Data Stack Platform: a collection of projects and pipelines built with open data stack tools for scalable, observable data platform…☆22Dec 21, 2025Updated 2 months ago
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,715Updated this week
- DataFusion TableProviders for reading data from other systems☆170Updated this week
- A simplified, generic, entity based web library for golang that's drop in compatible with net/http☆10Jul 14, 2023Updated 2 years ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆97Mar 17, 2025Updated 11 months ago
- Self-Tuning Adaptive Radix Tree☆30Apr 19, 2020Updated 5 years ago
- GlareDB: A light and fast SQL database for analytics☆1,001Nov 14, 2025Updated 3 months ago
- A tool to benchmark L (loading) workloads within ETL workloads☆31Feb 10, 2026Updated 2 weeks ago
- A minimal Python library for Apache Arrow, connecting to the Rust Arrow crate☆250Updated this week
- Apache Arrow Development Experiments☆25Nov 6, 2025Updated 3 months ago
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆269Updated this week
- Embeddable Aggregate Management System for Streams and Queries.☆107Nov 8, 2025Updated 3 months ago
- Monitoring Databricks using Prometheus, Grafana and Pyroscope☆27Jul 29, 2025Updated 7 months ago
- A compute manifest and composable tools for data, built on Ibis, DataFusion, and Arrow Flight.☆487Updated this week
- Sqllogictest parser and runner in Rust, with extensions.☆217Feb 14, 2026Updated 2 weeks ago
- A cloud native embedded storage engine built on object storage.☆2,741Updated this week
- The DuckDB Python package☆118Updated this week
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,074Updated this week
- Rust based high-performance Apache Uniffle shuffle-server☆62Updated this week
- DuckDB HTTP API Server and Query Interface in a Community Extension☆273Feb 18, 2026Updated last week
- Hybrid in-memory and disk cache in Rust☆1,639Updated this week
- Analytical database for data-driven Web applications 🪶☆509Feb 25, 2025Updated last year
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆315Updated this week