[SIGMOD 2026] F3: The Open-Source Data File Format for the Future
☆421Nov 3, 2025Updated 4 months ago
Alternatives and similar repositories for F3
Users that are interested in F3 are comparing it to the libraries listed below
- An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Li…☆2,801Updated this week
- Next-Gen Big Data File Format☆664Oct 11, 2025Updated 5 months ago
- [VLDB 2023 Vol 17] "An Empirical Evaluation of Columnar Storage Formats"☆69Oct 15, 2025Updated 5 months ago
- Compaction runtime for Apache Iceberg.☆121Mar 10, 2026Updated last week
- DuckDB extension to read files within zip archives.☆58Mar 9, 2026Updated last week
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆15Feb 15, 2025Updated last year
- Continuously updated cloud database papers☆83May 22, 2024Updated last year
- This repo demonstrates an Apache Arrow Flight server implementation in Kubernetes.☆12Oct 25, 2024Updated last year
- Apache DataFusion Benchmarks☆22Mar 3, 2026Updated 2 weeks ago
- Implementation of the Apache ORC file format using the Apache Arrow in-memory format☆47Jan 13, 2026Updated 2 months ago
- Tremor Language Server (Trill)☆13Apr 18, 2024Updated last year
- Protobuf to Arrow, using Rust☆24Updated this week
- The source code for the www.tremor.rs website☆15Feb 10, 2026Updated last month
- A minimal Python library for Apache Arrow, connecting to the Rust Arrow crate☆256Mar 2, 2026Updated 2 weeks ago
- A simplified, generic, entity based web library for golang that's drop in compatible with net/http☆10Jul 14, 2023Updated 2 years ago
- ☆30Dec 4, 2024Updated last year
- 🚀 GizmoSQL — High-Performance SQL Server☆296Updated this week
- A Rust implementation of HyperLogLog trying to be parsimonious with memory.☆33Sep 24, 2024Updated last year
- Arrow Flight SQL Server☆131Jun 21, 2025Updated 9 months ago
- Apache Arrow Development Experiments☆25Nov 6, 2025Updated 4 months ago
- GlareDB: A light and fast SQL database for analytics☆1,005Nov 14, 2025Updated 4 months ago
- Tonbo is an embedded database for serverless and edge runtimes.☆1,504Mar 6, 2026Updated 2 weeks ago
- Monitoring Databricks using Prometheus, Grafana and Pyroscope☆27Jul 29, 2025Updated 7 months ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,487Updated this week
- DataFusion TableProviders for reading data from other systems☆175Updated this week
- LSM-based key-value store in Rust, designed for the cloud☆87Feb 27, 2022Updated 4 years ago
- Terminal-based context management for AI driven development☆21Oct 3, 2025Updated 5 months ago
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,730Updated this week
- A Minimalistic Rust Implementation of Delta Sharing Server.☆98Mar 17, 2025Updated last year
- HyperTwoBits implementation☆17Aug 29, 2025Updated 6 months ago
- Apache DataFusion Ray☆228Oct 5, 2025Updated 5 months ago
- Self-Tuning Adaptive Radix Tree☆30Apr 19, 2020Updated 5 years ago
- Apache Hive Metastore in Standalone Mode With Docker☆14Jul 22, 2024Updated last year
- Rust based high-performance Apache Uniffle shuffle-server☆62Updated this week
- ghir is a CLI making past GitHub Releases immutable☆26Updated this week
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆317Updated this week
- DuckLake is an integrated data lake and catalog format☆2,542Mar 12, 2026Updated last week
- Providing wrapper types for safely performing panic-free checked arithmetic on instants and durations.☆17Mar 12, 2026Updated last week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆270Updated this week