marsupialtail/quokka

Making data lake work for time series

☆1,192

Alternatives and similar repositories for quokka

Users interested in quokka are comparing it to the libraries listed below.

tobymao / sqlglot
Python SQL Parser and Transpiler
☆9,477 · Updated this week
fugue-project / fugue
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…
☆2,170 · May 19, 2026 · Updated 2 months ago
lance-format / lance
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…
☆6,884 · Updated this week
Eventual-Inc / Daft
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
☆5,669 · Updated this week
apache / datafusion-ballista
Apache DataFusion Ballista Distributed Query Engine
☆2,096 · Updated this week
substrait-io / substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
☆1,538 · Updated this week
datafusion-contrib / ray-sql
Distributed SQL Query Engine in Python using Ray
☆245 · Oct 2, 2024 · Updated last year
ibis-project / ibis
the portable Python dataframe library
☆6,614 · Updated this week
apache / datafusion-python
Apache DataFusion Python Bindings
☆594 · Updated this week
apache / datafusion
Apache DataFusion SQL Query Engine
☆9,044 · Updated this week
ray-project / deltacat
A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…
☆282 · Apr 17, 2026 · Updated 3 months ago
sutoiku / puffin
Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg
☆332 · Mar 28, 2023 · Updated 3 years ago
eakmanrq / sqlframe
Turning PySpark Into a Universal DataFrame API
☆528 · Updated this week
facebookincubator / velox
A composable and fully extensible C++ execution engine library for data management systems.
☆4,180 · Updated this week
SQLMesh / sqlmesh
Scalable and efficient data transformation framework - backwards compatible with dbt.
☆3,229 · Updated this week
delta-io / delta-rs
A native Rust library for Delta Lake, with bindings into Python
☆3,274 · Updated this week
GlareDB / glaredb
GlareDB: A light and fast SQL database for analytics
☆1,019 · Nov 14, 2025 · Updated 8 months ago
pola-rs / polars
Extremely fast Query Engine for DataFrames, written in Rust
☆39,127 · Updated this week
vortex-data / vortex
An extensible, state-of-the-art framework for columnar compression, and the fastest FOSS columnar file format. Formerly at @spiraldb, now…
☆3,108 · Updated this week
apache / datafusion-ray
Apache DataFusion Ray
☆230 · May 15, 2026 · Updated 2 months ago
apache / arrow-adbc
Database connectivity API standard and libraries for Apache Arrow
☆615 · Updated this week
Mause / duckdb-deltatable-extension
A purely experimental DuckDB Deltalake extension
☆94 · Jul 20, 2026 · Updated last week
PRQL / prql
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
☆10,883 · Updated this week
facebookincubator / nimble
New and extensible file format for storage of large columnar datasets.
☆728 · Updated this week
roapi / roapi
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
☆3,421 · Mar 25, 2026 · Updated 4 months ago
jorgecarleitao / arrow2
Transmute-free Rust library to work with the Arrow format
☆1,063 · Feb 27, 2024 · Updated 2 years ago
sfu-db / connector-x
Fastest library to load data from DB to DataFrames in Rust and Python
☆2,637 · Jul 20, 2026 · Updated last week
duckdb / duckdb
DuckDB is an analytical in-process SQL database management system
☆39,794 · Updated this week
MaterializeInc / materialize
The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL
☆6,342 · Updated this week
splitgraph / seafowl
Analytical database for data-driven Web applications 🪶
☆515 · Feb 25, 2025 · Updated last year
barakalon / mysql-mimic
☆121 · May 5, 2026 · Updated 2 months ago
ArroyoSystems / arroyo
Distributed stream processing engine in Rust
☆4,976 · Updated this week
deepseek-ai / smallpond
A lightweight data processing framework built on DuckDB and 3FS.
☆4,972 · Mar 5, 2025 · Updated last year
boilingdata / node-boilingdata
BoilingData JS client (NodeJS and Browsers)
☆18 · Sep 25, 2024 · Updated last year
malloydata / malloy
Malloy is a modern open source language for describing data relationships and transformations.
☆2,535 · Updated this week
apache / arrow
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
☆16,965 · Updated this week
bytewax / bytewax
Python Stream Processing
☆2,040 · Jun 20, 2026 · Updated last month
hydradatabase / columnar
Postgres-native columnar storage extension
☆3,037 · Feb 10, 2025 · Updated last year
moj-analytical-services / splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
☆2,298 · Updated this week