Making data lake work for time series
☆1,190Aug 21, 2024Updated last year
Alternatives and similar repositories for quokka
Users that are interested in quokka are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python SQL Parser and Transpiler☆9,196Updated this week
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,370May 1, 2026Updated last week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,156Updated this week
- High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale☆5,446May 2, 2026Updated last week
- Apache DataFusion Ballista Distributed Query Engine☆2,024May 3, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,502Updated this week
- the portable Python dataframe library☆6,523Updated this week
- Distributed SQL Query Engine in Python using Ray☆245Oct 2, 2024Updated last year
- Apache DataFusion SQL Query Engine☆8,727Updated this week
- Apache DataFusion Python Bindings☆579Updated this week
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆276Apr 17, 2026Updated 3 weeks ago
- Turning PySpark Into a Universal DataFrame API☆506Apr 21, 2026Updated 2 weeks ago
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆3,057Apr 29, 2026Updated last week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆333Mar 28, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A composable and fully extensible C++ execution engine library for data management systems.☆4,115Updated this week
- A native Rust library for Delta Lake, with bindings into Python☆3,207Updated this week
- GlareDB: A light and fast SQL database for analytics☆1,012Nov 14, 2025Updated 5 months ago
- Apache DataFusion Ray☆230Oct 5, 2025Updated 7 months ago
- Extremely fast Query Engine for DataFrames, written in Rust☆38,420Updated this week
- Database connectivity API standard and libraries for Apache Arrow☆588Updated this week
- PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement☆10,816Updated this week
- Create full-fledged APIs for slowly moving datasets without writing a single line of code.☆3,414Mar 25, 2026Updated last month
- Codd method-chained SQL generator and Pandas data processing in Python.☆118Oct 19, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆118Updated this week
- New and extensible file format for storage of large columnar datasets.☆709Updated this week
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends☆2,134Updated this week
- A purely experimental DuckDB Deltalake extension☆95Apr 28, 2026Updated last week
- Transmute-free Rust library to work with the Arrow format☆1,067Feb 27, 2024Updated 2 years ago
- Analytical database for data-driven Web applications 🪶☆515Feb 25, 2025Updated last year
- The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL☆6,279Updated this week
- Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.☆2,389Updated this week
- DuckDB is an analytical in-process SQL database management system☆37,955Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Fastest library to load data from DB to DataFrames in Rust and Python☆2,598Apr 30, 2026Updated last week
- Distributed stream processing engine in Rust☆4,902Updated this week
- A lightweight data processing framework built on DuckDB and 3FS.☆4,951Mar 5, 2025Updated last year
- BoilingData JS client (NodeJS and Browsers)☆18Sep 25, 2024Updated last year
- Apache DataFusion Comet Spark Accelerator☆1,182Updated this week
- Malloy is a modern open source language for describing data relationships and transformations.☆2,460Updated this week
- Postgres-native columnar storage extension☆3,024Feb 10, 2025Updated last year