marsupialtail / quokka
Making data lake work for time series
☆1,153Updated 6 months ago
Alternatives and similar repositories for quokka:
Users that are interested in quokka are comparing it to the libraries listed below
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,043Updated 5 months ago
- Distributed data engine for Python/SQL designed for the cloud, powered by Rust☆2,573Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,264Updated 2 weeks ago
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v…☆4,227Updated this week
- Malloy is an experimental language for describing data relationships and transformations.☆2,080Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆1,674Updated last week
- Apache DataFusion Comet Spark Accelerator☆902Updated this week
- Fastest library to load data from DB to DataFrames in Rust and Python☆2,135Updated last week
- An extensible, state-of-the-art columnar file format☆1,118Updated this week
- Apache DataFusion Python Bindings☆418Updated this week
- Database connectivity API standard and libraries for Apache Arrow☆412Updated this week
- The Feldera Incremental Computation Engine☆1,172Updated this week
- Efficient data transformation and modeling framework that is backwards compatible with dbt.☆2,118Updated this week
- GlareDB: An analytics DBMS for distributed data☆772Updated this week
- WebAssembly version of DuckDB☆1,449Updated last week
- New file format for storage of large columnar datasets.☆488Updated this week
- Turning PySpark Into a Universal DataFrame API☆370Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆319Updated last year
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive (AI) workloads.☆640Updated this week
- A native Rust library for Delta Lake, with bindings into Python☆2,588Updated this week
- Analytical database for data-driven Web applications 🪶☆477Updated this week
- Work with your web service, database, and streaming schemas in a single format.☆343Updated 11 months ago
- Apache PyIceberg☆618Updated this week
- Distributed SQL Engine in Python using Dask☆400Updated 6 months ago
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)☆1,002Updated last week
- Postgres-native columnar storage extension☆2,901Updated 3 weeks ago
- Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.☆1,885Updated this week
- Python Stream Processing☆1,656Updated last week
- Stream Arrow data into Postgres☆257Updated 10 months ago
- SQLAlchemy driver for DuckDB☆391Updated this week