marsupialtail / quokka
Making data lake work for time series
☆1,136Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for quokka
- Distributed data engine for Python/SQL designed for the cloud, powered by Rust☆2,306Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,199Updated this week
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v…☆3,929Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,003Updated last month
- Efficient data transformation and modeling framework that is backwards compatible with dbt.☆1,785Updated this week
- Malloy is an experimental language for describing data relationships and transformations.☆1,990Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆1,531Updated this week
- MetricFlow allows you to define, build, and maintain metrics in code.☆1,143Updated this week
- The Feldera Incremental Computation Engine☆720Updated this week
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)☆916Updated this week
- Database connectivity API standard and libraries for Apache Arrow☆382Updated this week
- Apache DataFusion Python Bindings☆373Updated this week
- A native Rust library for Delta Lake, with bindings into Python☆2,308Updated this week
- Fastest library to load data from DB to DataFrames in Rust and Python☆1,995Updated this week
- GlareDB: An analytics DBMS for distributed data☆678Updated last week
- Turning PySpark Into a Universal DataFrame API☆317Updated this week
- Work with your web service, database, and streaming schemas in a single format.☆330Updated 7 months ago
- Apache DataFusion Comet Spark Accelerator☆816Updated this week
- An extensible, state-of-the-art columnar file format☆967Updated this week
- New file format for storage of large columnar datasets.☆449Updated this week
- Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.☆1,707Updated this week
- Apache PyIceberg☆461Updated this week
- Distributed SQL Engine in Python using Dask☆393Updated 2 months ago
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆303Updated last year
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton☆863Updated last year
- Analytical database for data-driven Web applications 🪶☆434Updated this week
- Distributed SQL Query Engine in Python using Ray☆238Updated last month
- Python Stream Processing☆1,543Updated this week
- WebAssembly version of DuckDB☆1,271Updated this week
- Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metada…☆1,848Updated this week