Eventual-Inc / Daft
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
☆2,476 · Updated this week
Alternatives and similar repositories for Daft:
Users who are interested in Daft are comparing it to the libraries listed below:
- Making data lake work for time series ☆1,147 · Updated 4 months ago
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v… ☆4,119 · Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,032 · Updated 3 months ago
- Python Stream Processing ☆1,605 · Updated last month
- A native Rust library for Delta Lake, with bindings into Python ☆2,483 · Updated this week
- Fastest library to load data from DB to DataFrames in Rust and Python ☆2,066 · Updated this week
- Apache DataFusion Ballista Distributed Query Engine ☆1,618 · Updated this week
- Efficient data transformation and modeling framework that is backwards compatible with dbt. ☆1,961 · Updated this week
- Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metada… ☆1,977 · Updated this week
- LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads. ☆601 · Updated this week
- Chronon is a data platform for serving for AI/ML applications. ☆762 · Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans. ☆1,238 · Updated this week
- An extensible, state-of-the-art columnar file format ☆1,070 · Updated this week
- Apache PyIceberg ☆551 · Updated this week
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org) ☆968 · Updated last week
- 🦆 A curated list of awesome DuckDB resources ☆1,490 · Updated this week
- Better SQL in Jupyter. 📊 ☆736 · Updated last week
- Apache DataFusion Comet Spark Accelerator ☆866 · Updated this week
- Malloy is an experimental language for describing data relationships and transformations. ☆2,027 · Updated this week
- Apache Iceberg ☆778 · Updated this week
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store. ☆1,826 · Updated this week
- GlareDB: An analytics DBMS for distributed data ☆752 · Updated this week
- A light-weight, flexible, and expressive statistical data testing library ☆3,546 · Updated this week
- Apache DataFusion Python Bindings ☆400 · Updated this week
- DuckDB-powered Postgres for high performance apps & analytics. ☆1,883 · Updated this week
- Apache DataFusion SQL Query Engine ☆6,628 · Updated this week
- A portable SQL query and AI compute engine, written in Rust, for data-grounded apps and agents. ☆1,997 · Updated this week
- WebAssembly version of DuckDB ☆1,383 · Updated last week
- Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code. ☆1,810 · Updated this week
- Lightweight and extensible compatibility layer between dataframe libraries! ☆759 · Updated this week