Eventual-Inc / Daft
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
☆2,808 Updated this week
Alternatives and similar repositories for Daft:
Users interested in Daft are comparing it to the libraries listed below.
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v… ☆4,584 Updated this week
- Fastest library to load data from DB to DataFrames in Rust and Python ☆2,245 Updated this week
- Making data lake work for time series ☆1,167 Updated 8 months ago
- Scalable and efficient data transformation framework - backwards compatible with dbt. ☆2,293 Updated this week
- Apache DataFusion Ballista Distributed Query Engine ☆1,741 Updated this week
- The portable Python dataframe library ☆5,723 Updated this week
- A native Rust library for Delta Lake, with bindings into Python ☆2,719 Updated last week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,077 Updated last month
- Python Stream Processing ☆1,723 Updated last month
- GlareDB: A light and fast SQL database for analytics ☆811 Updated this week
- Apache DataFusion Comet Spark Accelerator ☆939 Updated this week
- Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metada… ☆2,123 Updated last month
- DuckDB-powered Postgres for high performance apps & analytics. ☆2,196 Updated this week
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org) ☆1,068 Updated this week
- Apache DataFusion SQL Query Engine ☆7,145 Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans. ☆1,302 Updated this week
- Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code. ☆2,036 Updated this week
- Apache PyIceberg ☆714 Updated this week
- data load tool (dlt) is an open source Python library that makes data loading easy 🛠️ ☆3,567 Updated this week
- Malloy is an experimental language for describing data relationships and transformations. ☆2,141 Updated this week
- An extensible, state-of-the-art columnar file format ☆1,226 Updated this week
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive (AI) workloads. ☆733 Updated this week
- Compare tables within or across databases ☆2,969 Updated 11 months ago
- Apache Iceberg ☆919 Updated this week
- WebAssembly version of DuckDB ☆1,565 Updated this week
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io ☆2,083 Updated this week
- Distributed stream processing engine in Rust ☆4,311 Updated this week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr… ☆2,047 Updated this week
- Chronon is a data platform for serving AI/ML applications. ☆794 Updated this week
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust. ☆621 Updated this week