Eventual-Inc / Daft
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
☆2,633 · Updated this week
Alternatives and similar repositories for Daft:
Users who are interested in Daft are comparing it to the libraries listed below:
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v… ☆4,281 · Updated this week
- Making data lake work for time series ☆1,157 · Updated 7 months ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,055 · Updated 6 months ago
- Fastest library to load data from DB to DataFrames in Rust and Python ☆2,190 · Updated this week
- A native Rust library for Delta Lake, with bindings into Python ☆2,621 · Updated this week
- Python Stream Processing ☆1,674 · Updated this week
- Efficient data transformation and modeling framework that is backwards compatible with dbt. ☆2,180 · Updated this week
- Malloy is an experimental language for describing data relationships and transformations. ☆2,099 · Updated this week
- Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metada… ☆2,058 · Updated this week
- Apache DataFusion Ballista Distributed Query Engine ☆1,689 · Updated this week
- WebAssembly version of DuckDB ☆1,491 · Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans. ☆1,274 · Updated this week
- Apache PyIceberg ☆640 · Updated this week
- A curated list of Polars talks, tools, examples & articles. Contributions welcome! ☆848 · Updated this week
- Lightweight and extensible compatibility layer between dataframe libraries! ☆884 · Updated this week
- the portable Python dataframe library ☆5,608 · Updated this week
- GlareDB: An analytics DBMS for distributed data ☆776 · Updated this week
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive (AI) workloads. ☆687 · Updated this week
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org) ☆1,022 · Updated this week
- 🦆 A curated list of awesome DuckDB resources ☆1,646 · Updated this week
- Chronon is a data platform for serving for AI/ML applications. ☆783 · Updated this week
- DuckDB-powered Postgres for high performance apps & analytics. ☆2,091 · Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics ☆1,159 · Updated this week
- Apache DataFusion Comet Spark Accelerator ☆918 · Updated this week
- Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code. ☆1,964 · Updated this week
- Distributed stream processing engine in Rust ☆4,093 · Updated this week
- chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse ☆2,297 · Updated last week
- A light-weight, flexible, and expressive statistical data testing library ☆3,688 · Updated 2 weeks ago
- Better SQL in Jupyter. 📊 ☆752 · Updated 2 weeks ago
- An extensible, state-of-the-art columnar file format ☆1,139 · Updated this week