lancedb / lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
☆3,964Updated this week
Related projects ⓘ
Alternatives and complementary repositories for lance
- Distributed data engine for Python/SQL designed for the cloud, powered by Rust☆2,336Updated this week
- Apache DataFusion SQL Query Engine☆6,312Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆1,549Updated this week
- Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!☆4,766Updated this week
- A native Rust library for Delta Lake, with bindings into Python☆2,325Updated this week
- Making data lake work for time series☆1,139Updated 3 months ago
- Distributed stream processing engine in Rust☆3,794Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,013Updated last month
- Postgres with GPUs for ML/AI apps.☆6,038Updated last week
- Fastest library to load data from DB to DataFrames in Rust and Python☆2,015Updated this week
- Official Rust implementation of Apache Arrow☆2,606Updated this week
- Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.☆1,754Updated this week
- Transmute-free Rust library to work with the Arrow format☆1,063Updated 8 months ago
- The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.☆5,809Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,205Updated this week
- Python Stream Processing☆1,565Updated this week
- Apache OpenDAL: One Layer, All Storage.☆3,445Updated this week
- the portable Python dataframe library☆5,318Updated this week
- A composable and fully extensible C++ execution engine library for data management systems.☆3,520Updated this week
- Apache DataFusion Comet Spark Accelerator☆821Updated this week
- The Feldera Incremental Computation Engine☆768Updated this week
- A transactional, relational-graph-vector database that uses Datalog for query. The hippocampus for AI!☆3,422Updated 3 weeks ago
- GlareDB: An analytics DBMS for distributed data☆701Updated this week
- Best-in-class stream processing, analytics, and management. Perform continuous analytics, or build event-driven applications, real-time E…☆7,052Updated this week
- Malloy is an experimental language for describing data relationships and transformations.☆1,996Updated this week
- Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.☆1,727Updated this week
- 𝗗𝗮𝘁𝗮, 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 & 𝗔𝗜. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://data…☆7,867Updated this week
- An extensible, state-of-the-art columnar file format☆987Updated this week
- Apache Iceberg☆658Updated this week
- DuckDB-powered Postgres for high performance apps & analytics.☆1,620Updated this week