lancedb / lanceLinks
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
☆4,720Updated this week
Alternatives and similar repositories for lance
Users that are interested in lance are comparing it to the libraries listed below
Sorting:
- Distributed query engine providing simple and reliable data processing for any modality and scale☆2,873Updated this week
- Apache DataFusion SQL Query Engine☆7,274Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆1,763Updated this week
- Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.☆6,548Updated this week
- Distributed stream processing engine in Rust☆4,356Updated this week
- Official Rust implementation of Apache Arrow☆2,961Updated this week
- A composable and fully extensible C++ execution engine library for data management systems.☆3,764Updated this week
- Apache OpenDAL: One Layer, All Storage.☆4,151Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,326Updated this week
- A native Rust library for Delta Lake, with bindings into Python☆2,815Updated this week
- Postgres with GPUs for ML/AI apps.☆6,300Updated last month
- Fastest library to load data from DB to DataFrames in Rust and Python☆2,311Updated this week
- Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.☆2,056Updated 3 months ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,082Updated 2 months ago
- An extensible, state of the art columnar file format. Formerly at @spiraldb, now part of the Linux Foundation.☆1,269Updated this week
- Python Stream Processing☆1,751Updated 2 months ago
- Stream processing and management platform.☆7,825Updated this week
- Making data lake work for time series☆1,171Updated 9 months ago
- A transactional, relational-graph-vector database that uses Datalog for query. The hippocampus for AI!☆3,631Updated 6 months ago
- chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse☆2,381Updated this week
- Embedded property graph database built for speed. Vector search and full-text search built in. Implements Cypher.☆2,522Updated this week
- 𝗗𝗮𝘁𝗮, 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 & 𝗔𝗜. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://data…☆8,457Updated this week
- Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data.☆6,011Updated this week
- Apache Iceberg☆966Updated this week
- A cloud native embedded storage engine built on object storage.☆2,017Updated this week
- Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.☆1,472Updated this week
- The Feldera Incremental Computation Engine☆1,401Updated this week
- Transmute-free Rust library to work with the Arrow format☆1,061Updated last year
- Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, S…☆2,748Updated last week
- Extensible SQL Lexer and Parser for Rust☆3,084Updated this week