lancedb / lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming.
☆5,164 · Updated this week
Alternatives and similar repositories for lance
Users who are interested in lance are comparing it to the libraries listed below.
- Distributed query engine providing simple and reliable data processing for any modality and scale ☆3,224 · Updated this week
- Apache DataFusion SQL Query Engine ☆7,585 · Updated this week
- Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less. ☆7,217 · Updated last week
- Distributed stream processing engine in Rust ☆4,463 · Updated this week
- Apache DataFusion Ballista Distributed Query Engine ☆1,811 · Updated this week
- Apache OpenDAL: One Layer, All Storage. ☆4,322 · Updated this week
- Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database. ☆2,086 · Updated 5 months ago
- A native Rust library for Delta Lake, with bindings into Python ☆2,887 · Updated last week
- Embedded property graph database built for speed. Vector search and full-text search built in. Implements Cypher. ☆2,928 · Updated this week
- Official Rust implementation of Apache Arrow ☆3,076 · Updated this week
- Fastest library to load data from DB to DataFrames in Rust and Python ☆2,377 · Updated this week
- Postgres with GPUs for ML/AI apps. ☆6,414 · Updated last month
- Python Stream Processing ☆1,788 · Updated 4 months ago
- A composable and fully extensible C++ execution engine library for data management systems. ☆3,833 · Updated this week
- Stream processing and management platform. ☆8,189 · Updated this week
- 𝗔𝗜-𝗡𝗮𝘁𝗶𝘃𝗲 𝗗𝗮𝘁𝗮 𝗪𝗮𝗿𝗲𝗵𝗼𝘂𝘀𝗲. Open-source Snowflake alternative. Proven at petabyte scale with enterprise performance. B… ☆8,723 · Updated this week
- Making data lake work for time series ☆1,180 · Updated 11 months ago
- Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data. ☆6,067 · Updated this week
- Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, S… ☆3,019 · Updated 3 weeks ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans. ☆1,363 · Updated last week
- DuckDB-powered Postgres for high performance apps & analytics. ☆2,409 · Updated last week
- A transactional, relational-graph-vector database that uses Datalog for query. The hippocampus for AI! ☆3,675 · Updated 8 months ago
- An extensible, state of the art columnar file format. Formerly at @spiraldb, now a Linux Foundation project. ☆1,378 · Updated this week
- Apache Iceberg ☆1,050 · Updated this week
- WebAssembly version of DuckDB ☆1,680 · Updated 2 weeks ago
- GlareDB: A light and fast SQL database for analytics ☆951 · Updated last week
- ParadeDB is a modern Elasticsearch alternative built on Postgres. Built for real-time, update-heavy workloads. ☆7,561 · Updated this week
- the portable Python dataframe library ☆5,971 · Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,101 · Updated 4 months ago
- 🦆 A curated list of awesome DuckDB resources ☆1,970 · Updated 2 weeks ago