Spark integrations for working with Lance datasets
☆47Apr 24, 2026Updated last week
Alternatives and similar repositories for lance-spark
Users that are interested in lance-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lance Namespace is an open specification for describing access and operations against a collection of tables in a multimodal lakehouse☆53Apr 25, 2026Updated last week
- Community Java bindings for https://github.com/facebookincubator/velox☆41Updated this week
- A re-implementation of Hadoop DistCP in Apache Spark☆47Dec 20, 2023Updated 2 years ago
- Client libraries of end users of Apache Kyuubi☆11Jan 10, 2023Updated 3 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16Jan 4, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.☆31Feb 5, 2026Updated 2 months ago
- Testing Sandbox for Hadoop Ecosystem Components☆44Apr 20, 2026Updated last week
- Hive for MR3☆39Updated this week
- ☆13Jun 10, 2024Updated last year
- Alluxio Python client - Access Any Data Source with Python☆31Sep 29, 2025Updated 7 months ago
- Apache Hive Metastore in Standalone Mode With Docker☆14Jul 22, 2024Updated last year
- Olympia is a storage-only open catalog format for big data analytics, ML & AI.☆16May 5, 2025Updated 11 months ago
- Apache DataFusion Ray☆230Oct 5, 2025Updated 6 months ago
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆322Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Patches for Pokemon Snakewood to implement modern QoL features.☆13Jan 16, 2026Updated 3 months ago
- Apache OpenDAL Go Binding Services Releases☆15Sep 11, 2025Updated 7 months ago
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.☆1,266Apr 25, 2026Updated last week
- OSPP 2022 Project: String Adaptive Hash Table for Databend☆19Sep 15, 2022Updated 3 years ago
- 同步数据的小工具☆17Feb 27, 2026Updated 2 months ago
- An open platform for capturing, integrating, storing, and sharing biological knowledge in and across organizations.☆24Jan 28, 2017Updated 9 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆124Apr 23, 2026Updated last week
- World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.☆2,942Updated this week
- Persistent data structures - immutable copy-on-write lists, maps and sets for Java☆11Feb 14, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The home of Floecat: A catalog of catalogs for open table formats☆72Apr 25, 2026Updated last week
- Flink Agents is an Agentic AI framework based on Apache Flink☆361Apr 22, 2026Updated last week
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆15Feb 15, 2025Updated last year
- Radio is a DuckDB extension by Query.Farm that brings real-time event streams into your SQL workflows. It enables DuckDB to receive and s…☆37Mar 29, 2026Updated last month
- ☆49Feb 14, 2022Updated 4 years ago
- Sandboxing C in Rust☆19Jun 16, 2025Updated 10 months ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- Apache Paimon Rust The rust implementation of Apache Paimon.☆163Updated this week
- A complete data engineering project demonstrating modern data stack practices with Apache Flink, Iceberg, Trino and Superset☆24Sep 29, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Voyage BI 是一个开源的可以定制化开发的开源BI平台/工具,通过配置数据源连接,开发数据集,图表开发中拖拉拽方式快速制作看板/报表并分享到外部。提供主题定制、组件定制、自由筛选联动等功能☆30Apr 7, 2026Updated 3 weeks ago
- A playground to experience Gravitino☆76Mar 16, 2026Updated last month
- MCP server for Apache Gravitino☆21Jul 3, 2025Updated 9 months ago
- Feature branche for the pokeemerald decompilation. See the wiki for more info.☆24Dec 20, 2025Updated 4 months ago
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆449Apr 7, 2026Updated 3 weeks ago
- PostgreSQL Lance Table Extension☆25Dec 27, 2025Updated 4 months ago
- Tasks API for Stateful Functions on Flink☆13Feb 28, 2026Updated 2 months ago