Spark integrations for working with Lance datasets
☆53Jun 10, 2026Updated this week
Alternatives and similar repositories for lance-spark
Users that are interested in lance-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lance Namespace is an open specification for describing access and operations against a collection of tables in a multimodal lakehouse☆55Updated this week
- Integration between Lance and Ray for distributed data processing☆30Updated this week
- Community Java bindings for https://github.com/facebookincubator/velox☆41Updated this week
- A re-implementation of Hadoop DistCP in Apache Spark☆47Dec 20, 2023Updated 2 years ago
- Client libraries of end users of Apache Kyuubi☆11May 15, 2026Updated 3 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16May 22, 2026Updated 2 weeks ago
- A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.☆31Feb 5, 2026Updated 4 months ago
- Alluxio Python client - Access Any Data Source with Python☆31Sep 29, 2025Updated 8 months ago
- Apache Hive Metastore in Standalone Mode With Docker☆14Jul 22, 2024Updated last year
- Olympia is a storage-only open catalog format for big data analytics, ML & AI.☆16May 5, 2025Updated last year
- Apache DataFusion Ray☆231May 15, 2026Updated 3 weeks ago
- HashCats Auto Clicker is a versatile tool that enhances your gaming experience by automating various actions within the HashCats game☆18Updated this week
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆324Updated this week
- Patches for Pokemon Snakewood to implement modern QoL features.☆14Jan 16, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Apache OpenDAL Go Binding Services Releases☆16Jun 1, 2026Updated last week
- OSPP 2022 Project: String Adaptive Hash Table for Databend☆19Sep 15, 2022Updated 3 years ago
- ☆100Updated this week
- An open platform for capturing, integrating, storing, and sharing biological knowledge in and across organizations.☆23Jan 28, 2017Updated 9 years ago
- Persistent data structures - immutable copy-on-write lists, maps and sets for Java☆11Feb 14, 2021Updated 5 years ago
- The home of Floecat: A catalog of catalogs for open table formats☆82Updated this week
- Exploring Kotlin Symbol Processing - KSP. This is just an experiment.☆13Jul 26, 2021Updated 4 years ago
- Flink Agents is an Agentic AI framework based on Apache Flink☆387Updated this week
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆15Feb 15, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Idempotent query executor☆53Apr 28, 2025Updated last year
- Apache Iceberg Documentation Site☆42Feb 5, 2024Updated 2 years ago
- Radio is a DuckDB extension by Query.Farm that brings real-time event streams into your SQL workflows. It enables DuckDB to receive and s…☆40Mar 29, 2026Updated 2 months ago
- ☆49Feb 14, 2022Updated 4 years ago
- Monitoring and insights on your data lakehouse tables☆32May 22, 2026Updated 2 weeks ago
- Sandboxing C in Rust☆19Jun 16, 2025Updated 11 months ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- A complete data engineering project demonstrating modern data stack practices with Apache Flink, Iceberg, Trino and Superset☆25Sep 29, 2025Updated 8 months ago
- Apache Paimon Rust The rust implementation of Apache Paimon.☆175Jun 3, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- MCP server for Apache Gravitino☆22Jul 3, 2025Updated 11 months ago
- attempt to create a library of code snippets I use a lot☆18Oct 3, 2014Updated 11 years ago
- Feature branche for the pokeemerald decompilation. See the wiki for more info.☆26Dec 20, 2025Updated 5 months ago
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆451May 27, 2026Updated 2 weeks ago
- PostgreSQL Lance Table Extension☆26Dec 27, 2025Updated 5 months ago
- Tasks API for Stateful Functions on Flink☆13May 10, 2026Updated last month
- bash script to find and execute java classes with main methods☆20Oct 24, 2025Updated 7 months ago