linkedin / veniceLinks
Venice, Derived Data Platform for Planet-Scale Workloads.
☆546Updated this week
Alternatives and similar repositories for venice
Users that are interested in venice are comparing it to the libraries listed below
Sorting:
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆840Updated last month
- New file format for storage of large columnar datasets.☆560Updated this week
- This is the companion repository for the book How Query Engines Work.☆392Updated 2 years ago
- Open Control Plane for Tables in Data Lakehouse☆355Updated this week
- ☆609Updated 2 months ago
- Mirror of Apache Helix☆481Updated this week
- Apache DataFusion Comet Spark Accelerator☆974Updated this week
- An extensible distributed system for reliable nearline data streaming at scale☆940Updated last year
- Oxia - Metadata store and coordination system☆256Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,246Updated this week
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆327Updated last year
- This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB.☆634Updated last week
- A library that provides an embeddable, persistent key-value store for fast storage optimized for AWS☆795Updated 3 weeks ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,375Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,337Updated this week
- Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.☆937Updated this week
- A load balancer / proxy / gateway for prestodb☆358Updated 11 months ago
- Iceberg is a table format for large, slow-moving tabular data☆480Updated 2 years ago
- Waltz is a quorum-based distributed write-ahead log for replicating transactions☆424Updated 2 years ago
- CMU-DB's Cascades optimizer framework☆400Updated 5 months ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆299Updated last year
- The gateway component to make Spark on K8s much easier for Spark users.☆193Updated last month
- An open protocol for secure data sharing☆848Updated last week
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆75Updated 3 months ago
- Astra is a structured log search and analytics engine developed by Slack and Salesforce☆227Updated this week
- Mirror of Apache Samza☆826Updated last month
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.☆742Updated this week
- GlareDB: A light and fast SQL database for analytics☆906Updated this week
- Distributed SQL Query Engine in Python using Ray☆243Updated 8 months ago
- High Performance Embedded Key-Value Store☆712Updated last week