linkedin / venice
Venice, Derived Data Platform for Planet-Scale Workloads.
☆537 Updated this week
Alternatives and similar repositories for venice
Users who are interested in venice are comparing it to the repositories listed below.
- New file format for storage of large columnar datasets. ☆549 Updated last week
- Oxia - Metadata store and coordination system ☆248 Updated last week
- Waltz is a quorum-based distributed write-ahead log for replicating transactions ☆423 Updated 2 years ago
- ☆610 Updated last month
- Astra is a structured log search and analytics engine developed by Slack and Salesforce ☆226 Updated this week
- ClickBench: a Benchmark For Analytical Databases ☆796 Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics ☆1,207 Updated this week
- MemQ is an efficient, scalable cloud native PubSub system ☆136 Updated 2 weeks ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages. ☆837 Updated last week
- Open Control Plane for Tables in Data Lakehouse ☆350 Updated this week
- Apache DataFusion Comet Spark Accelerator ☆946 Updated this week
- A library that provides an embeddable, persistent key-value store for fast storage optimized for AWS ☆789 Updated last month
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines. ☆1,343 Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans. ☆1,314 Updated last week
- Blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core. ☆1,464 Updated this week
- Mirror of Apache Helix ☆481 Updated this week
- Multi-hop declarative data pipelines ☆115 Updated this week
- Remote shuffle service for Apache Spark to store shuffle data on remote servers. ☆327 Updated last year
- A Raft Library in C++ based on the Raft implementation in Apache Kudu ☆133 Updated last week
- GlareDB: A light and fast SQL database for analytics ☆822 Updated this week
- High-performance, low-footprint SQL database written in C++. Process millions of rows per second from Kafka/Pulsar, Iceberg, or ClickHous… ☆1,791 Updated 3 weeks ago
- An extensible distributed system for reliable nearline data streaming at scale ☆937 Updated 11 months ago
- A Relational Database Backed by Apache Kafka ☆390 Updated last week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake ☆257 Updated last week
- This is the companion repository for the book How Query Engines Work. ☆390 Updated 2 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads. ☆425 Updated 3 years ago
- Iceberg is a table format for large, slow-moving tabular data ☆479 Updated 2 years ago
- A Redis Module that makes it possible to create a consistent Raft cluster from multiple Redis instances. ☆825 Updated last year
- A RocksDB compliant high performance scalable embedded key-value store ☆975 Updated 11 months ago
- A collection of RBIR projects and posts for anyone interested in joining this journey. ☆241 Updated this week