linkedin / venice
Venice, Derived Data Platform for Planet-Scale Workloads.
☆534Updated this week
Alternatives and similar repositories for venice:
Users that are interested in venice are comparing it to the libraries listed below
- This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB.☆621Updated this week
- Mirror of Apache Helix☆479Updated last week
- An extensible distributed system for reliable nearline data streaming at scale☆936Updated 11 months ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆835Updated 2 months ago
- Oxia - Metadata store and coordination system☆245Updated this week
- Open Control Plane for Tables in Data Lakehouse☆341Updated 2 weeks ago
- New file format for storage of large columnar datasets.☆532Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,185Updated this week
- Waltz is a quorum-based distributed write-ahead log for replicating transactions☆415Updated 2 years ago
- Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.☆902Updated this week
- MemQ is an efficient, scalable cloud native PubSub system☆136Updated 2 weeks ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,289Updated this week
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆327Updated last year
- A library that provides an embeddable, persistent key-value store for fast storage optimized for AWS☆788Updated 2 weeks ago
- ☆610Updated 3 weeks ago
- Apache DataFusion Comet Spark Accelerator☆935Updated this week
- A high-performance, reliable and extensible logging agent for uploading data to Kafka, Pulsar, etc.☆181Updated last week
- 🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. 🌊☆717Updated this week
- Cache File System optimized for columnar formats and object stores☆182Updated 2 years ago
- This is the companion repository for the book How Query Engines Work.☆387Updated last year
- The gateway component to make Spark on K8s much easier for Spark users.☆186Updated 2 months ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆204Updated last week
- Open source Java implementation for Raft consensus protocol.☆1,365Updated this week
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆299Updated last year
- DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing.☆60Updated 2 years ago
- Mirror of Apache Samza☆823Updated last month
- CMU-DB's Cascades optimizer framework☆397Updated 3 months ago
- Pravega - Streaming as a new software defined storage primitive☆1,996Updated last month
- Multi-hop declarative data pipelines☆114Updated this week
- ClickBench: a Benchmark For Analytical Databases☆785Updated this week