linkedin / venice
Venice, Derived Data Platform for Planet-Scale Workloads.
☆502Updated this week
Alternatives and similar repositories for venice:
Users that are interested in venice are comparing it to the libraries listed below
- Apache Iceberg☆778Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,101Updated this week
- This is the companion repository for the book How Query Engines Work.☆376Updated last year
- New file format for storage of large columnar datasets.☆464Updated this week
- ☆611Updated last year
- GlareDB: An analytics DBMS for distributed data☆752Updated this week
- Iceberg is a table format for large, slow-moving tabular data☆480Updated last year
- Apache DataFusion Comet Spark Accelerator☆866Updated this week
- Open Control Plane for Tables in Data Lakehouse☆321Updated this week
- An extensible distributed system for reliable nearline data streaming at scale☆930Updated 7 months ago
- Apache Pinot Documentation☆24Updated this week
- Multi-hop declarative data pipelines☆103Updated this week
- Oxia - Metadata store and coordination system☆231Updated this week
- Kafka-on-Pulsar - A protocol handler that brings native Kafka protocol to Apache Pulsar☆450Updated 11 months ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆811Updated 2 months ago
- ClickBench: a Benchmark For Analytical Databases☆709Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆379Updated this week
- A high-performance, reliable and extensible logging agent for uploading data to Kafka, Pulsar, etc.☆180Updated 3 weeks ago
- Change Data Capture (CDC) service☆441Updated 6 months ago
- A library that provides an embeddable, persistent key-value store for fast storage optimized for AWS☆771Updated 3 months ago
- ☆175Updated this week
- Lakekeeper: A Rust native Iceberg REST Catalog☆377Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,238Updated this week
- 🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. 🌊☆669Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆1,618Updated this week
- Mirror of Apache Helix☆472Updated this week
- MemQ is an efficient, scalable cloud native PubSub system☆135Updated last month
- This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB.☆591Updated this week
- Waltz is a quorum-based distributed write-ahead log for replicating transactions☆415Updated last year