varchar-io / nebula
A distributed block-based data storage and compute engine
☆154 Updated last month
Alternatives and similar repositories for nebula:
Users that are interested in nebula are comparing it to the libraries listed below
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆327 Updated 2 years ago
- Use SQL to build ELT pipelines on a data lakehouse. ☆285 Updated 2 years ago
- Demos of Materialize, the operational data warehouse. ☆51 Updated 3 weeks ago
- The metrics layer for your data. Join us at https://metriql.com/slack ☆307 Updated 2 years ago
- Open-source metadata collector based on ODD Specification ☆43 Updated last year
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. ☆99 Updated last month
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flyte's memoization s… ☆54 Updated last year
- Data Tools Subjective List ☆83 Updated last year
- Data pipelines from re-usable components ☆108 Updated 2 years ago
- Beneath is a serverless real-time data platform ⚡️ ☆84 Updated 3 years ago
- Apache Iceberg C++ ☆58 Updated this week
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable. ☆169 Updated last year
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started! ☆62 Updated 2 years ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating) ☆61 Updated 3 months ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆198 Updated this week
- ☆79 Updated 2 years ago
- ODD Specification is a universal open standard for collecting metadata. ☆137 Updated 5 months ago
- sgr (command line client for Splitgraph) and the splitgraph Python library ☆322 Updated 11 months ago
- Multi-hop declarative data pipelines ☆112 Updated last week
- Serverless multi-protocol + multi-destination event collection system. ☆202 Updated 4 months ago
- Distributed SQL Query Engine in Python using Ray ☆243 Updated 6 months ago
- Work with your web service, database, and streaming schemas in a single format. ☆343 Updated last year
- ☆41 Updated 2 years ago
- Schema modelling framework for decentralised domain-driven ownership of data. ☆251 Updated last year
- The most valuable time series database in the universe ☆33 Updated 3 years ago
- ☆137 Updated last month
- Parquet-based ML data format optimized for working with unstructured data ☆140 Updated 2 years ago
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. It enables anyone inside an or… ☆92 Updated 2 years ago
- A purely experimental DuckDB Deltalake extension ☆95 Updated last week
- API Server for chDB, an in-process SQL OLAP Engine powered by ClickHouse ☆21 Updated last year