varchar-io / nebula
A distributed block-based data storage and compute engine
☆154 · Updated 4 months ago
Alternatives and similar repositories for nebula
Users who are interested in nebula are comparing it to the libraries listed below.
- Cylon is a fast, scalable, distributed-memory, parallel runtime with a Pandas-like DataFrame. ☆301 · Updated last year
- Data pipelines from re-usable components ☆108 · Updated 2 years ago
- The metrics layer for your data. Join us at https://metriql.com/slack ☆309 · Updated 2 years ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆222 · Updated last week
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flyte's memoization s… ☆54 · Updated last year
- This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB. ☆634 · Updated this week
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. ☆101 · Updated last month
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆330 · Updated 2 years ago
- Distributed SQL Query Engine in Python using Ray ☆243 · Updated 8 months ago
- Use SQL to build ELT pipelines on a data lakehouse. ☆287 · Updated 3 years ago
- Apache Iceberg C++ ☆87 · Updated this week
- New file format for storage of large columnar datasets. ☆560 · Updated this week
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper) ☆245 · Updated 2 months ago
- A BYOC option for Snowflake workloads ☆77 · Updated this week
- In Memory Property Graph Server using a Shared Nothing design ☆44 · Updated last year
- Core C++ Sketch Library ☆233 · Updated last week
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating) ☆61 · Updated 6 months ago
- Demos of Materialize, the operational data warehouse. ☆51 · Updated 3 months ago
- ☆79 · Updated 2 years ago
- Ibis Substrait Compiler ☆103 · Updated this week
- In-Memory Analytics with Apache Arrow, published by Packt ☆100 · Updated last year
- Beneath is a serverless real-time data platform ⚡️ ☆84 · Updated 3 years ago
- Open-source metadata collector based on ODD Specification ☆44 · Updated last year
- Data Tools Subjective List ☆83 · Updated last year
- Serverless multi-protocol + multi-destination event collection system. ☆206 · Updated 7 months ago
- Lakehouse storage system benchmark ☆75 · Updated 2 years ago
- VectorSQL is a free analytics DBMS for IoT & Big Data, compatible with ClickHouse. ☆294 · Updated 3 years ago
- The most valuable time series database in the universe ☆33 · Updated 3 years ago
- Distributed SQL Engine in Python using Dask ☆405 · Updated 9 months ago
- Hops Hadoop is a distribution of Apache Hadoop with distributed metadata. ☆316 · Updated last month