varchar-io / nebulaLinks
A distributed block-based data storage and compute engine
☆155Updated 11 months ago
Alternatives and similar repositories for nebula
Users that are interested in nebula are comparing it to the libraries listed below
Sorting:
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆301Updated last year
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp☆90Updated 6 months ago
- ☆80Updated 3 years ago
- This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB.☆694Updated this week
- Use SQL to build ELT pipelines on a data lakehouse.☆288Updated 3 years ago
- Distributed SQL Query Engine in Python using Ray☆246Updated last year
- Bring RocksDB to PostgreSQL as an extension. It is the first foreign data wrapper (FDW) that introduces LSM-tree into PostgreSQL. The und…☆131Updated 3 years ago
- Data pipelines from re-usable components☆107Updated 2 months ago
- Core C++ Sketch Library☆253Updated last week
- Multi-core Window-Based Stream Processing Engine☆73Updated 4 years ago
- In-Memory Analytics with Apache Arrow, published by Packt☆104Updated last month
- In Memory Property Graph Server using a Shared Nothing design☆45Updated 2 years ago
- Transactional functions-as-a-service for database-oriented applications.☆155Updated 2 years ago
- ☆14Updated 5 months ago
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆332Updated 2 years ago
- The most valuable time series database in the universe☆33Updated 3 years ago
- Beneath is a serverless real-time data platform ⚡️☆84Updated 3 years ago
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆53Updated 2 years ago
- A home for LinkedIn's changes to Apache Iceberg☆63Updated last week
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆267Updated last week
- Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.☆320Updated 8 months ago
- ☆152Updated 9 months ago
- An arrow flight extension to support ticking datasets via IPC☆28Updated last month
- Distributed SQL Engine in Python using Dask☆409Updated last year
- Love your Data. Love the Environment. Love VULKИ.☆43Updated 5 years ago
- Documentation for Hyper, the blazingly fast SQL engine powering analytics at Tableau and Salesforce☆32Updated last week
- Embeddable Cloud-Native Key-Value Storage For Sequential Data.☆105Updated last month
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆279Updated 9 months ago
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.☆171Updated last year
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 3 years ago