varchar-io / nebulaLinks
A distributed block-based data storage and compute engine
☆155Updated 11 months ago
Alternatives and similar repositories for nebula
Users that are interested in nebula are comparing it to the libraries listed below
Sorting:
- This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB.☆702Updated this week
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆301Updated last week
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆53Updated 2 years ago
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp☆90Updated 7 months ago
- ☆80Updated 3 years ago
- Love your Data. Love the Environment. Love VULKИ.☆43Updated 5 years ago
- Beneath is a serverless real-time data platform ⚡️☆84Updated 3 years ago
- Use SQL to build ELT pipelines on a data lakehouse.☆288Updated 3 years ago
- In Memory Property Graph Server using a Shared Nothing design☆45Updated 2 years ago
- Data pipelines from re-usable components☆107Updated 3 months ago
- Core C++ Sketch Library☆252Updated this week
- Distributed SQL Query Engine in Python using Ray☆245Updated last year
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆105Updated 3 weeks ago
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆267Updated 2 weeks ago
- In-memory, columnar, arrow-based database.☆48Updated 3 years ago
- AlloyDB is a distributed SQL database.☆75Updated 3 years ago
- Transactional functions-as-a-service for database-oriented applications.☆155Updated 2 years ago
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆332Updated 2 years ago
- Vectorized executor to speed up PostgreSQL☆335Updated 10 years ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆279Updated 10 months ago
- Serverless multi-protocol + multi-destination event collection system.☆210Updated last year
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.☆171Updated 2 years ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆160Updated 3 years ago
- Parquet-based ML data format optimized for working with unstructured data☆141Updated 3 years ago
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis.☆110Updated 9 months ago
- Bring RocksDB to PostgreSQL as an extension. It is the first foreign data wrapper (FDW) that introduces LSM-tree into PostgreSQL. The und…☆132Updated 3 years ago
- Storage Engine for block and key/value stores.☆25Updated this week
- Serverless query engine☆141Updated 3 years ago
- In-Memory Analytics with Apache Arrow, published by Packt☆104Updated last week
- Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.☆320Updated 3 weeks ago