varchar-io / nebulaLinks
A distributed block-based data storage and compute engine
☆155Updated 7 months ago
Alternatives and similar repositories for nebula
Users that are interested in nebula are comparing it to the libraries listed below
Sorting:
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆302Updated last year
- Use SQL to build ELT pipelines on a data lakehouse.☆288Updated 3 years ago
- This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB.☆668Updated last week
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp☆86Updated 2 months ago
- Core C++ Sketch Library☆240Updated last month
- In Memory Property Graph Server using a Shared Nothing design☆44Updated 2 years ago
- Distributed SQL Query Engine in Python using Ray☆244Updated 11 months ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆63Updated 2 weeks ago
- Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.☆317Updated 4 months ago
- Data pipelines from re-usable components☆107Updated 2 years ago
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆53Updated last year
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆240Updated this week
- ☆80Updated 2 years ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆256Updated 5 months ago
- Distributed SQL Engine in Python using Dask☆408Updated last year
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆330Updated 2 years ago
- Vectorized executor to speed up PostgreSQL☆335Updated 10 years ago
- Love your Data. Love the Environment. Love VULKИ.☆43Updated 5 years ago
- Apache datasketches☆99Updated 2 years ago
- Beneath is a serverless real-time data platform ⚡️☆84Updated 3 years ago
- In-Memory Analytics with Apache Arrow, published by Packt☆104Updated 2 weeks ago
- The most valuable time series database in the universe☆33Updated 3 years ago
- In-memory, columnar, arrow-based database.☆49Updated 3 years ago
- Firebolt Core is a free, self-hosted edition of Firebolt's distributed query engine (https://www.firebolt.io/); it provides high-performa…☆175Updated 3 weeks ago
- Transactional functions-as-a-service for database-oriented applications.☆155Updated last year
- Apache Iceberg C++☆132Updated this week
- Bring RocksDB to PostgreSQL as an extension. It is the first foreign data wrapper (FDW) that introduces LSM-tree into PostgreSQL. The und…☆130Updated 2 years ago
- OtterTune Agent - metric collector for external databases☆72Updated last year
- ☆48Updated 2 months ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 2 years ago