apache / datasketches-cpp
Core C++ Sketch Library
☆230Updated last month
Alternatives and similar repositories for datasketches-cpp:
Users that are interested in datasketches-cpp are comparing it to the libraries listed below
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆238Updated 10 months ago
- A modular acceleration toolkit for big data analytic engines☆68Updated 10 months ago
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat…☆254Updated 6 years ago
- Apache Iceberg C++☆51Updated this week
- Cuckoo Index: A Lightweight Secondary Index Structure☆130Updated 3 years ago
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆121Updated last week
- Multi-core Window-Based Stream Processing Engine☆71Updated 3 years ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.☆45Updated 10 months ago
- ☆13Updated 3 weeks ago
- An adaptive radix tree for efficient indexing in main memory.☆154Updated last year
- BI benchmark with user generated data and queries☆64Updated 3 months ago
- Apache datasketches☆95Updated 2 years ago
- Mirror of Apache crail (Incubating)☆149Updated 2 years ago
- Website for DataSketches.☆98Updated this week
- Reproducing TPC-DS qualification/reference results☆32Updated last year
- ☆82Updated this week
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Updated 2 years ago
- Towards a New File Format☆210Updated 3 weeks ago
- ☆26Updated 5 years ago
- Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshif…☆58Updated 6 months ago
- A Relational Optimizer and Executor☆66Updated 3 months ago
- Apache Quickstep Incubator - This project is retired☆95Updated 6 years ago
- Apache Parquet☆443Updated 10 months ago
- ☆71Updated 8 months ago
- ☆71Updated 2 years ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆198Updated last week
- Distributed SQL Query Engine in Python using Ray☆243Updated 5 months ago
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp☆86Updated 2 months ago
- An embedded key-value store library specialized for building state machine and log store☆226Updated this week
- New file format for storage of large columnar datasets.☆494Updated this week