apache/datasketches-cpp

Core C++ Sketch Library

☆267

Alternatives and similar repositories for datasketches-cpp

Users who are interested in datasketches-cpp are comparing it to the libraries listed below.

apache / datasketches-postgresql
PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp
☆94 · May 15, 2026 · Updated 2 months ago
apache / datasketches
A software library of stochastic streaming algorithms, a.k.a. sketches.
☆116 · May 15, 2026 · Updated 2 months ago
vlad17 / datasketches-rs
Rusty wrapper for Apache DataSketches
☆13 · Aug 17, 2025 · Updated 11 months ago
apache / datasketches-pig
Sketch adaptors for Pig.
☆10 · May 15, 2026 · Updated 2 months ago
apache / datasketches-go
Apache datasketches
☆29 · Jun 27, 2026 · Updated 3 weeks ago
dnbaker / sketch
C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
☆157 · Jul 23, 2024 · Updated last year
apache / datasketches-vector
Sketch Library for vector-based models
☆15 · May 15, 2026 · Updated 2 months ago
sliding-sketch / Sliding-Sketch
☆25 · Apr 4, 2024 · Updated 2 years ago
facebookincubator / velox
A composable and fully extensible C++ execution engine library for data management systems.
☆4,173 · Updated this week
apache / datasketches-hive
Sketch adaptors for Hive.
☆51 · May 15, 2026 · Updated 2 months ago
dynatrace-research / set-sketch-paper
SetSketch: Filling the Gap between MinHash and HyperLogLog
☆49 · Aug 11, 2021 · Updated 4 years ago
edoliberty / streaming-quantiles
Implements the Karnin-Lang-Liberty (KLL) algorithm in python
☆59 · Nov 19, 2022 · Updated 3 years ago
hyrise / hyrise
Hyrise is a research in-memory database.
☆869 · Updated this week
maropu / datasketches-spark
Data Sketches for Apache Spark
☆22 · Dec 22, 2022 · Updated 3 years ago
kul-optec / libForBES
libForBES is a C++ solver for generic, constrained and possibly nonsmooth convex optimization problems. LASSO, optimal control, elastic n…
☆10 · Apr 11, 2017 · Updated 9 years ago
tobc / dartminhash
DartMinHash: Fast Sketching for Weighted Sets
☆12 · Dec 8, 2025 · Updated 7 months ago
apache / datasketches-memory
High performance native memory access for Java.
☆134 · Jul 13, 2026 · Updated last week
cmu-db / noisepage
Self-Driving Database Management System from Carnegie Mellon University
☆1,766 · Nov 8, 2022 · Updated 3 years ago
TimoKersten / db-engine-paradigms
Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat…
☆270 · Jul 18, 2018 · Updated 8 years ago
jiecchen / StreamingCC
A C++ library for summarizing data streams
☆23 · Jul 26, 2019 · Updated 6 years ago
substrait-io / substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
☆1,535 · Updated this week
RoaringBitmap / CRoaring
Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations: used by Apache Doris, ClickHouse, Alibaba Tair, Redpand…
☆1,862 · Updated this week
maxi-k / btrblocks
BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)
☆285 · Apr 7, 2025 · Updated last year
hideo55 / cpp-HyperLogLog
C++ implementation of HyperLogLog
☆61 · May 27, 2024 · Updated 2 years ago
leanstore / leanstore
☆643 · Apr 9, 2026 · Updated 3 months ago
SketchLib / P4_SketchLib
☆42 · Aug 14, 2023 · Updated 2 years ago
Kingsford-Group / miniception
☆14 · Jan 31, 2020 · Updated 6 years ago
datafusion-contrib / datafusion-orc
Implementation of Apache ORC file format using Apache Arrow in-memory format
☆46 · Updated this week
TU-Berlin-DIMA / Condor
Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of …
☆13 · Jun 24, 2024 · Updated 2 years ago
facebookincubator / nimble
New and extensible file format for storage of large columnar datasets.
☆728 · Updated this week
IBM / sliding-window-aggregators
Reference implementations of sliding window aggregation algorithms
☆46 · Mar 27, 2026 · Updated 3 months ago
speedb-io / log-parser
A tool for analyzing and parsing SpeedB and RocksDB log files
☆22 · Mar 31, 2024 · Updated 2 years ago
s-yata / madoka
Count-Min sketch-based approximate counting library
☆47 · May 13, 2025 · Updated last year
efficient / libcuckoo
A high-performance, concurrent hash table
☆1,741 · Apr 25, 2026 · Updated 2 months ago
FastFilter / fastfilter_cpp
Fast Approximate Membership Filters (C++)
☆287 · Aug 29, 2025 · Updated 10 months ago
JJK96 / P4-filtering
Attachments for the paper "Filtering DDoS traffic using the P4 programming language" by Jan-Jaap Korpershoek
☆10 · Jun 30, 2018 · Updated 8 years ago
facebook / CacheLib
Pluggable in-process caching engine to build and scale high performance services
☆1,568 · Updated this week
cmu-db / optd-original
CMU-DB's Cascades optimizer framework
☆405 · Jan 6, 2025 · Updated last year
alabid / countminsketch
Implementation of Count Min Sketch in C++
☆58 · Jun 23, 2019 · Updated 7 years ago