apache/datasketches

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/apache/datasketches)

apache / datasketches

A software library of stochastic streaming algorithms, a.k.a. sketches.

☆116

Alternatives and similar repositories for datasketches

Users that are interested in datasketches are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apache / datasketches-cpp
View on GitHub
Core C++ Sketch Library
☆267Jul 11, 2026Updated last week
apache / datasketches-postgresql
View on GitHub
PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp
☆94May 15, 2026Updated 2 months ago
apache / datasketches-java
View on GitHub
A software library of stochastic streaming algorithms, a.k.a. sketches.
☆957Updated this week
qubole / streaminglens
View on GitHub
Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines
☆17Jan 21, 2020Updated 6 years ago
apache / datasketches-website
View on GitHub
Website for DataSketches.
☆109Jul 4, 2026Updated 2 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
startreedata / pinot-client-go
View on GitHub
Apache Pinot Golang Client managed by StarTree
☆34Updated this week
zebrium / zebrium-kubernetes-demo
View on GitHub
GKE cluster using Litmus Chaos Engine to validate Zebrium's unsupervised Machine Learning incident detection platform
☆18Jun 2, 2023Updated 3 years ago
seglo / exactly-once-streams
View on GitHub
An engineering report on using transactions in Kafka 0.11.0.0
☆19Feb 27, 2018Updated 8 years ago
apache / datasketches-memory
View on GitHub
High performance native memory access for Java.
☆134Jul 13, 2026Updated last week
cudbg / sqltutor
View on GitHub
☆12Oct 5, 2022Updated 3 years ago
DominikHorn / hashing-benchmark
View on GitHub
benchmark driver for "Can Learned Models Replace Hash Functions?" VLDB submission
☆16Oct 31, 2023Updated 2 years ago
logstash-plugins / logstash-output-google_cloud_storage
View on GitHub
☆10May 8, 2026Updated 2 months ago
pinterest / orion
View on GitHub
Management and automation platform for Stateful Distributed Systems
☆113Jun 17, 2026Updated last month
mikedotalmond / tones
View on GitHub
A Haxe port of https://github.com/bit101/tones for quickly making sounds with the WebAudio APIs
☆15Dec 23, 2015Updated 10 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
pinterest / yuvi
View on GitHub
Yuvi is an in-memory storage engine for recent time series metrics data.
☆48Dec 12, 2017Updated 8 years ago
datafusion-contrib / datafusion-objectstore-hdfs
View on GitHub
HDFS based on Java implementation as a remote ObjectStore for DataFusion
☆10Feb 13, 2024Updated 2 years ago
ververica / ForSt
View on GitHub
A Persistent Key-Value Store designed for Streaming processing
☆125Jan 13, 2026Updated 6 months ago
mwangaben / mwangaben-vthelpers
View on GitHub
☆10Jul 9, 2023Updated 3 years ago
apache / datasketches-python
View on GitHub
Apache datasketches
☆43May 15, 2026Updated 2 months ago
linasm / string-search-algos
View on GitHub
Code for blog posts on string search algorithms.
☆17Mar 4, 2020Updated 6 years ago
sec51 / goanomaly
View on GitHub
Golang library for anomaly detection. Uses the Gaussian distribution and the probability density formula.
☆19May 22, 2017Updated 9 years ago
DBOS-project / voltdb
View on GitHub
☆18Feb 11, 2025Updated last year
xephonhq / xephon-k
View on GitHub
A time series database prototype with multiple backends
☆23Feb 13, 2020Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
hashicorp / go.net
View on GitHub
Fork of code.google.com/p/go.net
☆19May 11, 2026Updated 2 months ago
apache / incubator-nemo
View on GitHub
Apache Nemo (Incubating) - Data Processing System for Flexible Employment With Different Deployment Characteristics
☆113Jul 1, 2025Updated last year
eugenesiow / tritandb-kt
View on GitHub
Time-series database for Internet of Things Analytics with a rich graph data model
☆19Oct 25, 2017Updated 8 years ago
linkedin / transport
View on GitHub
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…
☆306Jun 29, 2026Updated 3 weeks ago
substrait-io / substrait
View on GitHub
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
☆1,535Updated this week
trinodb / tempto
View on GitHub
A testing framework for Trino
☆28Jul 2, 2026Updated 2 weeks ago
ververica / lab-flink-repository-analytics
View on GitHub
This project contains a couple of tools to analyze data around the Apache Flink community.
☆18May 22, 2024Updated 2 years ago
wecatch / ember-cli-simditor
View on GitHub
Ember component wrapper for simditor editor
☆17Jul 1, 2022Updated 4 years ago
harttle / cors-demo
View on GitHub
Demo server/client for CORS cookies, preflights and redirects.
☆15May 4, 2018Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
risinglightdb / sqlplannertest-rs
View on GitHub
A yaml-based SQL planner test framework
☆27Dec 18, 2024Updated last year
joelpinheiro / iTrading
View on GitHub
☆14Apr 5, 2016Updated 10 years ago
wagjamin / inkfuse
View on GitHub
InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.
☆56May 13, 2024Updated 2 years ago
spacejam / assert_panic_free
View on GitHub
☆16Mar 3, 2021Updated 5 years ago
apache / datafusion-comet
View on GitHub
Apache DataFusion Comet Spark Accelerator
☆1,230Updated this week
netease-bigdata / ne-spark-courseware
View on GitHub
NetEase Spark Courses
☆15Sep 4, 2018Updated 7 years ago
linkedin / coral
View on GitHub
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆907Updated this week