A software library of stochastic streaming algorithms, a.k.a. sketches.
☆108Jan 20, 2026Updated last month
Alternatives and similar repositories for datasketches
Users that are interested in datasketches are comparing it to the libraries listed below
Sorting:
- Core C++ Sketch Library☆256Mar 13, 2026Updated last week
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp☆90Jul 2, 2025Updated 8 months ago
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆949Mar 12, 2026Updated last week
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Jan 21, 2020Updated 6 years ago
- An iOS application that uses a golang mobile framework☆10Sep 23, 2016Updated 9 years ago
- Website for DataSketches.☆108Mar 3, 2026Updated 2 weeks ago
- GKE cluster using Litmus Chaos Engine to validate Zebrium's unsupervised Machine Learning incident detection platform☆18Jun 2, 2023Updated 2 years ago
- Fluorite: Apache Calcite trace analyzer☆12Apr 15, 2019Updated 6 years ago
- benchmark driver for "Can Learned Models Replace Hash Functions?" VLDB submission☆16Oct 31, 2023Updated 2 years ago
- Automatically exported from code.google.com/p/selenium-profiler☆10Mar 16, 2015Updated 11 years ago
- Management and automation platform for Stateful Distributed Systems☆112Updated this week
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Feb 13, 2024Updated 2 years ago
- A time series database prototype with multiple backends☆23Feb 13, 2020Updated 6 years ago
- A Persistent Key-Value Store designed for Streaming processing☆120Jan 13, 2026Updated 2 months ago
- Yuvi is an in-memory storage engine for recent time series metrics data.☆48Dec 12, 2017Updated 8 years ago
- Acadnme is the basic Cppia host for Nme applications☆16May 24, 2016Updated 9 years ago
- ☆17Feb 11, 2025Updated last year
- Code for blog posts on string search algorithms.☆17Mar 4, 2020Updated 6 years ago
- Apache Nemo (Incubating) - Data Processing System for Flexible Employment With Different Deployment Characteristics☆113Jul 1, 2025Updated 8 months ago
- VectorDB is a free analytics DBMS for IoT & Big Data, compatible with ClickHouse.☆68Oct 16, 2021Updated 4 years ago
- Repeat command extended to visual mode.☆21Nov 29, 2013Updated 12 years ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,487Updated this week
- [haxelib] Helps generate externs for native libraries☆43Oct 2, 2015Updated 10 years ago
- A testing framework for Trino☆27Mar 19, 2025Updated last year
- github.com/cznic/virtual has moved to modernc.org/virtual☆20Nov 22, 2018Updated 7 years ago
- This project contains a couple of tools to analyze data around the Apache Flink community.☆18May 22, 2024Updated last year
- A collection of utilities for working with Druid queries☆23Mar 9, 2026Updated last week
- C++ library for CUDA accelerated computation of Non-negative Matrix Factorizations (NMF)☆12Mar 22, 2017Updated 8 years ago
- The Automated File Loader to BigQuery solution demonstrates the use of Object Change Notification service on Google Cloud Storage. It sho…☆22Jan 9, 2018Updated 8 years ago
- Demo server/client for CORS cookies, preflights and redirects.☆16May 4, 2018Updated 7 years ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.☆55May 13, 2024Updated last year
- ☆14Apr 5, 2016Updated 9 years ago
- ☆16Mar 3, 2021Updated 5 years ago
- Apache DataFusion Comet Spark Accelerator☆1,153Mar 13, 2026Updated last week
- NetEase Spark Courses☆15Sep 4, 2018Updated 7 years ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆305Oct 30, 2025Updated 4 months ago
- Generate GitHub Actions matrix on the fly based on your constraints☆12Apr 1, 2024Updated last year
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆446Mar 5, 2026Updated 2 weeks ago
- ☆34Mar 5, 2026Updated 2 weeks ago