A collection of libraries for single-pass, distributed, sublinear-space approximate aggregation and sketching algorithms. Currently: HyperLogLog++; more to come.
☆164May 21, 2025Updated 9 months ago
Alternatives and similar repositories for zetasketch
Users that are interested in zetasketch are comparing it to the libraries listed below
Sorting:
- GoogleSQL(formerly ZetaSQL) - Analyzer Framework for SQL☆2,617Jan 31, 2026Updated last month
- ScalikeJDBC extension for Google BigQuery☆18Mar 15, 2020Updated 6 years ago
- Parallel boolean circuit evaluation☆26Oct 28, 2018Updated 7 years ago
- Trino Community Connector for Google Data Studio☆11Jan 5, 2022Updated 4 years ago
- A testing framework for Trino☆27Mar 19, 2025Updated last year
- ☆10Jun 16, 2022Updated 3 years ago
- Java library for the HyperLogLog algorithm☆319Feb 7, 2018Updated 8 years ago
- Production-ready Java implementation of the Xor Filter.☆18Jan 23, 2020Updated 6 years ago
- Java/Scala library for easily authoring Flyte tasks and workflows☆44Jan 13, 2026Updated 2 months ago
- OpenCensus Go exporters for AWS (XRay only for now)☆27May 2, 2023Updated 2 years ago
- DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees.☆128Mar 6, 2026Updated 2 weeks ago
- Ephemeral Hadoop clusters using Google Compute Platform☆135Mar 31, 2022Updated 3 years ago
- ☆20Aug 18, 2020Updated 5 years ago
- Cloud Spanner Connector for Apache Spark☆17Updated this week
- Playing around with http4s + doobie + docker☆11Feb 17, 2018Updated 8 years ago
- Traversable Python Dictionaries☆59Dec 26, 2022Updated 3 years ago
- Be Truly Awesome☆12Oct 18, 2015Updated 10 years ago
- Moments Sketch Code☆40Oct 31, 2018Updated 7 years ago
- Visualizing BigQuery query jobs with Cloud Functions, Firebase and Pub/Sub☆25Jan 9, 2023Updated 3 years ago
- This service is meant to simplify running Google Cloud operations, especially BigQuery tasks. This means you do not have to worry about …☆46Apr 9, 2019Updated 6 years ago
- Keep UI simplest☆14Updated this week
- Java client for txtai☆40Feb 25, 2026Updated 3 weeks ago
- Bloofi: A java implementation of multidimensional Bloom filters☆85Jul 1, 2025Updated 8 months ago
- Source code analyzer that helps you to maintain variable/field naming conventions inside your project.☆40May 30, 2019Updated 6 years ago
- A Hivemall wrapper for Spark☆31Apr 21, 2016Updated 9 years ago
- The (B)ig (F)unction (T)axonomy is a detailed reference for common compute functions executed by different libraries, databases, and tool…☆19Dec 12, 2024Updated last year
- Using server-sent events to send binary data.☆13Dec 10, 2022Updated 3 years ago
- simple rules engine☆93Apr 16, 2020Updated 5 years ago
- APM, A high performance, scalable monitoring tool.☆20Aug 18, 2022Updated 3 years ago
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆949Mar 12, 2026Updated last week
- A tool for data sampling, data generation, and data diffing☆346Jan 8, 2026Updated 2 months ago
- Serve HTTP on a tailnet☆22Oct 31, 2024Updated last year
- ☆14Jan 2, 2023Updated 3 years ago
- Hear proposals from the developer community - specially from women - about creating a test that assesses women friendly companies☆21Jul 12, 2015Updated 10 years ago
- Probabilistic data structures for Guava.☆54Oct 22, 2020Updated 5 years ago
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,515Updated this week
- Opinion Analysis of News, Threaded Conversations, and User Generated Content☆108Sep 19, 2024Updated last year
- Streaming and Incremental Computation Framework☆248Jun 10, 2023Updated 2 years ago
- Apache Beam Site☆30Updated this week