A software library of stochastic streaming algorithms, a.k.a. sketches.
☆954Jun 4, 2026Updated this week
Alternatives and similar repositories for datasketches-java
Users that are interested in datasketches-java are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- High performance native memory access for Java.☆131Updated this week
- Core C++ Sketch Library☆257May 20, 2026Updated 2 weeks ago
- Sketch adaptors for Hive.☆51May 15, 2026Updated 3 weeks ago
- Website for DataSketches.☆109May 18, 2026Updated 3 weeks ago
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆115May 15, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Sketch adaptors for Pig.☆10May 15, 2026Updated 3 weeks ago
- Stream summarizer and cardinality estimator.☆2,265Nov 28, 2019Updated 6 years ago
- Sketch Library for vector-based models☆15May 15, 2026Updated 3 weeks ago
- Apache Pinot - A realtime distributed OLAP datastore☆6,097Updated this week
- A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others☆3,869May 13, 2026Updated 3 weeks ago
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,632Mar 1, 2023Updated 3 years ago
- A Scalable Concurrent Key-Value Map for Big Data Analytics☆276Jan 18, 2024Updated 2 years ago
- Apache Druid: a high performance real-time analytics database.☆14,017Updated this week
- Anthelion is a plugin for Apache Nutch to crawl semantic annotations within HTML pages.☆2,832Dec 17, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Apache Calcite☆5,134Updated this week
- Distributed Prometheus time series database☆1,463Updated this week
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,265Jun 1, 2026Updated last week
- High performance data store solution☆1,446May 15, 2026Updated 3 weeks ago
- Pravega - Streaming as a new software defined storage primitive☆2,002Mar 2, 2025Updated last year
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp☆92May 15, 2026Updated 3 weeks ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,033Nov 21, 2022Updated 3 years ago
- An embeddable write-once key-value store written in Java☆938Dec 2, 2019Updated 6 years ago
- Sql interface to druid.☆78Dec 14, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Distributed object store☆1,786Updated this week
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆281Aug 3, 2018Updated 7 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆661Feb 6, 2014Updated 12 years ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,568Jun 1, 2026Updated last week
- Open source Java implementation for Raft consensus protocol.☆1,452Updated this week
- Java library for efficiently working with flat heap memory☆516Apr 3, 2026Updated 2 months ago
- A Kubernetes toolkit for building distributed applications using cloud native principles☆2,363Jun 23, 2024Updated last year
- fastutil extends the Java™ Collections Framework by providing type-specific maps, sets, lists and queues.☆2,185Dec 2, 2025Updated 6 months ago
- The Apache Storm implementation of the Bullet backend☆41Apr 17, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A low-level integer compression library in Java☆567Mar 15, 2026Updated 2 months ago
- Janino is a super-small, super-fast Java™ compiler.☆1,324May 31, 2024Updated 2 years ago
- Apache Flink Stateful Functions☆536May 15, 2026Updated 3 weeks ago
- A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means☆2,152Feb 17, 2025Updated last year
- The official home of the Presto distributed SQL query engine for big data☆16,711Updated this week
- Apache Geode☆2,368May 30, 2026Updated last week
- Java library for the HyperLogLog algorithm☆318Feb 7, 2018Updated 8 years ago