A software library of stochastic streaming algorithms, a.k.a. sketches.
☆947Feb 18, 2026Updated last week
Alternatives and similar repositories for datasketches-java
Users that are interested in datasketches-java are comparing it to the libraries listed below
Sorting:
- High performance native memory access for Java.☆128Feb 20, 2026Updated last week
- Core C++ Sketch Library☆252Feb 15, 2026Updated last week
- Sketch adaptors for Hive.☆50Feb 24, 2025Updated last year
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆105Jan 20, 2026Updated last month
- Website for DataSketches.☆108Feb 20, 2026Updated last week
- Apache Pinot - A realtime distributed OLAP datastore☆6,032Updated this week
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,700Mar 1, 2023Updated 2 years ago
- Stream summarizer and cardinality estimator.☆2,266Nov 28, 2019Updated 6 years ago
- A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others☆3,821Feb 20, 2026Updated last week
- Anthelion is a plugin for Apache Nutch to crawl semantic annotations within HTML pages.☆2,841Dec 17, 2015Updated 10 years ago
- Distributed Prometheus time series database☆1,462Updated this week
- Apache Calcite☆5,077Updated this week
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,260Feb 19, 2026Updated last week
- Pravega - Streaming as a new software defined storage primitive☆2,005Mar 2, 2025Updated 11 months ago
- An embeddable write-once key-value store written in Java☆941Dec 2, 2019Updated 6 years ago
- A Scalable Concurrent Key-Value Map for Big Data Analytics☆275Jan 18, 2024Updated 2 years ago
- Apache Druid: a high performance real-time analytics database.☆13,942Updated this week
- High performance data store solution☆1,446Feb 21, 2026Updated last week
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,036Nov 21, 2022Updated 3 years ago
- Distributed object store☆1,781Feb 19, 2026Updated last week
- Open source Java implementation for Raft consensus protocol.☆1,443Updated this week
- Apache Geode☆2,357Jan 22, 2026Updated last month
- Sketch Library for vector-based models☆15Mar 30, 2025Updated 10 months ago
- A Kubernetes toolkit for building distributed applications using cloud native principles☆2,366Jun 23, 2024Updated last year
- fastutil extends the Java™ Collections Framework by providing type-specific maps, sets, lists and queues.☆2,110Dec 2, 2025Updated 2 months ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆281Aug 3, 2018Updated 7 years ago
- Sql interface to druid.☆77Dec 14, 2015Updated 10 years ago
- Apache HAWQ☆697May 16, 2024Updated last year
- High Performance data structures and utility methods for Java☆3,160Updated this week
- Apache Ignite☆5,044Updated this week
- A low-level integer compression library in Java☆564Dec 22, 2025Updated 2 months ago
- Mirror of Apache Helix☆493Feb 19, 2026Updated last week
- A high performance caching library for Java☆17,498Updated this week
- Java library for efficiently working with flat heap memory☆516Feb 3, 2026Updated 3 weeks ago
- A large-scale entity and relation database supporting aggregation of properties☆1,791Jun 6, 2025Updated 8 months ago
- Janino is a super-small, super-fast Java™ compiler.☆1,320May 31, 2024Updated last year
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆888Feb 9, 2026Updated 2 weeks ago
- An open source ML system for the end-to-end data science lifecycle☆1,079Feb 20, 2026Updated last week
- Replicate your Key Value Store across your network, with consistency, persistance and performance.☆2,939Jan 29, 2026Updated 3 weeks ago