Stream summarizer and cardinality estimator.
☆2,267Nov 28, 2019Updated 6 years ago
Alternatives and similar repositories for stream-lib
Users that are interested in stream-lib are comparing it to the libraries listed below
- Java library for the HyperLogLog algorithm☆319Feb 7, 2018Updated 8 years ago
- A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others☆3,837Updated this week
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆949Mar 12, 2026Updated last week
- Abstract Algebra for Scala☆2,302Nov 21, 2025Updated 3 months ago
- A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means☆2,144Feb 17, 2025Updated last year
- A collection of algorithms for mining data streams☆206Dec 16, 2023Updated 2 years ago
- A high performance caching library for Java☆17,549Updated this week
- Replicate your Key Value Store across your network, with consistency, persistence and performance.☆2,940Jan 29, 2026Updated last month
- Capturing JVM- and application-level metrics. So you know what's going on.☆7,852Mar 12, 2026Updated last week
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,688Mar 1, 2023Updated 3 years ago
- SQL-based streaming analytics platform at scale☆1,226Jun 21, 2020Updated 5 years ago
- Apache Druid: a high performance real-time analytics database.☆13,962Updated this week
- A Kubernetes toolkit for building distributed applications using cloud native principles☆2,366Jun 23, 2024Updated last year
- Library of different Bloom filters in Java with optional Redis-backing, counting and many hashing options.☆862Dec 9, 2025Updated 3 months ago
- Java Collections till the last breadcrumb of memory and performance☆1,025Feb 1, 2017Updated 9 years ago
- High Performance Inter-Thread Messaging Library☆18,260Apr 2, 2025Updated 11 months ago
- ☆436Jul 1, 2020Updated 5 years ago
- Fibers, Channels and Actors for the JVM☆4,566Jan 21, 2024Updated 2 years ago
- ☆3,806Mar 11, 2026Updated last week
- Apache Pinot - A realtime distributed OLAP datastore☆6,047Updated this week
- MapDB provides concurrent Maps, Sets and Queues backed by disk storage or off-heap-memory. It is a fast and easy to use embedded Java dat…☆5,039Jun 4, 2024Updated last year
- High Performance data structures and utility methods for Java☆3,170Mar 12, 2026Updated last week
- A high performance replicated log service. (The development is moved to Apache Incubator)☆2,207Feb 25, 2020Updated 6 years ago
- High Performance Primitive Collections for Java☆1,038Updated this week
- A low-level integer compression library in Java☆565Dec 22, 2025Updated 2 months ago
- Apache Flink☆25,875Updated this week
- Zero-allocation hashing for Java☆840Jan 29, 2026Updated last month
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,260Updated this week
- The official home of the Presto distributed SQL query engine for big data☆16,668Updated this week
- Java library for efficiently working with flat heap memory☆516Updated this week
- Notes talking about the design and implementation of Apache Spark☆5,364Apr 2, 2024Updated last year
- A compressed alternative to the Java BitSet class☆566Oct 21, 2025Updated 4 months ago
- Mirror of Apache Eagle☆410Aug 22, 2020Updated 5 years ago
- Enterprise Stream Process Engine☆3,885Jun 16, 2023Updated 2 years ago
- Alluxio, data orchestration for analytics and machine learning in the cloud☆7,167Apr 29, 2025Updated 10 months ago
- Microsecond messaging that stores everything to disk☆3,701Mar 10, 2026Updated last week
- Coolplay Spark: Spark source code analysis, Spark libraries, and more☆3,483May 18, 2022Updated 3 years ago
- CMAK is a tool for managing Apache Kafka clusters☆11,944Aug 2, 2023Updated 2 years ago
- configuration library for JVM languages using HOCON files☆6,292Mar 2, 2026Updated 2 weeks ago