mayconbordin / streaminer
A collection of algorithms for mining data streams
☆203Updated last year
Alternatives and similar repositories for streaminer:
Users that are interested in streaminer are comparing it to the libraries listed below
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆659Updated 11 years ago
- Enabling queries on compressed data.☆279Updated last year
- Streaming estimation of percentiles, especially high percentiles.☆63Updated 12 years ago
- Mirror of Apache Samoa (Incubating)☆248Updated 2 years ago
- Persistent Adaptive Radix Trees in Java☆81Updated 4 years ago
- An efficient updatable key-value store for Apache Spark☆251Updated 8 years ago
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink☆135Updated 7 years ago
- An experimental Graph Streaming API for Apache Flink☆142Updated 4 years ago
- Drizzle integration with Apache Spark☆120Updated 6 years ago
- Java library for the HyperLogLog algorithm☆315Updated 7 years ago
- DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees.☆120Updated this week
- Fast JVM collection☆59Updated 10 years ago
- A streaming / online query processing / analytics engine based on Apache Storm☆271Updated 7 years ago
- HyperLogLog (original and hyperloglog++) algorithm implementation in java.☆81Updated 4 years ago
- Bloofi: A java implementation of multidimensional Bloom filters☆79Updated 9 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 8 years ago
- Self regulation and auto-tuning for distributed system☆65Updated last year
- Large off-heap arrays and mmap files for Scala and Java☆402Updated 2 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Updated last year
- A streaming key-value store implementation using native Flink Streaming operators☆23Updated 9 years ago
- An extension of Yahoo's Benchmarks☆107Updated last year
- Low latency, strong consistency, fault tolerant distributed key value store. Colocate data and compute to achieve best performance cloud …☆114Updated 9 years ago
- Joins for skewed datasets in Spark☆57Updated 7 years ago
- A Scalable Concurrent Key-Value Map for Big Data Analytics☆270Updated last year
- High performance native memory access for Java.☆125Updated this week
- CKite - A JVM implementation of the Raft distributed consensus algorithm written in Scala☆213Updated 6 years ago
- A simple integer compression library in Java☆547Updated 10 months ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 6 years ago
- This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing.☆77Updated last year
- Tools to work with off-heap memory using sun.misc.Unsafe☆136Updated 8 years ago