mayconbordin / streaminer
A collection of algorithms for mining data streams
☆203 Updated last year
Alternatives and similar repositories for streaminer:
Users interested in streaminer are comparing it to the libraries listed below.
- Drizzle integration with Apache Spark ☆120 Updated 6 years ago
- Mirror of Apache Samoa (Incubating) ☆248 Updated last year
- Enabling queries on compressed data. ☆278 Updated last year
- An experimental Graph Streaming API for Apache Flink ☆141 Updated 4 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data. ☆660 Updated 11 years ago
- An extension of Yahoo's Benchmarks ☆107 Updated last year
- An efficient updatable key-value store for Apache Spark ☆251 Updated 8 years ago
- Persistent Adaptive Radix Trees in Java ☆80 Updated 4 years ago
- Mirror of Apache Crunch (Incubating) ☆104 Updated 4 years ago
- Streaming estimation of percentiles, especially high percentiles. ☆63 Updated 12 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams. ☆426 Updated 8 years ago
- High performance native memory access for Java. ☆125 Updated last month
- Self regulation and auto-tuning for distributed system ☆65 Updated last year
- Java library for the HyperLogLog algorithm ☆314 Updated 7 years ago
- Fast JVM collection ☆59 Updated 10 years ago
- This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing. ☆76 Updated last year
- A streaming / online query processing / analytics engine based on Apache Storm ☆271 Updated 7 years ago
- DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees. ☆119 Updated 9 months ago
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink ☆135 Updated 7 years ago
- Website for DataSketches. ☆98 Updated this week
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark ☆31 Updated 6 years ago
- HBase as a TinkerPop Graph Database ☆256 Updated last week
- HyperLogLog (original and hyperloglog++) algorithm implementation in java. ☆82 Updated 4 years ago
- Huge Collections for Java using efficient off heap storage ☆274 Updated 10 years ago
- Large off-heap arrays and mmap files for Scala and Java ☆402 Updated 2 years ago
- Probabilistic data structures for Guava. ☆54 Updated 4 years ago
- Druid indexing plugin for using Spark in batch jobs ☆101 Updated 3 years ago
- Spark RDD with Lucene's query and entity linkage capabilities ☆125 Updated this week
- Joins for skewed datasets in Spark ☆57 Updated 7 years ago
- Bitmap compression using the CONCISE algorithm ☆43 Updated 8 years ago