apache / datasketches-javaLinks
A software library of stochastic streaming algorithms, a.k.a. sketches.
☆927Updated this week
Alternatives and similar repositories for datasketches-java
Users that are interested in datasketches-java are comparing it to the libraries listed below
Sorting:
- Stream summarizer and cardinality estimator.☆2,265Updated 5 years ago
- A streaming / online query processing / analytics engine based on Apache Storm☆273Updated 8 years ago
- Mirror of Apache Samza☆831Updated 5 months ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...☆639Updated last year
- An embeddable write-once key-value store written in Java☆938Updated 5 years ago
- Airlift framework for building REST services☆621Updated this week
- Java library for efficiently working with heap and off-heap memory☆512Updated 3 months ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆660Updated 11 years ago
- Java library for the HyperLogLog algorithm☆316Updated 7 years ago
- Mirror of Apache Helix☆487Updated 3 weeks ago
- This code base is retained for historical interest only, please visit Apache Incubator Repo for latest one☆561Updated 3 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,036Updated 2 years ago
- Mirror of Apache Apex core☆350Updated 4 years ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆746Updated this week
- Apache Tez☆503Updated last week
- Apache HAWQ☆694Updated last year
- A compressed alternative to the Java BitSet class☆563Updated 2 years ago
- Mirror of Apache Giraph☆618Updated 2 years ago
- JVM readings☆492Updated 4 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…☆515Updated 5 years ago
- Lightweight real-time big data streaming engine over Akka☆759Updated 3 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,246Updated 2 weeks ago
- An open-source columnar data format designed for fast & realtime analytic with big data.☆453Updated 2 years ago
- Transactional Support for HBase (Mirror of https://github.com/apache/incubator-omid)☆300Updated 8 years ago
- Real-time Query for Hadoop; mirror of Apache Impala☆34Updated 2 years ago
- A fully asynchronous, non-blocking, thread-safe, high-performance HBase client.☆610Updated 2 years ago
- Apache Drill is a distributed MPP query layer for self describing data☆1,991Updated 3 weeks ago
- LinkedIn's previous generation Kafka to HDFS pipeline.☆883Updated 5 years ago
- Apache Phoenix☆1,047Updated last week
- Mirror of Apache Samoa (Incubating)☆251Updated 2 years ago