apache / datasketches-vector
Sketch Library for vector-based models
☆14 Updated this week
Alternatives and similar repositories for datasketches-vector:
Users who are interested in datasketches-vector are comparing it to the libraries listed below.
- ByteBuffer collection classes for Java and JVM-based languages. ☆33 Updated 6 years ago
- Bloofi: A Java implementation of multidimensional Bloom filters ☆79 Updated 9 years ago
- Graphulo: Accumulo library of matrix math primitives and graph algorithms ☆78 Updated 11 months ago
- Routines and data structures for using isarn-sketches idiomatically in Apache Spark ☆29 Updated 10 months ago
- Java numerics library for optimization, polynomial root finding, sorting, robust model fitting, and more. ☆51 Updated last week
- A framework for scalable graph computing. ☆147 Updated 6 years ago
- Java Matrix Benchmark is a tool for evaluating Java linear algebra libraries for speed, stability, and memory usage. ☆59 Updated last year
- Java library to create and search random access files (including in S3) using the space-filling Hilbert index (sparse) ☆48 Updated last week
- Implementation of the Loopy Belief Propagation algorithm for Apache Spark ☆41 Updated 4 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆29 Updated 9 years ago
- ☆29 Updated 3 years ago
- Automatic offload of user-written Spark kernels to accelerators ☆18 Updated 8 years ago
- Alenka JDBC is a library for accessing and manipulating data with the open-source GPU database Alenka. ☆19 Updated 10 years ago
- The Musketeer workflow manager. ☆41 Updated 6 years ago
- This project describes the D4M 2.0 Schema used in many Accumulo systems. ☆21 Updated 4 years ago
- Idempotent query executor ☆51 Updated 3 weeks ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments. ☆24 Updated 8 years ago
- A Java library for stored queries ☆16 Updated last year
- Cascading on Apache Flink® ☆54 Updated last year
- Milan is a Scala API and runtime infrastructure for building data-oriented systems, built on top of Apache Flink. ☆39 Updated last year
- Dynamic Distributed Dimensional Data Model ☆41 Updated 11 months ago
- Sux4J is an effort to bring succinct data structures to Java. ☆161 Updated last year
- Peel is a framework that helps you to define, execute, analyze, and share experiments for distributed systems and algorithms. ☆27 Updated 2 years ago
- Apache Amaterasu ☆56 Updated 5 years ago
- Then comes thunder. ☆22 Updated 8 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka ☆29 Updated 8 years ago
- Persistent Adaptive Radix Trees in Java ☆80 Updated 4 years ago
- A streaming key-value store implementation using native Flink Streaming operators ☆23 Updated 9 years ago
- HBase as a JSON Document Database ☆24 Updated last year
- Cluster computing using Stateful Dataflow Graphs ☆26 Updated 2 years ago