jparkie / PDDLinks
Advanced Bloom Filter Based Algorithms for Efficient Approximate Data De-Duplication in Streams
☆246Updated 8 years ago
Alternatives and similar repositories for PDD
Users that are interested in PDD are comparing it to the libraries listed below
Sorting:
- Doradus is a REST service that extends a Cassandra NoSQL database with a graph-based data model, advanced indexing and search features, a…☆204Updated 10 years ago
- An event bus framework for event driven programming☆72Updated 3 years ago
- Cross language bloom filter implementation☆301Updated 3 years ago
- BigTable, Document and Graph Database with Full Text Search☆186Updated 7 years ago
- An alternative take on Java object relational mapping☆51Updated last year
- Data structure server.☆181Updated 9 years ago
- Interactive visualization framework for Runway models of distributed systems☆186Updated 3 years ago
- Disco Machine Learning Library☆99Updated 10 years ago
- ☆171Updated 11 years ago
- Implementations of a data structure with false negatives but no false positives.☆364Updated 2 years ago
- invesdwin-context modules that provide persistence features☆44Updated 2 weeks ago
- The Chronix Server implementation that is based on Apache Solr.☆266Updated 6 years ago
- Improved Secondary Indexing with new Query Capabilities (OR, scoping) for Cassandra☆145Updated 10 years ago
- Distributed Named Pipes☆455Updated 8 years ago
- BloomFilter in python☆101Updated 8 years ago
- The DB that's replicated, sharded and transactional.☆175Updated 10 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
- Keyvi - a key value index that powers Cliqz search engine. It is an in-memory FST-based data structure highly optimized for size and look…☆178Updated 7 years ago
- Serving system for batch generated data sets☆178Updated 8 years ago
- Finite state dictionaries in Java☆132Updated 4 years ago
- A library to implement asynchronous dependency graphs for services in Java☆261Updated 2 years ago
- HyperMinHash: Bringing intersections to HyperLogLog☆308Updated 7 years ago
- TAPIR distributed transactional storage system☆424Updated 5 years ago
- UI for interactive data analysis | https://snorkel.logv.org☆167Updated last year
- At Twitter I often asked a simple question, render a tweet given the text and an unordered list of its entities☆42Updated 4 years ago
- Consus is a geo-replicated transactional key-value store.☆227Updated 7 years ago
- Quickly detect already witnessed data.☆156Updated last year
- Distributed decision tree ensemble learning in Scala☆390Updated 7 years ago
- Kerf (Kerf1) is a columnar tick database and time-series language for Linux/OSX/BSD/iOS/Android. It is written in C and natively speaks J…☆548Updated last year
- Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin…☆24Updated 10 years ago