jparkie / PDD
Advanced Bloom Filter Based Algorithms for Efficient Approximate Data De-Duplication in Streams
☆242Updated 8 years ago
Alternatives and similar repositories for PDD:
Users that are interested in PDD are comparing it to the libraries listed below
- Doradus is a REST service that extends a Cassandra NoSQL database with a graph-based data model, advanced indexing and search features, a…☆204Updated 9 years ago
- Cross language bloom filter implementation☆297Updated 2 years ago
- Interactive visualization framework for Runway models of distributed systems☆187Updated 3 years ago
- An event bus framework for event driven programming☆71Updated 2 years ago
- Data structure server.☆182Updated 8 years ago
- Distributed Named Pipes☆455Updated 7 years ago
- ☆172Updated 10 years ago
- BigTable, Document and Graph Database with Full Text Search☆186Updated 7 years ago
- Implementations of a data structure with false negatives but no false positives.☆356Updated last year
- File-backed append-only object store.☆117Updated 8 years ago
- invesdwin-context modules that provide persistence features☆43Updated this week
- The Chronix Server implementation that is based on Apache Solr.☆265Updated 5 years ago
- cron-like jobs for back-end systems☆76Updated 6 years ago
- A general-purpose data analysis engine radically changing the way batch and stream data is processed☆7Updated 6 years ago
- An alternative take on Java object relational mapping☆51Updated 7 months ago
- Disco Machine Learning Library☆99Updated 9 years ago
- HyperMinHash: Bringing intersections to HyperLogLog☆303Updated 7 years ago
- Serving system for batch generated data sets☆176Updated 7 years ago
- Improved Secondary Indexing with new Query Capabilities (OR, scoping) for Cassandra☆145Updated 9 years ago
- BloomFilter in python☆101Updated 7 years ago
- Speculative Paxos replication protocol☆130Updated 8 years ago
- Quickly detect already witnessed data.☆158Updated 8 months ago
- UI for interactive data analysis | https://snorkel.logv.org☆163Updated last year
- Finite state dictionaries in Java☆130Updated 3 years ago
- A thin Java object persistence layer for JDBC☆83Updated 4 years ago
- TAPIR distributed transactional storage system☆422Updated 4 years ago
- A java library for stored queries☆375Updated 2 years ago
- Simulating the performance of various streaming algorithms. #experimentalmathematics☆59Updated 7 years ago
- Consus is a geo-replicated transactional key-value store.☆225Updated 6 years ago
- A probabilistic data structure service and storage☆772Updated 8 years ago