ronomon / deduplication
Fast multi-threaded content-dependent chunking deduplication for Buffers in C++ with a reference implementation in JavaScript. Ships with extensive tests, a fuzz test and a benchmark.
☆74 Updated 5 years ago
Alternatives and similar repositories for deduplication:
Users who are interested in deduplication are comparing it to the libraries listed below.
- A SQLite extension for extracting values from serialized Protobuf messages ☆87 Updated last month
- fixed-length integer trim ☆33 Updated 2 years ago
- A compact implementation of Dr. Askitis HatTrie ☆80 Updated 10 years ago
- Fast implementation of Content Defined Chunking (CDC) based on a rolling Rabin Checksum in C. ☆53 Updated 10 years ago
- Append-only key-value database on a distributed shared-log ☆49 Updated 7 months ago
- FastCDC implementation in Rust ☆149 Updated last year
- Direct IO helpers for block devices and regular files on FreeBSD, Linux, macOS and Windows. ☆72 Updated 2 years ago
- EliasFanoCompression: quasi-succinct compression of sorted integers in C# ☆45 Updated 3 years ago
- Lock-free slab allocator / freelist. ☆65 Updated 9 years ago
- Batch Monitor - Gain performance by combining work from multiple threads into a single batch ☆30 Updated 9 years ago
- A CPace PAKE implementation using libsodium. ☆35 Updated 4 years ago
- Library implementing the storage and the query evaluation for a text search engine. It uses a key-value store database interface to st… ☆47 Updated 3 years ago
- Fast, SIMD-accelerated hash function for content-defined chunking ☆23 Updated 4 years ago
- mpool UAPI and CLI for HSE 1.x ☆36 Updated 3 years ago
- SQLite Clustered Database ☆18 Updated 5 years ago
- Official Github Mirror of the LumoSQL Database Project (https://lumosql.org/src/lumosql) ☆180 Updated last year
- a 64-bit histogram / quantile sketch ☆58 Updated 2 months ago
- memcachedb ported from BerkeleyDB to LMDB originally from http://memcachedb.googlecode.com/svn/trunk ☆86 Updated 9 years ago
- The dream accurate approximate set cardinality estimator based on 3-bit HyperLogLog. More accurate than Redis HyperLogLog. ☆55 Updated 4 years ago
- WebR2sync+ ☆12 Updated 8 years ago
- Embedded storage benchmarking tool ☆138 Updated 2 years ago
- Quick sort code using AVX2 instructions ☆68 Updated 7 years ago
- A not-yet-ready-for-use FoundationDB-backed FUSE filesystem. Seriously, don't use it. ☆13 Updated 11 months ago
- Automatically exported from code.google.com/p/idzip ☆40 Updated 10 years ago
- Compute xxHash hash codes for 8 keys in parallel ☆46 Updated 5 years ago
- HTTP benchmark utility ☆21 Updated last year
- Proof of concept LSM-tree built on MDB ☆17 Updated 9 years ago
- FemtoZip is a "shared dictionary" compression library optimized for small documents that may not compress well with traditional tools suc… ☆145 Updated 3 years ago
- Tracking, Benchmarking and Sharing Information about open source embedded data storage engines, internals, architectures, data storage… ☆181 Updated 6 years ago
- SQLite extension for generating UUIDs ☆47 Updated 6 years ago