RapidCDC: Leveraging Duplicate Locality to Accelerate Chunking in CDC-based Deduplication Systems
☆17May 25, 2020Updated 5 years ago
Alternatives and similar repositories for RapidCDC
Users that are interested in RapidCDC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code provides an example C language based implementation of the idea proposed in the paper "SS-CDC: a two-stage parallel content-defi…☆11Sep 10, 2019Updated 6 years ago
- DedupBench is a benchmarking tool for content-defined chunking techniques used in data deduplication. It currently supports eleven uniqu…☆24Feb 20, 2026Updated 3 months ago
- An implementation of FastCDC in C☆35Jun 27, 2022Updated 3 years ago
- FastCDC implementation in Python https://pypi.org/project/fastcdc/☆65Jun 27, 2024Updated last year
- ☆16May 4, 2021Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆17Nov 27, 2018Updated 7 years ago
- An experimental platform for chunk-level data deduplication. Key words: DDFS, Sparse Index, Extreme Binning, SiLo, Sample Index, BLC; CBR…☆168Apr 17, 2016Updated 10 years ago
- Some paper lists related to storage systems☆51Apr 20, 2026Updated last month
- CXL Management Interface library☆25Apr 30, 2026Updated 3 weeks ago
- Find near-duplicate documents using minhashing implemented in Go.☆16Dec 22, 2015Updated 10 years ago
- ☆25Mar 31, 2022Updated 4 years ago
- ☆24May 6, 2022Updated 4 years ago
- Deduplication for cfDNA sequencing data☆11Jul 5, 2017Updated 8 years ago
- Fast and efficient content-defined chunking for data deduplication. Java implementation of FastCDC as library.☆26Sep 21, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A Python tool to search for and remove duplicated files in messy datasets☆15Dec 23, 2024Updated last year
- A FUSE filesystem that lets you mount remote rsync modules☆26Jun 5, 2024Updated last year
- Fast duplicate file detection library☆26Jan 5, 2017Updated 9 years ago
- String deduplication package for Go☆19Jan 10, 2024Updated 2 years ago
- 文档去重功能是为了解决搜索引擎的文档语义重复的问题,方法是多重哈希下的语义指纹算法。☆11Aug 17, 2013Updated 12 years ago
- A Golang package that implements CDC chunkers with a generic interface☆123Apr 9, 2026Updated last month
- bktree data structure with a Python interface for a CPP implementation☆13Jan 11, 2017Updated 9 years ago
- 🕹️ Group and deduplicate concurrent tasks☆31May 15, 2026Updated last week
- Rabin hashing and content-defined chunking for Go☆20Sep 11, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Python library and dashboard for hyperparameter search and model training for computer vision tasks based on PyTorch, Optuna, FiftyOne, D…☆17Jul 14, 2023Updated 2 years ago
- Source code for the FAST '23 paper “MadFS: Per-File Virtualization for Userspace Persistent Memory Filesystems”☆18Mar 5, 2023Updated 3 years ago
- A Python FUSE file system that features transparent deduplication and compression which make it ideal for archiving backups.☆139Jul 22, 2010Updated 15 years ago
- Symbolic range analysis for LLVM.☆12Jan 10, 2016Updated 10 years ago
- template for https://cnli.me☆10Feb 27, 2025Updated last year
- Symbolic Liveness Analysis of real-world software building upon KLEE to detect liveness violations (e.g. infinite loop bugs)☆12Dec 16, 2021Updated 4 years ago
- ☆10Jan 31, 2022Updated 4 years ago
- Use clonefile to deduplicate files on APFS.☆57Apr 8, 2026Updated last month
- ☆10May 24, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ✨ Epris is a JavaScript library that simplifies interface development☆26May 30, 2022Updated 3 years ago
- ☆11Aug 5, 2020Updated 5 years ago
- 实验室安全考试题库👷🏻♀️☆13Nov 25, 2020Updated 5 years ago
- Multithreaded 7-zip compatible file archiver☆33Mar 17, 2019Updated 7 years ago
- ICS2018 for HUST☆17Dec 28, 2018Updated 7 years ago
- ☆10Feb 20, 2021Updated 5 years ago
- ☆13Jun 12, 2024Updated last year