FastCDC implementation in Python https://pypi.org/project/fastcdc/
☆65Jun 27, 2024Updated last year
Alternatives and similar repositories for fastcdc-py
Users that are interested in fastcdc-py are comparing it to the libraries listed below.
- Fast text chunking algorithms for Python☆12Oct 7, 2020Updated 5 years ago
- FastCDC implementation in Rust☆193Apr 28, 2026Updated 3 weeks ago
- Fast multi-threaded content-dependent chunking deduplication for Buffers in C++ with a reference implementation in JavaScript. Ships with…☆75Mar 1, 2020Updated 6 years ago
- Fast implementation of Content Defined Chunking (CDC) based on a rolling Rabin Checksum in C.☆55Oct 2, 2014Updated 11 years ago
- An experimental platform for chunk-level data deduplication. Key words: DDFS, Sparse Index, Extreme Binning, SiLo, Sample Index, BLC; CBR…☆168Apr 17, 2016Updated 10 years ago
- Implementation of Content Defined Chunking (CDC) in Go☆350Oct 1, 2023Updated 2 years ago
- Multiple ways of chunking for data deduplication: Fixed size chunking, Content defined chunking, and File based chunking.☆19Dec 20, 2013Updated 12 years ago
- Get a list of deduped files on a ZFS filesystem☆13Oct 14, 2020Updated 5 years ago
- ACM SoCC 2019, "Coupling Decentralized Key-Value Stores with Erasure Coding"☆15May 22, 2021Updated 5 years ago
- small fastcdc implementation in c99☆18Dec 31, 2022Updated 3 years ago
- ISCC - Software Development Kit☆20May 10, 2026Updated last week
- Find near-duplicate documents using minhashing implemented in Go.☆16Dec 22, 2015Updated 10 years ago
- Python wrapper for epubcheck☆22Jun 4, 2024Updated last year
- A Go library implementing a buzhash rolling hash function☆31Aug 16, 2016Updated 9 years ago
- ☆16Jun 11, 2023Updated 2 years ago
- This package implements the FastCDC content defined chunking algorithm☆31Sep 30, 2020Updated 5 years ago
- A Golang package that implements CDC chunkers with a generic interface☆123Apr 9, 2026Updated last month
- ☆25Mar 31, 2022Updated 4 years ago
- A python module for generating Rabin fingerprints☆41Jan 5, 2018Updated 8 years ago
- Deduplication for cfDNA sequencing data☆11Jul 5, 2017Updated 8 years ago
- A Python tool to search for and remove duplicated files in messy datasets☆15Dec 23, 2024Updated last year
- Rabin fingerprinting and deduplication library in C☆28Feb 16, 2016Updated 10 years ago
- Software for image-based building modeling.☆15Nov 26, 2014Updated 11 years ago
- Original Joy☆11Dec 17, 2024Updated last year
- IR Receiver based on an FTDI chip for usage with lirc (see http://www.huitsing.nl/irftdi/)☆10Oct 17, 2014Updated 11 years ago
- String deduplication package for Go☆19Jan 10, 2024Updated 2 years ago
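- Document deduplication for resolving semantically duplicate documents in search engines, using a semantic fingerprint algorithm based on multiple hashing.☆11Aug 17, 2013Updated 12 years ago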
- ☆12Jan 12, 2024Updated 2 years ago
- Find duplicate text files.☆14Jan 14, 2025Updated last year
- A really simple wiki engine with bottlepy☆13Mar 10, 2012Updated 14 years ago
- Pile Deduplication Code☆18May 15, 2023Updated 3 years ago
- 🕹️ Group and deduplicate concurrent tasks☆31May 15, 2026Updated last week
- Reproduction of the SRCNN paper☆11Aug 2, 2018Updated 7 years ago
- Implementation of some rolling hashes in go☆68Sep 15, 2025Updated 8 months ago
- Rabin hashing and content-defined chunking for Go☆20Sep 11, 2017Updated 8 years ago
- SC 2021, "LogECMem: Coupling Erasure-Coded In-Memory Key-Value Stores with Parity Logging"☆12Jul 12, 2021Updated 4 years ago
- GitHub Stacked PR with JJ☆19Sep 12, 2023Updated 2 years ago
- You need a few lines of JS, not a vector database.☆19Oct 1, 2024Updated last year
- Switch between git worktrees with speed.☆15Apr 14, 2026Updated last month