Find near-duplicate documents using minhashing implemented in Go.
☆16 · Dec 22, 2015 · Updated 10 years ago
Alternatives and similar repositories for deduper
Users that are interested in deduper are comparing it to the libraries listed below.
- A tiny iothub, simple and useful. A lightweight iothub that uses the classic thing model to simplify IoT application development. Topics: device-shadow, iot, iothub, job, lightweight, mqtt, mysql, shadow, sqlit… ☆33 · May 16, 2025 · Updated last year
- Text classifier for Go, aka document categorization. ☆41 · Nov 27, 2015 · Updated 10 years ago
- find/file similar ☆14 · Sep 15, 2017 · Updated 8 years ago
- Enhanced Markdown template processor ☆15 · Dec 23, 2021 · Updated 4 years ago
- A Python tool to search for and remove duplicated files in messy datasets ☆15 · Dec 23, 2024 · Updated last year
- Easy handling of memory-mapped files ☆22 · Mar 28, 2014 · Updated 12 years ago
- String deduplication package for Go ☆19 · Jan 10, 2024 · Updated 2 years ago
- ☆26 · Nov 9, 2016 · Updated 9 years ago
- Suggester - the heart for full-text auto-complete web services ☆29 · Jul 8, 2014 · Updated 11 years ago
- A terminal recorder that produces files capable of efficient random access ☆14 · Nov 17, 2019 · Updated 6 years ago
- 🕹️ Group and deduplicate concurrent tasks ☆31 · May 15, 2026 · Updated last week
- The central ASPIRE framework repository, start here if you want to use our tools (this contains all tools and documentation) ☆14 · Apr 17, 2021 · Updated 5 years ago
- Fuzzy text searching like Sublime Text ☆26 · Sep 10, 2015 · Updated 10 years ago
- A high performance lock free map type for go. ☆20 · Apr 19, 2018 · Updated 8 years ago
- 3d object viewer with live reload ☆20 · Aug 18, 2025 · Updated 9 months ago
- Python utilities for detecting textual reuse ☆21 · Nov 1, 2015 · Updated 10 years ago
- Content Defined Chunking playground ☆52 · Mar 26, 2026 · Updated 2 months ago
- Utilities for extracting and compressing tgz and zip files. ☆28 · May 19, 2026 · Updated last week
- A Go library implementing a buzhash rolling hash function ☆31 · Aug 16, 2016 · Updated 9 years ago
- gzip indexer for random access into compressed files ☆30 · Jan 4, 2018 · Updated 8 years ago
- A Brainfuck interpreter written in Rust and compiled to WebAssembly ☆10 · Dec 4, 2017 · Updated 8 years ago
- Blockhash perceptual-hash algorithm for images. Written in pure Go. ☆22 · Aug 4, 2020 · Updated 5 years ago
- meta-layer for EBAZ4205 ☆18 · Jun 7, 2021 · Updated 4 years ago
- Go package and CLI to work with VHD images ☆14 · Jul 6, 2020 · Updated 5 years ago
- Wrappers of OS-specific route table manipulation commands. ☆13 · Feb 28, 2018 · Updated 8 years ago
- Self-organizing maps in Go ☆74 · May 28, 2022 · Updated 3 years ago
- ☆15 · Mar 28, 2020 · Updated 6 years ago
- FFI bindings to libudev ☆10 · Feb 28, 2024 · Updated 2 years ago
- This project contains simple methods to measure sample relatedness and identify potential swaps and contamination ☆10 · Jan 8, 2016 · Updated 10 years ago
- Package valuegraph produces a graph representation of any Go value. ☆32 · Feb 15, 2018 · Updated 8 years ago
- WebDAV client for Rust ☆10 · Jun 6, 2018 · Updated 7 years ago
- A transcription to KiCAD of Ray Wilson's synthesizer modules ☆13 · Mar 13, 2018 · Updated 8 years ago
- Exposes Perforce server data as a FUSE filesystem ☆22 · Feb 10, 2016 · Updated 10 years ago
- Modular breadboard holders & Instrumentation panel ☆18 · Dec 15, 2024 · Updated last year
- My collection of Bazel BUILD examples ☆20 · Aug 4, 2022 · Updated 3 years ago
- Go-based VISA resource manager. ☆15 · Apr 16, 2026 · Updated last month
- ☆25 · Feb 9, 2016 · Updated 10 years ago
- Splice VPN access into your default network space ☆16 · Jul 31, 2018 · Updated 7 years ago
- Library to extract text from HTML files ☆11 · Dec 20, 2015 · Updated 10 years ago