Locality Sensitive Hashing
☆80May 29, 2026Updated last month
Alternatives and similar repositories for gaoya
Users that are interested in gaoya are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library for squeakily cleaning and filtering language datasets.☆50Jul 10, 2023Updated 2 years ago
- All-in-one text de-duplication☆764Mar 9, 2026Updated 3 months ago
- ☆77Mar 5, 2025Updated last year
- Simple and fast MinHash implementation in C with Python wrapper☆13Jul 24, 2025Updated 11 months ago
- The pipeline for the OSCAR corpus☆178Nov 9, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- HyperLogLog implementations.☆30Aug 11, 2024Updated last year
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆244Jun 17, 2026Updated 2 weeks ago
- spotify/annoy bindings for Rust.☆19May 2, 2023Updated 3 years ago
- HyperTwoBits implementation☆17Aug 29, 2025Updated 10 months ago
- Repository for analysis and experiments in the BigCode project.☆126Mar 20, 2024Updated 2 years ago
- Generate kmers/minimizers/hashes/MinHash signatures, including with multiple kmer sizes.☆24Jan 9, 2021Updated 5 years ago
- This provides tools for b-bit MinHash algorism.☆39Nov 21, 2025Updated 7 months ago
- ☆20Nov 23, 2022Updated 3 years ago
- GitOps automation for plain old docker compose stack deploy☆20Dec 13, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Web archiving utility library☆11Jun 19, 2026Updated 2 weeks ago
- Toolkit for manipulating FASTA and SAM files☆23Mar 14, 2024Updated 2 years ago
- Elasticsearch plugin for b-bit minhash algorism☆65Jun 17, 2024Updated 2 years ago
- A HyperLogLog implementation in Rust.☆52Feb 9, 2026Updated 4 months ago
- Bioinformatics 101 tool for counting unique k-length substrings in DNA☆33Feb 17, 2026Updated 4 months ago
- Rust wrapper for Microsoft's ONNX Runtime with CUDA support (version 1.7)☆24Jul 3, 2022Updated 4 years ago
- Viral genome coverage evaluation for metagenomic diagnostics☆27Aug 19, 2025Updated 10 months ago
- ☆15Oct 24, 2023Updated 2 years ago
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW☆2,939Jun 23, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A natural-language snippet manager for `vim`☆11Sep 7, 2020Updated 5 years ago
- [Inactive] French translation of the Rust Programming Language Book☆15Aug 12, 2019Updated 6 years ago
- An ultrafast and memory efficient tool for phylogenomics☆26Apr 17, 2026Updated 2 months ago
- data related codebase for polyglot project☆19Mar 30, 2023Updated 3 years ago
- Convert vcf in parquet☆32Jan 23, 2025Updated last year
- Spark Streaming jobs.☆11Mar 10, 2015Updated 11 years ago
- ☆1,270Jul 30, 2024Updated last year
- Concurrency algorithms☆13Apr 7, 2025Updated last year
- Succinct data structures using very efficient rank and select☆133Jun 14, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Example of using next.js, nextauth.js and typescript for both anonymous sessions and authenticated sessions☆10Feb 6, 2024Updated 2 years ago
- Implementation of ip-nsw from Non-metric Similarity Graphs for Maximum Inner Product Search☆41Sep 17, 2018Updated 7 years ago
- Weighted MinHash implementation on CUDA (multi-gpu).☆122Nov 29, 2023Updated 2 years ago
- Task-based Parallelism in Rust☆17Nov 17, 2021Updated 4 years ago
- A Demo server serving Bert through ONNX with GPU written in Rust with <3☆42Jul 30, 2021Updated 4 years ago
- RusTTS is an unofficial Coqui TTS implementation.☆21Aug 12, 2022Updated 3 years ago
- The (B)ig (F)unction (T)axonomy is a detailed reference for common compute functions executed by different libraries, databases, and tool…☆18Dec 12, 2024Updated last year