Locality Sensitive Hashing
☆80May 29, 2026Updated last week
Alternatives and similar repositories for gaoya
Users that are interested in gaoya are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library for squeakily cleaning and filtering language datasets.☆50Jul 10, 2023Updated 2 years ago
- All-in-one text de-duplication☆760Mar 9, 2026Updated 3 months ago
- Rust coder/decoder for Nucleotide Archival Format (NAF) files.☆10Jan 31, 2025Updated last year
- Simple and fast MinHash implementation in C with Python wrapper☆13Jul 24, 2025Updated 10 months ago
- The pipeline for the OSCAR corpus☆177Nov 9, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- HyperLogLog implementations.☆30Aug 11, 2024Updated last year
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆242Updated this week
- spotify/annoy bindings for Rust.☆19May 2, 2023Updated 3 years ago
- HyperTwoBits implementation☆17Aug 29, 2025Updated 9 months ago
- Repository for analysis and experiments in the BigCode project.☆126Mar 20, 2024Updated 2 years ago
- Rust implementation of probminhash, superminhash and hyperloglog sketching algorithms☆31Jan 22, 2026Updated 4 months ago
- ☆20Nov 23, 2022Updated 3 years ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- Evaluation of Sentence Representations in Polish☆23Dec 29, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Learn Rust on AWS or Learn AWS with Rust. Do whatever you would like.☆28Sep 2, 2021Updated 4 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- Web archiving utility library☆11May 5, 2026Updated last month
- ☆23Dec 4, 2023Updated 2 years ago
- Toolkit for manipulating FASTA and SAM files☆23Mar 14, 2024Updated 2 years ago
- Bioinformatics 101 tool for counting unique k-length substrings in DNA☆33Feb 17, 2026Updated 3 months ago
- Viral genome coverage evaluation for metagenomic diagnostics☆27Aug 19, 2025Updated 9 months ago
- ☆15Oct 24, 2023Updated 2 years ago
- A natural-language snippet manager for `vim`☆11Sep 7, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [Inactive] French translation of the Rust Programming Language Book☆15Aug 12, 2019Updated 6 years ago
- An ultrafast and memory efficient tool for phylogenomics☆26Apr 17, 2026Updated last month
- Convert vcf in parquet☆32Jan 23, 2025Updated last year
- Spark Streaming jobs.☆11Mar 10, 2015Updated 11 years ago
- ☆1,273Jul 30, 2024Updated last year
- Concurrency algorithms☆13Apr 7, 2025Updated last year
- Succinct data structures using very efficient rank and select☆133Apr 17, 2026Updated last month
- Example of using next.js, nextauth.js and typescript for both anonymous sessions and authenticated sessions☆10Feb 6, 2024Updated 2 years ago
- Expand / Unshorten an exhaustive list of Shortened URL's☆21Feb 1, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of ip-nsw from Non-metric Similarity Graphs for Maximum Inner Product Search☆41Sep 17, 2018Updated 7 years ago
- ☆14Mar 26, 2026Updated 2 months ago
- Distributed preprocessing and data loading for language datasets☆40Apr 10, 2024Updated 2 years ago
- A Demo server serving Bert through ONNX with GPU written in Rust with <3☆42Jul 30, 2021Updated 4 years ago
- Task-based Parallelism in Rust☆17Nov 17, 2021Updated 4 years ago
- Color theme inspired by the Spacegray theme in Sublime Text☆12Jun 1, 2025Updated last year
- The (B)ig (F)unction (T)axonomy is a detailed reference for common compute functions executed by different libraries, databases, and tool…☆18Dec 12, 2024Updated last year