A fast Python implementation of the SimHash algorithm.
☆27 · Oct 27, 2021 · Updated 4 years ago
Alternatives and similar repositories for floc-simhash
Users who are interested in floc-simhash are comparing it to the libraries listed below.
- Visual Hash for matching copies of visually similar images. ☆16 · Mar 17, 2025 · Updated last year
- Rust implementation of probminhash, superminhash and hyperloglog sketching algorithms ☆31 · Jan 22, 2026 · Updated 4 months ago
- Basis for constructing a new project on top of mu.semte.ch ☆16 · Mar 1, 2026 · Updated 2 months ago
- Scrape and structure raw data from the Norwegian parliament's API. ☆12 · Oct 24, 2025 · Updated 7 months ago
- Translation of query languages to serialized KoralQuery protocol ☆15 · May 14, 2026 · Updated last week
- A Python port of the Apache Lucene ASCII Folding Filter that converts alphabetic, numeric, and symbolic Unicode characters which are not … ☆15 · May 3, 2020 · Updated 6 years ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴) ☆15 · May 3, 2021 · Updated 5 years ago
- Efficient batch-detection of audio sample matching (kind of like shazam, but more involved) ☆10 · Mar 11, 2015 · Updated 11 years ago
- Little side display of Jupyter kernel rich output ☆12 · Sep 17, 2015 · Updated 10 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German ☆12 · Feb 27, 2023 · Updated 3 years ago
- Frontend for ipfs-search.com ☆25 · Oct 7, 2023 · Updated 2 years ago
- semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language … ☆27 · Jul 25, 2024 · Updated last year
- Spark Streaming jobs. ☆11 · Mar 10, 2015 · Updated 11 years ago
- Basis of FragDenStaat.de's „Koalitionstracker“ ☆15 · Jul 14, 2025 · Updated 10 months ago
- Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+ ☆13 · Oct 18, 2025 · Updated 7 months ago
- ProbMinHash – A Class of Locality-Sensitive Hash Algorithms for the (Probability) Jaccard Similarity ☆44 · Oct 26, 2020 · Updated 5 years ago
- Support for writing WARC files with Scrapy ☆24 · Dec 21, 2019 · Updated 6 years ago
- A reddit bot that finds original publish dates on linked articles. ☆10 · Nov 30, 2024 · Updated last year
- ☆18 · Jan 21, 2021 · Updated 5 years ago
- Rust wrapper for the BlingFire tokenization library ☆15 · Jun 23, 2020 · Updated 5 years ago
- Distributed k-nearest Neighbors using Locality Sensitive Hashing and SYCL ☆10 · Jun 7, 2021 · Updated 4 years ago
- A trend viewer written in Python/JavaScript ☆21 · Nov 15, 2024 · Updated last year
- German Drama Corpus ☆11 · May 19, 2026 · Updated last week
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 · Mar 20, 2024 · Updated 2 years ago
- A Python scraping module that extracts text from articles found in RSS feeds. Uses SQLite as database. ☆20 · Jul 5, 2024 · Updated last year
- Implementation of COO, CSR, CSC, SSS and TJDS sparse matrix formats. ☆11 · Jul 15, 2015 · Updated 10 years ago
- A repository of sample code designed to help you Tweet random dog facts ☆15 · Sep 23, 2022 · Updated 3 years ago
- Data Lineage Tracing Library ☆24 · Nov 30, 2021 · Updated 4 years ago
- Multi-index hashing for the resolution of ANN search problem on large datasets ☆15 · Oct 16, 2018 · Updated 7 years ago
- Tool to bulk follow accounts related to Open Science on Mastodon. Runs at https://germanrepro.github.io/Mastodon-OpenScience/ Based on the D… ☆16 · Mar 26, 2026 · Updated 2 months ago
- Monitoring a PyTorch Lightning CNN with Weights & Biases ☆15 · Jul 26, 2021 · Updated 4 years ago
- Dynamic Hashed Blocks (DHB) data structure for dynamic graphs ☆12 · Sep 8, 2025 · Updated 8 months ago
- Random Bingo Sheet for DB delays ☆16 · Oct 3, 2024 · Updated last year
- Maximum weight matching in general graphs ☆11 · Oct 3, 2016 · Updated 9 years ago
- An efficient simhash implementation for python ☆128 · Oct 25, 2019 · Updated 6 years ago
- Parallel optimization algorithms for sparse matrix-vector multiplication (OpenMP, AVX) ☆11 · Jul 7, 2021 · Updated 4 years ago
- A C++ VLSI circuit schematic and layout database library ☆15 · Jul 1, 2024 · Updated last year
- 🔮Getting Started with Pytorch: Text Classification Tutorial ☆15 · Oct 25, 2017 · Updated 8 years ago
- This library contains rectilinear spanning graph construction, finding minimum spanning tree and an implementation of binary search tree ☆10 · Aug 22, 2015 · Updated 10 years ago