A fast python implementation of the SimHash algorithm.
☆27Oct 27, 2021Updated 4 years ago
Alternatives and similar repositories for floc-simhash
Users that are interested in floc-simhash are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Visual Hash for matching copies of visually similar images.☆16Mar 17, 2025Updated last year
- FLoC Simulator☆37Aug 10, 2021Updated 4 years ago
- Add screenshot button to youtube.com☆15Jun 22, 2018Updated 7 years ago
- ☆13Dec 28, 2022Updated 3 years ago
- A rotating socks proxy using Tor, Delegate and Haproxy☆14Apr 8, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Demo Application with DataSUS death records and Streamlit☆11Dec 14, 2019Updated 6 years ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Mar 11, 2021Updated 5 years ago
- A Python port of the Apache Lucene ASCII Folding Filter that converts alphabetic, numeric, and symbolic Unicode characters which are not …☆15May 3, 2020Updated 6 years ago
- Some outlier_detection methods: Robust PCA (RPCA), Randomized RPCA, Robust Autoencoder☆10Dec 10, 2018Updated 7 years ago
- IBGE - Censo 2010 - Localização e respectivo Código de Setor Censitário☆10Apr 3, 2021Updated 5 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Feb 27, 2023Updated 3 years ago
- Implementation of the Tower Method, a novel approach to handling missing values.☆13Mar 12, 2024Updated 2 years ago
- Frontend for ipfs-search.com☆25Oct 7, 2023Updated 2 years ago
- Basis of FragDenStaat.de's „Koalitionstracker“☆15Jul 14, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Support for writing WARC files with Scrapy☆24Dec 21, 2019Updated 6 years ago
- Benson turns a list of URLs into mp3s of the contents of each web page - take control over your reading backlog!☆16Oct 30, 2024Updated last year
- Norwegian Speech Transformer Models☆19Mar 26, 2026Updated last month
- AWS Batch Demo☆18Jul 31, 2018Updated 7 years ago
- A C++17 port of the JavaScript pixelmatch library, providing a small pixel-level image comparison library.☆14Updated this week
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆39Feb 5, 2026Updated 3 months ago
- A trend viewer written in Python/JavaScript☆21Nov 15, 2024Updated last year
- Example configurations for the Community Solid Server☆22Mar 9, 2026Updated last month
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Mar 20, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Python scraping module, that extracts text from articles found in RSS feeds. Uses SQLite as database.☆20Jul 5, 2024Updated last year
- A repository of sample code designed to help you Tweet random dog facts☆15Sep 23, 2022Updated 3 years ago
- ☆15Mar 20, 2020Updated 6 years ago
- Data Lineage Tracing Library☆24Nov 30, 2021Updated 4 years ago
- Multi-index hashing for the resolution of ANN search problem on large datasets☆15Oct 16, 2018Updated 7 years ago
- URL normalization for Python☆100Apr 25, 2026Updated last week
- statically generated weekly digest of articles read in Pocket☆10May 14, 2019Updated 6 years ago
- 一般圖最大權匹配☆11Oct 3, 2016Updated 9 years ago
- A nuxt module to expose Vuex state in the browser URL for easy sharing☆12Aug 28, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Dec 2, 2024Updated last year
- 稀疏矩阵-向量乘的并行优化算法(OpenMP,AVX)☆11Jul 7, 2021Updated 4 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆20Updated this week
- R package for online training of regression models using FTRL Proximal☆12Feb 7, 2017Updated 9 years ago
- DHLAB is a library of python modules for accessing text and pictures at the National Library of Norway.☆26Apr 21, 2026Updated 2 weeks ago
- MIDict (Multi-Index Dict) can be indexed by any "keys" or "values", suitable as a bidirectional/inverse dict or a multi-key/multi-value d…☆14May 19, 2016Updated 9 years ago
- Racket bindings for the Slack API☆10Mar 4, 2019Updated 7 years ago