Fast Python Bloom Filter using Mmap
☆135Sep 14, 2025Updated 6 months ago
Alternatives and similar repositories for pybloomfiltermmap3
Users that are interested in pybloomfiltermmap3 are comparing it to the libraries listed below
Sorting:
- Fast Python Bloom Filter using Mmap☆745Nov 4, 2019Updated 6 years ago
- Source code for our LBR paper "Closed-Form Models for Collaborative Filtering with Side-Information" published at RecSys 2020.☆15Jul 22, 2021Updated 4 years ago
- Scalable Bloom Filter implemented in Python☆1,623Jul 1, 2021Updated 4 years ago
- Things and stuff for times, dates and datetimes. Maybe they're useful☆14Aug 1, 2018Updated 7 years ago
- A component that tries to avoid downloading duplicate content☆28Feb 10, 2026Updated last month
- An index data structure for approximate string search.☆23May 6, 2019Updated 6 years ago
- Crawler that retrieves commoncrawl's crawled hosts and their corresponding IPs☆21Sep 1, 2025Updated 6 months ago
- A UserScript to detect GPT generated comments on Hackernews.☆13Dec 10, 2022Updated 3 years ago
- Simple, fast dictionary-based language detector for short texts.☆20Feb 5, 2026Updated last month
- Napkin is a simple tool to produce statistical analysis of a text☆12Feb 25, 2024Updated 2 years ago
- Naïve Bayesian Text Classifier on Redis☆116Jun 19, 2019Updated 6 years ago
- The WordScape repository contains code for the WordScape pipeline to create datasets to train document understanding models.☆39Dec 7, 2023Updated 2 years ago
- Rust implementation of the DCSO Bloom filter☆29Jul 15, 2025Updated 8 months ago
- Python bindings for FarmHash and CityHash☆46Oct 9, 2025Updated 5 months ago
- Train, evaluate, and optimize implicit feedback-based recommender systems.☆31Feb 20, 2026Updated last month
- Data science tools from Moz☆23Jan 11, 2017Updated 9 years ago
- The Eventlog Compendium is the go-to resource for understanding Windows Event Logs.☆53Apr 22, 2025Updated 10 months ago
- Recursive Neural Tensor Networks☆11Feb 3, 2014Updated 12 years ago
- Rust bindings for SQLite’s lsm1 extension in stand-alone manner.☆25Apr 16, 2025Updated 11 months ago
- In this repository, we will present techniques to detect covariate drift, and demonstrate how to incorporate your own custom drift detect…☆13May 26, 2021Updated 4 years ago
- Python bindings for xorfilter(faster and smaller than bloom and cuckoo filters)☆121Jan 1, 2026Updated 2 months ago
- CocktailParty is a data broker system based on phoenix framework☆23Apr 23, 2025Updated 10 months ago
- Implementation of Bayesian Sets for fast similarity searches.☆14Oct 2, 2011Updated 14 years ago
- Proportional request rejection based on load metrics.☆16Apr 18, 2023Updated 2 years ago
- Augmented Interval Tree implemented in Cython/C☆20Jan 17, 2025Updated last year
- Index of URLs to pdf files all over the internet and scripts☆25May 2, 2023Updated 2 years ago
- Base45☆22Feb 20, 2026Updated last month
- ☆12Jan 21, 2017Updated 9 years ago
- full text search engine based on compact data structures☆13Jan 26, 2015Updated 11 years ago
- A tool to archive Stack Exchange sites posts for offline browsing.☆12Nov 9, 2018Updated 7 years ago
- UNMAINTAINED - A simple Python throttling lib relying on the token bucket algorithm☆38Jan 27, 2017Updated 9 years ago
- Pydata MAB Tutorial☆10Jul 6, 2018Updated 7 years ago
- source code for NAACL2022 main conference "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"☆10Sep 26, 2022Updated 3 years ago
- CyCAT.org API back-end server including crawlers☆29Feb 4, 2023Updated 3 years ago
- GNURadio block to determine unkown symbol rates☆19Mar 25, 2022Updated 3 years ago
- A GPU language model, based on btree backed tries.☆29Mar 6, 2018Updated 8 years ago
- ☆10Jan 12, 2018Updated 8 years ago
- Python Binding for xxHash☆453Updated this week
- ☆12Dec 19, 2023Updated 2 years ago