Python package to accelerate the sparse matrix multiplication and top-n similarity selection
☆422Apr 9, 2026Updated 3 weeks ago
Alternatives and similar repositories for sparse_dot_topn
Users that are interested in sparse_dot_topn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spark Monitoring☆13Feb 28, 2023Updated 3 years ago
- Fuzzy string matching, grouping, and evaluation.☆794Jul 10, 2025Updated 9 months ago
- The privacy-preserving record linkage toolkit: a proof-of-concept public demo of next-gen data linkage techniques.☆16May 22, 2024Updated last year
- ISO 20275☆10Oct 22, 2023Updated 2 years ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆511Jan 9, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Abstractions for feature engineering on large graphs of tabular data.☆26Apr 15, 2026Updated 2 weeks ago
- Jupyter Widget to display resources used by the kernels☆13Aug 11, 2021Updated 4 years ago
- Google QUEST Q&A Labeling Kaggle Competition 6th Place Solution☆45Jun 11, 2020Updated 5 years ago
- Ordeq simplifies IO and modularizes pipeline logic.☆41Dec 19, 2025Updated 4 months ago
- Bag of, not words, but tricks!☆68Oct 31, 2023Updated 2 years ago
- Rapid fuzzy string matching in Python using various string metrics☆3,871Apr 20, 2026Updated last week
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,048Feb 21, 2024Updated 2 years ago
- Python wrapper for a C++ Double Metaphone☆15Jan 12, 2026Updated 3 months ago
- ☆13Dec 21, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆868Apr 20, 2026Updated last week
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆93Mar 11, 2026Updated last month
- Set of tools to do parameter estimation from likelihood fits and estimate uncertainties on the fitted parameters or derived quantities.☆15May 22, 2019Updated 6 years ago
- Rich Context leaderboard competition, including the corpus and current SOTA for required tasks.☆22Nov 28, 2020Updated 5 years ago
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends☆2,111Updated this week
- just a bunch of useful embeddings for scikit-learn pipelines☆524Feb 12, 2026Updated 2 months ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,526Apr 18, 2025Updated last year
- Extra blocks for scikit-learn pipelines.☆1,392Apr 21, 2026Updated last week
- 📛 Fuzzy Name Matching with Machine Learning☆268Jun 17, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,455Jul 29, 2025Updated 9 months ago
- Dataframe Integration with spaCy.☆103Mar 12, 2021Updated 5 years ago
- Simple scripts to generate and use an Annoy index and lmdb map☆28Jan 4, 2018Updated 8 years ago
- Doubt your data, find bad labels.☆516Jul 15, 2024Updated last year
- All-pair set similarity search on millions of sets in Python and on a laptop☆604Oct 11, 2022Updated 3 years ago
- Python package for Model Metric Uncertainty estimation☆16Sep 5, 2024Updated last year
- Approximate Nearest Neighbor Search for Sparse Data in Python!☆918Oct 2, 2020Updated 5 years ago
- Yet Another Matplotlib Extension☆15Dec 1, 2021Updated 4 years ago
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW☆2,911Apr 18, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆474Feb 6, 2023Updated 3 years ago
- ☆32Dec 15, 2023Updated 2 years ago
- Entity Linker solution☆1,207Sep 21, 2023Updated 2 years ago
- Light-weight, Python-based data-analysis framework☆12Feb 24, 2019Updated 7 years ago
- Tree-based indexes for neural-search☆33Mar 4, 2024Updated 2 years ago
- kagglerが使いそうなslack emojiをまとめたリポジトリだよ。☆21Feb 20, 2022Updated 4 years ago
- A simple and efficient tool to parallelize Pandas operations on all available CPUs☆3,807Jul 9, 2024Updated last year