ing-bank / sparse_dot_topn
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
☆406 Updated this week
Alternatives and similar repositories for sparse_dot_topn
Users who are interested in sparse_dot_topn compare it to the libraries listed below.
- Super Fast String Matching in Python ☆370 Updated 3 months ago
- Fuzzy string matching, grouping, and evaluation. ☆767 Updated this week
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4 ☆282 Updated 2 years ago
- A collection of tutorials for Snorkel ☆398 Updated 7 months ago
- 📛 Fuzzy Name Matching with Machine Learning ☆264 Updated last year
- Fuzzy matching and more functionality for spaCy. ☆256 Updated last year
- A tool for compiling trained SKLearn models into other representations (such as SQL, Sympy or Excel formulas) ☆173 Updated 2 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python ☆140 Updated 11 months ago
- Doubt your data, find bad labels. ☆513 Updated 11 months ago
- Python package for performing Entity and Text Matching using Deep Learning. ☆594 Updated last year
- A library for debugging/inspecting machine learning classifiers and explaining their predictions ☆299 Updated 2 months ago
- Natural Intelligence is still a pretty good idea. ☆815 Updated 11 months ago
- Simplifies use of the Dedupe library via Pandas ☆136 Updated 2 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy ☆314 Updated 2 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction ☆398 Updated 3 years ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking! ☆472 Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆926 Updated 10 months ago
- ☆191 Updated last year
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook ☆278 Updated 2 years ago
- ☆66 Updated 2 years ago
- Phi_K correlation analyzer library ☆164 Updated 5 months ago
- Data Analysis Baseline Library ☆727 Updated 6 months ago
- Python package for Gower distance ☆78 Updated last year
- just a bunch of useful embeddings for scikit-learn pipelines ☆501 Updated 3 months ago
- Fixes contractions such as `you're` to `you are` ☆318 Updated 2 years ago
- A Python module to convert natural language numerics into ints and floats. ☆228 Updated 9 months ago
- Dataframe Integration with spaCy. ☆103 Updated 4 years ago
- Super Simple Similarities Service ☆149 Updated 3 months ago
- UpliftML: A Python Package for Scalable Uplift Modeling ☆327 Updated 2 years ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎ ☆502 Updated 5 months ago