ing-bank / sparse_dot_topn
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
☆400Updated last month
Alternatives and similar repositories for sparse_dot_topn:
Users that are interested in sparse_dot_topn are comparing it to the libraries listed below
- Super Fast String Matching in Python☆363Updated 8 months ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆975Updated 10 months ago
- Fuzzy string matching, grouping, and evaluation.☆750Updated 3 weeks ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆137Updated 6 months ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆282Updated 2 years ago
- Fuzzy matching and more functionality for spaCy.☆255Updated 6 months ago
- Phi_K correlation analyzer library☆157Updated this week
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆299Updated last year
- Ensemble topic modelling with pLSA☆113Updated 3 years ago
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆268Updated 3 weeks ago
- Data Analysis Baseline Library☆728Updated last month
- Sensible multi-core apply function for Pandas☆79Updated 2 weeks ago
- Abydos NLP/IR library for Python☆184Updated 2 years ago
- Doubt your data, find bad labels.☆508Updated 6 months ago
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook☆276Updated last year
- Time should be taken seer-iously☆312Updated last year
- Natural Intelligence is still a pretty good idea.☆801Updated 6 months ago
- Implementation of statistical models to analyze time lagged conversions☆261Updated 7 months ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆472Updated last year
- Data Analysis Baseline Library☆130Updated 2 months ago
- Compute Sentence Embeddings Fast!☆618Updated last year
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging)☆208Updated last week
- skweak: A software toolkit for weak supervision applied to NLP tasks☆921Updated 4 months ago
- A collection of tutorials for Snorkel☆393Updated last month
- Practical active learning in python☆189Updated 2 years ago
- A drop-in replacement for Scikit-Learn’s GridSearchCV / RandomizedSearchCV -- but with cutting edge hyperparameter tuning techniques.☆467Updated last year
- Google USE (Universal Sentence Encoder) for spaCy☆180Updated last year
- machine learning with logical rules in Python☆625Updated 11 months ago
- Fixes contractions such as `you're` to `you are`☆311Updated 2 years ago