ing-bank / sparse_dot_topn
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
☆402Updated this week
Alternatives and similar repositories for sparse_dot_topn:
Users that are interested in sparse_dot_topn are comparing it to the libraries listed below
- Super Fast String Matching in Python☆367Updated last week
- A tool for compiling trained SKLearn models into other representations (such as SQL, Sympy or Excel formulas)☆172Updated 2 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆283Updated 2 years ago
- Personal data science and machine learning toolbox☆365Updated 5 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆136Updated 8 months ago
- ☆65Updated 2 years ago
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆277Updated this week
- UpliftML: A Python Package for Scalable Uplift Modeling☆325Updated last year
- Simplifies use of the Dedupe library via Pandas☆135Updated last year
- Fuzzy string matching, grouping, and evaluation.☆753Updated last month
- Data Analysis Baseline Library☆726Updated 3 months ago
- Group thousands of similar spreadsheet or database text entries in seconds☆156Updated last year
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆499Updated last month
- Easy pipelines for pandas DataFrames.☆717Updated 4 months ago
- Implementation of statistical models to analyze time lagged conversions☆261Updated 9 months ago
- Extra blocks for scikit-learn pipelines.☆1,312Updated this week
- Fast SHAP value computation for interpreting tree-based models☆535Updated last year
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated last year
- A set of data tools in Python☆500Updated 2 months ago
- Time should be taken seer-iously☆314Updated 2 years ago
- Doubt your data, find bad labels.☆509Updated 8 months ago
- ☆286Updated 2 years ago
- Python package for Gower distance☆77Updated 10 months ago
- just a bunch of useful embeddings for scikit-learn pipelines☆486Updated 2 months ago
- Phi_K correlation analyzer library☆162Updated last month
- Python package for Imputation Methods☆248Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasks☆923Updated 6 months ago
- Joblib Apache Spark Backend☆245Updated 7 months ago
- python partial dependence plot toolbox☆851Updated 6 months ago
- Ensemble topic modelling with pLSA☆114Updated 3 years ago