ing-bank / sparse_dot_topn
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
☆399Updated last month
Related projects ⓘ
Alternatives and complementary repositories for sparse_dot_topn
- Fuzzy string matching, grouping, and evaluation.☆748Updated 6 months ago
- Super Fast String Matching in Python☆364Updated 6 months ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆281Updated 2 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆155Updated last year
- A tool for compiling trained SKLearn models into other representations (such as SQL, Sympy or Excel formulas)☆173Updated 2 years ago
- Fuzzy matching and more functionality for spaCy.☆252Updated 4 months ago
- Data Analysis Baseline Library☆724Updated 3 months ago
- Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.☆504Updated last month
- Phi_K correlation analyzer library☆157Updated last week
- Simplifies use of the Dedupe library via Pandas☆136Updated last year
- Easy pipelines for pandas DataFrames.☆716Updated 3 weeks ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆920Updated 2 months ago
- Natural Intelligence is still a pretty good idea.☆798Updated 4 months ago
- 📛 Fuzzy Name Matching with Machine Learning☆257Updated 5 months ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆287Updated last year
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆137Updated 4 months ago
- Ensemble topic modelling with pLSA☆112Updated 3 years ago
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆262Updated 5 months ago
- Tool for interactive embeddings visualization☆300Updated 3 months ago
- Doubt your data, find bad labels.☆503Updated 4 months ago
- The easy way to write your own flavor of Pandas☆301Updated last month
- ☆185Updated 5 months ago
- Google USE (Universal Sentence Encoder) for spaCy☆177Updated last year
- Company Name Processor written in Python☆327Updated 6 months ago
- Sensible multi-core apply function for Pandas☆77Updated 3 weeks ago
- Prepping tables for machine learning☆1,223Updated this week
- Data Analysis Baseline Library☆131Updated last month
- PYthon Automated Term Extraction☆305Updated last year
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆469Updated last year