ing-bank / sparse_dot_topnLinks
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
☆413Updated last week
Alternatives and similar repositories for sparse_dot_topn
Users that are interested in sparse_dot_topn are comparing it to the libraries listed below
Sorting:
- Super Fast String Matching in Python☆369Updated 6 months ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆284Updated 3 years ago
- Fuzzy string matching, grouping, and evaluation.☆781Updated 2 months ago
- 📛 Fuzzy Name Matching with Machine Learning☆264Updated last year
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆141Updated last year
- A collection of tutorials for Snorkel☆403Updated 9 months ago
- Phi_K correlation analyzer library☆167Updated this week
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆306Updated 4 months ago
- Fuzzy matching and more functionality for spaCy.☆258Updated last year
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook☆280Updated 2 years ago
- Doubt your data, find bad labels.☆515Updated last year
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆320Updated 4 months ago
- Natural Intelligence is still a pretty good idea.☆821Updated last year
- ☆193Updated last year
- Python package for Gower distance☆79Updated last year
- A Python module to convert natural language numerics into ints and floats.☆228Updated 11 months ago
- A tool for compiling trained SKLearn models into other representations (such as SQL, Sympy or Excel formulas)☆175Updated 2 years ago
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆63Updated 2 years ago
- Simplifies use of the Dedupe library via Pandas☆136Updated 2 years ago
- Sensible multi-core apply function for Pandas☆88Updated last week
- ☆66Updated 2 years ago
- Python package for performing Entity and Text Matching using Deep Learning.☆604Updated last year
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated 2 years ago
- Data Analysis Baseline Library☆133Updated 10 months ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆471Updated 2 years ago
- Fixes contractions such as `you're` to `you are`☆317Updated 2 years ago
- Abydos NLP/IR library for Python☆189Updated 2 years ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆47Updated 7 years ago
- just a bunch of useful embeddings for scikit-learn pipelines☆517Updated last month
- Bag of, not words, but tricks!☆68Updated last year