ing-bank / sparse_dot_topn
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
★419, updated 2 weeks ago
Alternatives and similar repositories for sparse_dot_topn
Users who are interested in sparse_dot_topn are comparing it to the libraries listed below.
- Super Fast String Matching in Python (★371, updated 10 months ago)
- Fuzzy Name Matching with Machine Learning (★266, updated last year)
- A comprehensive and scalable set of string tokenizers and similarity measures in Python (★142, updated last year)
- Fuzzy string matching, grouping, and evaluation. (★787, updated 6 months ago)
- Fuzzy matching and more functionality for spaCy. (★258, updated last year)
- A tool for compiling trained SKLearn models into other representations (such as SQL, Sympy or Excel formulas) (★176, updated 3 years ago)
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4 (★286, updated 3 years ago)
- Doubt your data, find bad labels. (★516, updated last year)
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o… (★47, updated 7 years ago)
- Quickly annotate data from the comfort of your Jupyter notebook (★281, updated 2 years ago)
- (★65, updated 3 years ago)
- Simplifies use of the Dedupe library via Pandas (★136, updated 2 years ago)
- A collection of tutorials for Snorkel (★407, updated last year)
- fastText + Bloom embeddings for compact, full-coverage vectors with spaCy (★332, updated 9 months ago)
- Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi… (★63, updated 2 years ago)
- skweak: A software toolkit for weak supervision applied to NLP tasks (★926, updated last year)
- A Python module to convert natural language numerics into ints and floats. (★234, updated last year)
- A library for debugging/inspecting machine learning classifiers and explaining their predictions (★324, updated 9 months ago)
- Phi_K correlation analyzer library (★172, updated 2 weeks ago)
- Python package for Gower distance (★82, updated last year)
- Python package for performing Entity and Text Matching using Deep Learning. (★614, updated last year)
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking! (★476, updated 2 years ago)
- Natural Intelligence is still a pretty good idea. (★822, updated last year)
- Notebooks configured to be run with Binder, usually found on my blog. (★42, updated 2 years ago)
- A small python library that can clump lists of data together. (★148, updated 4 years ago)
- (★193, updated last year)
- Monitor the stability of a Pandas or Spark dataframe (★510, updated 3 weeks ago)
- Sensible multi-core apply function for Pandas (★88, updated last week)
- Data Analysis Baseline Library (★727, updated last year)
- A simple implementation of Apriori algorithm by Python. (★255, updated 4 years ago)