ing-bank / sparse_dot_topnLinks
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
☆405Updated last month
Alternatives and similar repositories for sparse_dot_topn
Users that are interested in sparse_dot_topn are comparing it to the libraries listed below
Sorting:
- Fuzzy string matching, grouping, and evaluation.☆763Updated 3 weeks ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆283Updated 2 years ago
- A collection of tutorials for Snorkel☆396Updated 6 months ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆501Updated 4 months ago
- Super Fast String Matching in Python☆368Updated 2 months ago
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook☆277Updated last year
- A tool for compiling trained SKLearn models into other representations (such as SQL, Sympy or Excel formulas)☆174Updated 2 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆312Updated last month
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook☆785Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasks☆925Updated 9 months ago
- Data Analysis Baseline Library☆728Updated 5 months ago
- Phi_K correlation analyzer library☆164Updated 4 months ago
- Doubt your data, find bad labels.☆513Updated 10 months ago
- Natural Intelligence is still a pretty good idea.☆813Updated 10 months ago
- Test-Driven Data Analysis Functions☆299Updated 2 weeks ago
- Simplifies use of the Dedupe library via Pandas☆136Updated 2 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆138Updated 10 months ago
- Company Name Processor written in Python☆339Updated last year
- just a bunch of useful embeddings for scikit-learn pipelines☆499Updated 2 months ago
- Fuzzy matching and more functionality for spaCy.☆256Updated 10 months ago
- Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.☆517Updated last month
- Sensible multi-core apply function for Pandas☆82Updated 2 weeks ago
- Data Analysis Baseline Library☆132Updated 7 months ago
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆292Updated last month
- ☆190Updated last year
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆140Updated 2 months ago
- A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels f…☆505Updated 2 months ago
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆225Updated 4 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆84Updated last year