lmcinnes / enstop
Ensemble topic modelling with pLSA
☆114Updated 3 years ago
Alternatives and similar repositories for enstop:
Users that are interested in enstop are comparing it to the libraries listed below
- Scikit-learn compatible Topic Modelling with Hierarchical Statistical Block Models (Gerlach, Peixoto and Altmann, 2018)☆28Updated 6 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated 2 years ago
- Matrix tools for building and inspecting latent spaces☆27Updated 6 years ago
- Bag of, not words, but tricks!☆68Updated last year
- 🧮 Extended Latent Dirichlet Allocation for Collaborative Filtering in Recommender Systems.☆41Updated 2 years ago
- Vectorizers for a range of different data types☆101Updated 2 months ago
- Fast hierarchical clustering routines for R and Python.☆145Updated 3 weeks ago
- Python implementation of R package breakDown☆42Updated last year
- A Python package for hubness analysis and high-dimensional data mining☆44Updated 11 months ago
- Pipeline components that support partial_fit.☆46Updated 9 months ago
- Uniform Manifold Approximation with Two-phase Optimization (IEEE VIS 2022 short)☆107Updated 5 months ago
- ☆70Updated 2 years ago
- scikit-learn gradient-boosting-model interactions☆25Updated 2 years ago
- Scikit-learn compatible implementations of the Random Rotation Ensemble idea of (Blaser & Fryzlewicz, 2016)☆43Updated 9 years ago
- Example using Polyaxon to experiment with pre-training spaCy☆65Updated 3 years ago
- Python library for Ceteris Paribus Plots (What-if plots)☆24Updated 4 years ago
- Explaining dimensionality results using SHAP values☆54Updated 3 months ago
- ☆104Updated 6 years ago
- Train multi-task image, text, or ensemble (image + text) models☆45Updated last year
- Super Simple Similarities Service☆149Updated 3 weeks ago
- Official repository of RankEval: An Evaluation and Analysis Framework for Learning-to-Rank Solutions.☆89Updated 4 years ago
- A high performance implementation of HDBSCAN clustering. http://hdbscan.readthedocs.io/en/latest/☆96Updated 7 years ago
- ☆54Updated 3 years ago
- Notebooks and data associated to constructing and exploring a map of subreddits.☆55Updated 8 years ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering)☆115Updated last year
- Scripts for paper "Encoding high-cardinality string categorical variables"☆24Updated 5 years ago
- Tools that make working with scikit-learn and pandas easier.☆44Updated last year
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated 2 years ago
- Jupyter notebook widget to quickly label text data☆47Updated 6 years ago
- Embed categorical variables via neural networks.☆59Updated 2 years ago