lmcinnes / enstop
Ensemble topic modelling with pLSA
☆114 Updated 3 years ago
Alternatives and similar repositories for enstop:
Users who are interested in enstop are comparing it to the libraries listed below.
- Notebooks configured to be run with Binder, usually found on my blog. ☆42 Updated last year
- Scikit-learn compatible Topic Modelling with Hierarchical Statistical Block Models (Gerlach, Peixoto and Altmann, 2018) ☆28 Updated 5 years ago
- 🧮 Extended Latent Dirichlet Allocation for Collaborative Filtering in Recommender Systems. ☆41 Updated 2 years ago
- Vectorizers for a range of different data types ☆99 Updated last week
- Bag of, not words, but tricks! ☆68 Updated last year
- Scikit-learn compatible implementations of the Random Rotation Ensemble idea of (Blaser & Fryzlewicz, 2016) ☆43 Updated 8 years ago
- Train multi-task image, text, or ensemble (image + text) models ☆45 Updated last year
- Embedding Vector Oriented Clustering ☆125 Updated last week
- Pipeline components that support partial_fit. ☆45 Updated 7 months ago
- Matrix tools for building and inspecting latent spaces ☆27 Updated 6 years ago
- Uniform Manifold Approximation with Two-phase Optimization (IEEE VIS 2022 short) ☆108 Updated 2 months ago
- scikit-learn gradient-boosting-model interactions ☆25 Updated last year
- Explaining dimensionality results using SHAP values ☆53 Updated last month
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆190 Updated last year
- Fast hierarchical clustering routines for R and Python. ☆141 Updated 6 months ago
- ☆58 Updated 2 years ago
- Hierarchical Uniform Manifold Approximation and Projection ☆233 Updated last month
- ☆54 Updated 3 years ago
- Running Prodigy for a team of annotators ☆53 Updated 4 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr… ☆57 Updated 3 years ago
- Example using Polyaxon to experiment with pre-training spaCy ☆65 Updated 3 years ago
- Scripts for paper "Encoding high-cardinality string categorical variables" ☆24 Updated 5 years ago
- Embed categorical variables via neural networks. ☆59 Updated last year
- A Python package for hubness analysis and high-dimensional data mining ☆44 Updated 8 months ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering) ☆115 Updated 9 months ago
- ☆104 Updated 6 years ago
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr… ☆121 Updated last month
- A visual labeling system implemented in Jupyter widgets. ☆148 Updated 3 months ago
- Python implementation of R package breakDown ☆42 Updated last year
- this repo might get accepted ☆29 Updated 4 years ago