lmcinnes / enstop
Ensemble topic modelling with pLSA
โ114Updated 3 years ago
Alternatives and similar repositories for enstop:
Users that are interested in enstop are comparing it to the libraries listed below
- ๐งฎ Extended Latent Dirichlet Allocation for Collaborative Filtering in Recommender Systems.โ41Updated 2 years ago
- Scikit-learn compatible Topic Modelling with Hierarchical Statistical Block Models (Gerlach, Peixoto and Altmann, 2018)โ28Updated 6 years ago
- Notebooks configured to be run with Binder, usually found on my blog.โ42Updated last year
- Bag of, not words, but tricks!โ68Updated last year
- Train multi-task image, text, or ensemble (image + text) modelsโ45Updated last year
- Vectorizers for a range of different data typesโ101Updated last month
- Uniform Manifold Approximation with Two-phase Optimization (IEEE VIS 2022 short)โ107Updated 3 months ago
- Fast hierarchical clustering routines for R and Python.โ142Updated 8 months ago
- Explaining dimensionality results using SHAP valuesโ53Updated 2 months ago
- Matrix tools for building and inspecting latent spacesโ27Updated 6 years ago
- Python library for Ceteris Paribus Plots (What-if plots)โ19Updated 3 years ago
- Python implementation of 'Scalable Recommendation with Hierarchical Poisson Factorization'.โ79Updated 2 months ago
- Pipeline components that support partial_fit.โ45Updated 8 months ago
- Scripts for paper "Encoding high-cardinality string categorical variables"โ24Updated 5 years ago
- Python implementation of R package breakDownโ42Updated last year
- Notebooks and data associated to constructing and exploring a map of subreddits.โ55Updated 7 years ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering)โ115Updated 10 months ago
- Example using Polyaxon to experiment with pre-training spaCyโ65Updated 3 years ago
- General Interpretability Packageโ58Updated 2 years ago
- Running Prodigy for a team of annotatorsโ53Updated 4 years ago
- Python Interface of the Scalable Bayesian Rule Listsโ19Updated 5 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing powerโ190Updated last year
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distrโฆโ121Updated 2 months ago
- Official repository of RankEval: An Evaluation and Analysis Framework for Learning-to-Rank Solutions.โ88Updated 4 years ago
- A high performance implementation of HDBSCAN clustering. http://hdbscan.readthedocs.io/en/latest/โ95Updated 7 years ago
- A Python package for hubness analysis and high-dimensional data miningโ44Updated 9 months ago
- Scikit-learn compatible implementations of the Random Rotation Ensemble idea of (Blaser & Fryzlewicz, 2016)โ43Updated 8 years ago
- Simplified tree-based classifier and regressor for interpretable machine learning (scikit-learn compatible)โ47Updated 4 years ago
- Embed categorical variables via neural networks.โ59Updated last year
- scikit-learn gradient-boosting-model interactionsโ25Updated last year