gittar / bkmeans
The breathing k-means algorithm (just one source file containing the algorithm as found on pypi)
☆20Updated 2 months ago
Related projects: ⓘ
- The "breathing k-means" algorithm with datasets and example notebooks☆87Updated 2 years ago
- Prune your sklearn models☆19Updated last year
- Pipeline components that support partial_fit.☆42Updated 2 months ago
- Advanced random forest methods in Python☆55Updated 10 months ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆50Updated 8 months ago
- Efficient matrix representations for working with tabular data☆109Updated this week
- Missing data amputation and exploration functions for Python☆64Updated last year
- The simplest way to deploy a machine learning model☆23Updated last year
- Fast implementation of Venn-ABERS probabilistic predictors☆69Updated 7 months ago
- A small collection of lesser-known statistical measures☆35Updated last week
- Notes from my presentation on Python packaging at PyGotham 2021☆21Updated 2 years ago
- Scikit-Learn compatible transformer that turns categorical variables into dense entity embeddings.☆41Updated last year
- Vectorizers for a range of different data types☆92Updated 3 weeks ago
- A proof-of-concept for a RAG to query the scikit-learn documentation☆18Updated last month
- Public notebooks and datasets to accompany the Data Analysis with Polars course on Udemy☆39Updated last year
- Repository for my master thesis on automated string handling☆16Updated 3 years ago
- It's a cooler way to store simple linear models.☆28Updated 2 months ago
- Feature engineering package with sklearn like functionality☆47Updated 2 weeks ago
- ☆108Updated 7 months ago
- An unsupervised feature selection technique using supervised algorithms such as XGBoost☆87Updated 8 months ago
- Toolkit to forge scikit-learn compatible estimators☆17Updated 2 weeks ago
- Supporting material for the book club☆14Updated 2 years ago
- Random Forest or XGBoost? It is Time to Explore LCE☆66Updated last year
- ☆27Updated 2 years ago
- A `select` accessor for easier subsetting of pandas DataFrames and Series☆33Updated last year
- A minimal Python kernel so you can run Python in your Python☆39Updated 2 years ago
- Gzip and nearest neighbors for text classification☆56Updated last year
- pipreqs with jupyter notebook support☆64Updated last year
- Exploring some issues related to churn☆16Updated 6 months ago
- Rethinking machine learning pipelines☆17Updated last month