jc-healy / EmbedAllTheThings
Slides and suplemental notebooks for my PyData LA 2019 talk titled "Embed all the things"
☆17Updated 5 years ago
Alternatives and similar repositories for EmbedAllTheThings:
Users that are interested in EmbedAllTheThings are comparing it to the libraries listed below
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆84Updated last year
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆77Updated last year
- Missing data amputation and exploration functions for Python☆68Updated 2 years ago
- Data exploration done quick.☆19Updated 3 years ago
- A small python library that can clump lists of data together.☆149Updated 3 years ago
- Bag of, not words, but tricks!☆68Updated last year
- stratx is a library for A Stratification Approach to Partial Dependence for Codependent Variables☆65Updated 11 months ago
- Ensemble topic modelling with pLSA☆114Updated 3 years ago
- A tool for compiling trained SKLearn models into other representations (such as SQL, Sympy or Excel formulas)☆172Updated 2 years ago
- this repo might get accepted☆28Updated 4 years ago
- Data Analysis Baseline Library☆131Updated 5 months ago
- Clustergram - Visualization and diagnostics for cluster analysis in Python☆123Updated this week
- General Interpretability Package☆58Updated 2 years ago
- Decorators that logs stats.☆110Updated last month
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr…☆121Updated 3 months ago
- OlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning ex…☆51Updated last year
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated 2 years ago
- Altair backend for pandas plotting☆102Updated 4 years ago
- Dask tutorial material for video tutorial series☆87Updated last year
- A pipeline for NLP projects using SkLearn☆24Updated 6 years ago
- Comparing Polars to Pandas and a small introduction☆43Updated 3 years ago
- Confidence intervals for scikit-learn forest algorithms☆286Updated 9 months ago
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆224Updated 4 years ago
- An easy to use waterfall chart function for Python☆162Updated 4 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 3 years ago
- ☆20Updated 9 months ago
- Public home of pycorels, the python binding to CORELS☆78Updated 4 years ago
- Bringing back uncertainty to machine learning.☆51Updated 10 months ago
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects☆81Updated 3 years ago
- ☆26Updated 4 years ago