thomasjpfan / awesome-python-data-science
A curated list of Python libraries used for data science.
☆87 Updated 5 months ago
Related projects
Alternatives and complementary repositories for awesome-python-data-science
- Data Analysis Baseline Library ☆131 Updated last month
- Phi_K correlation analyzer library ☆157 Updated last week
- Extends scikit-learn with new models, transformers, metrics, plotting. ☆69 Updated 2 months ago
- Sensible multi-core apply function for Pandas ☆77 Updated 3 weeks ago
- The easy way to write your own flavor of Pandas ☆301 Updated last month
- A library that unifies the API for most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosyst… ☆151 Updated 9 months ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects ☆104 Updated last year
- Templates for Jupyter notebooks ☆141 Updated 9 months ago
- Tutorial for a new versioning Machine Learning pipeline ☆81 Updated 3 years ago
- Mini module with syntax sugar for pandas/sklearn ☆107 Updated 4 years ago
- Altair backend for pandas plotting ☆102 Updated 3 years ago
- DataFrame support for scikit-learn. ☆63 Updated last year
- A Python grammar for evolutionary algorithms and heuristics ☆187 Updated 2 years ago
- scikit-learn contrib estimators ☆190 Updated last month
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆82 Updated 10 months ago
- stratx is a library for A Stratification Approach to Partial Dependence for Codependent Variables ☆64 Updated 7 months ago
- mlmachine accelerates machine learning experimentation ☆30 Updated 2 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications ☆52 Updated 4 years ago
- Data Analysis Baseline Library ☆724 Updated 3 months ago
- Fast hierarchical clustering routines for R and Python. ☆138 Updated 4 months ago
- Adds partial fit method to sklearn's forest estimators to allow incremental training without being limited to a linear model. Works with … ☆35 Updated 5 months ago
- Automated Data Science and Machine Learning library to optimize workflow. ☆104 Updated last year
- CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system ☆76 Updated last year
- pymc-learn: Practical probabilistic machine learning in Python ☆223 Updated 3 years ago
- Python package for Imputation Methods ☆243 Updated 10 months ago
- Bulwark is a package for convenient property-based testing of pandas dataframes. ☆223 Updated 4 years ago
- Confidence intervals for scikit-learn forest algorithms ☆284 Updated 4 months ago
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking ☆55 Updated 3 years ago
- Clustergram - Visualization and diagnostics for cluster analysis in Python ☆122 Updated last month