coiled / data-science-at-scale
A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
☆112Updated 2 years ago
Alternatives and similar repositories for data-science-at-scale:
Users that are interested in data-science-at-scale are comparing it to the libraries listed below
- It's all in the name☆76Updated last year
- ☆132Updated 8 months ago
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and related☆56Updated 3 years ago
- PyData London 2022 Tutorial☆66Updated 2 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆51Updated 4 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆104Updated last year
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆75Updated 11 months ago
- Deep Learning from Scratch with PyTorch☆115Updated 4 years ago
- Increase citations, ease review & collaboration A collection of "easy wins" to make machine learning in research reproducible. This tut…☆73Updated last month
- Workshop on Bayesian inference using PyMC☆27Updated 3 years ago
- ☆29Updated 5 years ago
- ☆15Updated 2 years ago
- HoloViz tutorial for KDD 2022☆35Updated 2 years ago
- Repository for a workshop on Bayesian Decision Analysis☆69Updated last year
- Explorations of survival analysis in Python☆51Updated last year
- Website: Data Umbrella & PyMC open source sessions☆26Updated 8 months ago
- ☆13Updated 3 years ago
- Tutorial covering a new workflow available going from pandas to scikit-learn☆40Updated 2 years ago
- How to Interpret SHAP Analyses: A Non-Technical Guide☆51Updated 3 years ago
- Data Analysis Baseline Library☆130Updated 3 months ago
- A repository used to provide an introduction to dataviz in Python☆53Updated 2 years ago
- Materials for "Parallelizing Scientific Python with Dask"☆70Updated 6 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 3 years ago
- Sample projects using Ploomber.☆85Updated last year
- Tutorial material on machine learning with dirty data in Python☆62Updated 6 months ago
- Pytest for Data Science Beginners☆58Updated 6 years ago
- SciPy Conference Materials☆47Updated 2 weeks ago
- Talks / presentations / tutorials about Fairlearn and fairness in ML☆22Updated 2 years ago
- Data manipulation, analysis and visualisation in Python - specialist course Doctoral schools of Ghent University☆105Updated last week
- Notebooks that support blog posts and tech talks on Dask / Coiled.☆46Updated 11 months ago