coiled / data-science-at-scale
A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
☆118 · Updated 2 years ago
Alternatives and similar repositories for data-science-at-scale
Users who are interested in data-science-at-scale are comparing it to the repositories listed below.
- PyData London 2022 Tutorial ☆68 · Updated 3 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects ☆106 · Updated 2 years ago
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and related ☆56 · Updated 4 years ago
- Increase citations, ease review & collaboration. A collection of "easy wins" to make machine learning in research reproducible. This tut… ☆75 · Updated last month
- Tutorial material on machine learning with dirty data in Python ☆61 · Updated last year
- It's all in the name ☆81 · Updated 2 years ago
- Data Analysis Baseline Library ☆133 · Updated last year
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆82 · Updated last month
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included ☆28 · Updated 4 years ago
- How to Interpret SHAP Analyses: A Non-Technical Guide ☆57 · Updated 4 years ago
- Sample projects using Ploomber. ☆86 · Updated last year
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆86 · Updated last year
- Deep Learning from Scratch with PyTorch ☆120 · Updated 5 years ago
- Templates for jupyter notebooks ☆147 · Updated last year
- Notebooks that support blog posts and tech talks on Dask / Coiled. ☆47 · Updated 8 months ago
- Tutorial covering a new workflow available going from pandas to scikit-learn ☆39 · Updated 2 years ago
- Python data science and machine learning from Ted Petrou with Dunder Data ☆55 · Updated 3 years ago
- ☆133 · Updated last year
- Materials for "Parallelizing Scientific Python with Dask" ☆70 · Updated 7 years ago
- Python package for publishing Jupyter Notebooks as Medium blogposts ☆148 · Updated 2 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications ☆53 · Updated 5 years ago
- Talks about vaex ☆36 · Updated 2 years ago
- 📈 The panel-highcharts package makes it easy to use HighCharts in Python, Notebooks and with HoloViz Panel. ☆160 · Updated 3 years ago
- An abstraction layer for parameter tuning ☆35 · Updated last year
- ☆28 · Updated 6 years ago
- Repository for a workshop on Bayesian Decision Analysis ☆74 · Updated 2 years ago
- ☆15 · Updated 3 years ago
- Scipy 2019 Tutorial ☆36 · Updated 5 years ago
- Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning. ☆105 · Updated 6 years ago
- Clustergram - Visualization and diagnostics for cluster analysis in Python ☆127 · Updated last month