coiled / data-science-at-scale
A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
☆113Updated 2 years ago
Alternatives and similar repositories for data-science-at-scale:
Users that are interested in data-science-at-scale are comparing it to the libraries listed below
- PyData London 2022 Tutorial☆66Updated 2 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆104Updated last year
- Tutorial covering a new workflow available going from pandas to scikit-learn☆40Updated 2 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆51Updated 4 years ago
- Dask tutorial material for video tutorial series☆87Updated last year
- HoloViz tutorial for KDD 2022☆35Updated 2 years ago
- Deep Learning from Scratch with PyTorch☆116Updated 4 years ago
- Notebooks that support blog posts and tech talks on Dask / Coiled.☆47Updated last year
- It's all in the name☆76Updated last year
- 📈 The panel-highcharts package makes it easy to use HighCharts in Python, Notebooks and with HoloViz Panel.☆155Updated 2 years ago
- Increase citations, ease review & collaboration A collection of "easy wins" to make machine learning in research reproducible. This tut…☆74Updated 2 months ago
- Scipy 2019 Tutorial☆35Updated 5 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆75Updated last year
- Website: Data Umbrella & PyMC open source sessions☆26Updated 8 months ago
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and related☆56Updated 3 years ago
- An abstraction layer for parameter tuning☆35Updated 5 months ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 3 years ago
- Materials for "Parallelizing Scientific Python with Dask"☆70Updated 6 years ago
- Tutorial material on machine learning with dirty data in Python☆62Updated 7 months ago
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking☆55Updated 3 years ago
- Material for the PyLadies Bayesian Tutorial, Feb 11, 2020☆12Updated 2 years ago
- ☆20Updated 8 months ago
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated last year
- Applied Machine Learning with Python☆77Updated 10 months ago
- Sample projects using Ploomber.☆86Updated last year
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects☆81Updated 3 years ago
- Data Analysis Baseline Library☆130Updated 3 months ago
- Slides, videos and other potentially useful artifacts from various presentations on responsible machine learning.☆22Updated 5 years ago
- ☆29Updated 5 years ago
- ☆133Updated 8 months ago