coiled / data-science-at-scale
A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
☆115Updated 2 years ago
Alternatives and similar repositories for data-science-at-scale
Users who are interested in data-science-at-scale are comparing it to the repositories listed below.
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆77Updated last year
- Dask tutorial material for video tutorial series☆87Updated last year
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆52Updated 4 years ago
- An abstraction layer for parameter tuning☆35Updated 9 months ago
- Tutorial material on machine learning with dirty data in Python☆60Updated 10 months ago
- It's all in the name☆77Updated last year
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and related☆56Updated 3 years ago
- Increase citations, ease review & collaboration. A collection of "easy wins" to make machine learning in research reproducible. This tut…☆74Updated 5 months ago
- Structural Time Series on US electricity demand data☆22Updated 4 years ago
- Deep Learning from Scratch with PyTorch☆117Updated 4 years ago
- ☆29Updated 5 years ago
- ☆134Updated last year
- HoloViz tutorial for KDD 2022☆35Updated 2 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆106Updated 2 years ago
- PyData London 2022 Tutorial☆66Updated 2 years ago
- Data Analysis Baseline Library☆132Updated 7 months ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 3 years ago
- Learn Python through Data Processing in Pandas Tutorial☆38Updated 4 years ago
- Talks about vaex☆36Updated 2 years ago
- ☆47Updated 5 years ago
- ☆38Updated 2 years ago
- Sample projects using Ploomber.☆86Updated last year
- ☆20Updated 11 months ago
- Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.☆105Updated 6 years ago
- Materials for "Parallelizing Scientific Python with Dask"☆70Updated 6 years ago
- Material for the PyLadies Bayesian Tutorial, Feb 11, 2020☆12Updated 2 years ago
- Notebooks that support blog posts and tech talks on Dask / Coiled.☆47Updated 2 months ago
- Demonstration notebooks for the Terality serverless data processing engine (www.terality.com)☆14Updated 3 years ago
- General Interpretability Package☆58Updated 2 years ago