coiled / data-science-at-scale
A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
☆115Updated 2 years ago
Alternatives and similar repositories for data-science-at-scale:
Users that are interested in data-science-at-scale are comparing it to the libraries listed below
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and related☆56Updated 3 years ago
- Dask tutorial material for video tutorial series☆87Updated last year
- Deep Learning from Scratch with PyTorch☆116Updated 4 years ago
- HoloViz tutorial for KDD 2022☆35Updated 2 years ago
- Tutorial material on machine learning with dirty data in Python☆61Updated 9 months ago
- Introduction to scikit-learn: Machine Learning in Python☆20Updated 2 years ago
- Materials for "Parallelizing Scientific Python with Dask"☆70Updated 6 years ago
- ☆15Updated 3 years ago
- ☆29Updated 5 years ago
- It's all in the name☆76Updated last year
- Data Analysis Baseline Library☆131Updated 5 months ago
- Explorations of survival analysis in Python☆50Updated 2 years ago
- ☆38Updated 2 years ago
- ☆34Updated 3 years ago
- Automated Data Science and Machine Learning library to optimize workflow.☆104Updated 2 years ago
- Learn Python through Data Processing in Pandas Tutorial☆38Updated 4 years ago
- Tutorial covering a new workflow available going from pandas to scikit-learn☆40Updated 2 years ago
- Data manipulation, analysis and visualisation in Python - specialist course Doctoral schools of Ghent University☆107Updated 2 months ago
- Talks about vaex☆36Updated 2 years ago
- Website: Data Umbrella & PyMC open source sessions☆26Updated 10 months ago
- How to Interpret SHAP Analyses: A Non-Technical Guide☆52Updated 3 years ago
- Python data science and machine learning from Ted Petrou with Dunder Data☆55Updated 2 years ago
- Scipy 2019 Tutorial☆35Updated 5 years ago
- ☆20Updated 10 months ago
- ☆13Updated 3 years ago
- Repository for a workshop on Bayesian Decision Analysis☆69Updated 2 years ago
- Public code & notebooks accompanying our blog posts & YouTube tutorials (https://www.youtube.com/c/PyMCLabs)☆24Updated 4 months ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆77Updated last year
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆105Updated last year
- Increase citations, ease review & collaboration A collection of "easy wins" to make machine learning in research reproducible. This tut…☆74Updated 4 months ago