coiled / data-science-at-scale
A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
☆115 Updated 2 years ago
Alternatives and similar repositories for data-science-at-scale
Users who are interested in data-science-at-scale are comparing it to the repositories listed below.
- PyData London 2022 Tutorial ☆66 Updated 3 years ago
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and related ☆56 Updated 4 years ago
- It's all in the name ☆80 Updated 2 years ago
- ☆28 Updated 5 years ago
- Increase citations, ease review & collaboration. A collection of "easy wins" to make machine learning in research reproducible. This tut… ☆74 Updated 7 months ago
- Deep Learning from Scratch with PyTorch ☆118 Updated 5 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications ☆52 Updated 4 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included ☆28 Updated 3 years ago
- Tutorial material on machine learning with dirty data in Python ☆61 Updated last year
- Notebooks that support blog posts and tech talks on Dask / Coiled. ☆47 Updated 5 months ago
- Sample projects using Ploomber. ☆86 Updated last year
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects ☆107 Updated 2 years ago
- Interactive visualization of machine learning model evaluation metrics ☆63 Updated 5 years ago
- ☆15 Updated 3 years ago
- Talks about vaex ☆36 Updated 2 years ago
- Explore 120 million taxi trips in real time with Dash and Vaex ☆117 Updated 4 years ago
- ☆133 Updated last year
- Tutorial covering a new workflow available going from pandas to scikit-learn ☆40 Updated 2 years ago
- Data manipulation, analysis and visualisation in Python - specialist course Doctoral schools of Ghent University ☆109 Updated 6 months ago
- This repository contains materials for AC295 fall 2020 ☆19 Updated 4 years ago
- Repository for a workshop on Bayesian Decision Analysis ☆71 Updated 2 years ago
- Pre-modelling analysis of the data through exploratory data analysis and statistical tests. ☆51 Updated last year
- SciPy Conference Materials ☆47 Updated 3 weeks ago
- Templates for jupyter notebooks ☆146 Updated last year
- Python package for publishing Jupyter Notebooks as Medium blogposts ☆147 Updated last year
- Explorations of survival analysis in Python ☆50 Updated 2 years ago
- Code samples for the Effective Data Science Infrastructure book ☆115 Updated 2 years ago
- Dask tutorial material for video tutorial series ☆87 Updated last year
- Data Analysis Baseline Library ☆133 Updated 9 months ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆81 Updated last year