coiled / data-science-at-scaleLinks
A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
☆119Updated 3 years ago
Alternatives and similar repositories for data-science-at-scale
Users that are interested in data-science-at-scale are comparing it to the libraries listed below
Sorting:
- It's all in the name☆82Updated 2 years ago
- PyData London 2022 Tutorial☆68Updated 3 years ago
- Sample projects using Ploomber.☆86Updated last year
- Talks about vaex☆36Updated 3 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 4 years ago
- Increase citations, ease review & collaboration A collection of "easy wins" to make machine learning in research reproducible. This tut…☆75Updated last month
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and related☆56Updated 4 years ago
- Deep Learning from Scratch with PyTorch☆120Updated 5 years ago
- ☆28Updated 6 years ago
- Explore 120 million taxi trips in real time with Dash and Vaex☆117Updated 5 years ago
- Explorations of survival analysis in Python☆50Updated 2 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆106Updated 2 years ago
- ☆15Updated 3 years ago
- 📈 The panel-highcharts package makes it easy to use HighCharts in Python, Notebooks and with HoloViz Panel.☆158Updated 3 years ago
- Tutorial covering a new workflow available going from pandas to scikit-learn☆39Updated 3 years ago
- How to Interpret SHAP Analyses: A Non-Technical Guide☆57Updated 4 years ago
- Tutorial material on machine learning with dirty data in Python☆61Updated last year
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆84Updated 3 months ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆53Updated 5 years ago
- HoloViz tutorial for KDD 2022☆34Updated 3 years ago
- Explore and compare 1K+ accurate decision trees in your browser!☆169Updated last year
- Structural Time Series on US electricity demand data☆22Updated 4 years ago
- Dask tutorial material for video tutorial series☆87Updated 2 years ago
- Data Analysis Baseline Library☆133Updated last year
- Code samples for the Effective Data Science Infrastructure book☆116Updated 2 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- Notebooks that support blog posts and tech talks on Dask / Coiled.☆48Updated 9 months ago
- Python package for publishing Jupyter Notebooks as Medium blogposts☆149Updated 2 years ago
- ☆134Updated last year
- pipreqs with jupyter notebook support☆71Updated 2 years ago