coiled / data-science-at-scaleLinks
A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
☆120Updated 3 years ago
Alternatives and similar repositories for data-science-at-scale
Users that are interested in data-science-at-scale are comparing it to the libraries listed below
Sorting:
- Sample projects using Ploomber.☆86Updated 2 years ago
- PyData London 2022 Tutorial☆69Updated 3 years ago
- Talks about vaex☆36Updated 3 years ago
- This repository contains materials for AC295 fall 2020☆19Updated 5 years ago
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and related☆56Updated 4 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 4 years ago
- Deep Learning from Scratch with PyTorch☆121Updated 5 years ago
- Increase citations, ease review & collaboration A collection of "easy wins" to make machine learning in research reproducible. This tut…☆75Updated 2 months ago
- Data manipulation, analysis and visualisation in Python - specialist course Doctoral schools of Ghent University☆114Updated 2 weeks ago
- Notebooks that support blog posts and tech talks on Dask / Coiled.☆48Updated 10 months ago
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.☆65Updated 5 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆84Updated 4 months ago
- How to Interpret SHAP Analyses: A Non-Technical Guide☆58Updated 4 years ago
- It's all in the name☆82Updated 2 years ago
- Data Analysis Baseline Library☆133Updated last year
- Clustergram - Visualization and diagnostics for cluster analysis in Python☆127Updated last week
- Templates for jupyter notebooks☆147Updated last year
- Easy-to-run example notebooks for Dask☆387Updated 2 months ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆107Updated 2 years ago
- Explore 120 million taxi trips in real time with Dash and Vaex☆117Updated 5 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆52Updated 5 years ago
- Tutorial material on machine learning with dirty data in Python☆61Updated last year
- 📈 The panel-highcharts package makes it easy to use HighCharts in Python, Notebooks and with HoloViz Panel.☆159Updated 3 years ago
- ☆28Updated 6 years ago
- Dask tutorial material for video tutorial series☆87Updated 2 years ago
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated 2 years ago
- One day workshop for machine learning with scikit-learn☆62Updated 2 years ago
- Start a data science project with modern tools☆204Updated 2 years ago
- ☆134Updated last year
- Materials for "Parallelizing Scientific Python with Dask"☆70Updated 7 years ago