coiled / data-science-at-scale
A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
☆114 Updated 2 years ago
Alternatives and similar repositories for data-science-at-scale:
Users interested in data-science-at-scale are comparing it to the repositories listed below.
- Deep Learning from Scratch with PyTorch ☆116 Updated 4 years ago
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and related ☆56 Updated 3 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications ☆51 Updated 4 years ago
- Data Analysis Baseline Library ☆131 Updated 4 months ago
- HoloViz tutorial for KDD 2022 ☆35 Updated 2 years ago
- ☆133 Updated 9 months ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆75 Updated last year
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking ☆55 Updated 3 years ago
- Dask tutorial material for video tutorial series ☆87 Updated last year
- Python data science and machine learning from Ted Petrou with Dunder Data ☆54 Updated 2 years ago
- Data manipulation, analysis and visualisation in Python - specialist course Doctoral schools of Ghent University ☆107 Updated last month
- It's all in the name ☆76 Updated last year
- ☆29 Updated 5 years ago
- There are always multiple ways to complete a task in Pandas. A minimal subset of the library is sufficient for almost everything. ☆83 Updated 2 years ago
- Sample projects using Ploomber. ☆86 Updated last year
- Explorations of survival analysis in Python ☆50 Updated 2 years ago
- Tutorial material on machine learning with dirty data in Python ☆61 Updated 8 months ago
- Notebooks that support blog posts and tech talks on Dask / Coiled. ☆47 Updated 2 weeks ago
- One day workshop for machine learning with scikit-learn ☆63 Updated last year
- 📈 The panel-highcharts package makes it easy to use HighCharts in Python, Notebooks and with HoloViz Panel. ☆156 Updated 2 years ago
- Scipy 2019 Tutorial ☆34 Updated 5 years ago
- Increase citations, ease review & collaboration. A collection of "easy wins" to make machine learning in research reproducible. This tut… ☆74 Updated 3 months ago
- Talks about vaex ☆36 Updated 2 years ago
- Automated Data Science and Machine Learning library to optimize workflow. ☆104 Updated 2 years ago
- "Hacking Dask" tutorial materials ☆71 Updated 3 years ago
- Tutorial covering a new workflow available going from pandas to scikit-learn ☆40 Updated 2 years ago
- Materials for "Parallelizing Scientific Python with Dask" ☆70 Updated 6 years ago
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test. ☆51 Updated last year
- Website: Data Umbrella & PyMC open source sessions ☆26 Updated 9 months ago
- Public code & notebooks accompanying our blog posts & YouTube tutorials (https://www.youtube.com/c/PyMCLabs) ☆24 Updated 3 months ago