A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
☆121Nov 20, 2022Updated 3 years ago
Alternatives and similar repositories for data-science-at-scale
Users that are interested in data-science-at-scale are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Introduction to Dask for PyTorch Workflows☆13Mar 3, 2021Updated 5 years ago
- e-Rum2020::A Unified Approach For Writing Automatic Reports☆20Jun 19, 2020Updated 6 years ago
- Python implementation of Gibbs sampling for the naı̈ve Bayes model presented by Resnik and Hardisty☆14Feb 10, 2018Updated 8 years ago
- The ecosystem of geospatial machine learning tools in the Pangeo world.☆12Mar 17, 2025Updated last year
- Cubed-Sphere data processing with xarray☆18Jan 16, 2020Updated 6 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- This repository contains the resources used for presentation/discussion in weekly iRE Lab meetings.☆14Sep 8, 2017Updated 8 years ago
- A xarray extension to show velocity fields as interactive maps in jupyterlab☆12Dec 2, 2020Updated 5 years ago
- A Panel app to demonstrate distorsions created by non-perceptual colormaps on geophysical data☆11Jan 22, 2026Updated 5 months ago
- ☆12Jan 18, 2019Updated 7 years ago
- ☆53May 25, 2026Updated last month
- Python interface to TileDB Cloud REST API☆15May 5, 2026Updated last month
- A small utility for generating ND array pyramids using Xarray and Zarr.☆120May 1, 2026Updated last month
- ☆116Nov 7, 2022Updated 3 years ago
- ☆12Apr 20, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- JupyterHub deployment for ENGR101 Winter 2018 at Portland Community College☆11Dec 8, 2022Updated 3 years ago
- Deep Learning from Scratch with PyTorch☆121Jul 10, 2020Updated 5 years ago
- A place to provide Coiled feedback☆29Mar 5, 2025Updated last year
- Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21).☆11Apr 13, 2021Updated 5 years ago
- Simple examples of data pipelines from xarray to ML training☆22Dec 19, 2019Updated 6 years ago
- This repository replicates the figures from the 3rd edition of the book "Recursive Macroeconomic Theory" by Lars Ljungqvist and Thomas J.…☆12Feb 9, 2016Updated 10 years ago
- ☆33Aug 14, 2020Updated 5 years ago
- Repo for PyData 2018 tuorial☆12Oct 18, 2018Updated 7 years ago
- Example of using AWS for serverless on-demand seismic processing.☆14Mar 30, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Aug 14, 2022Updated 3 years ago
- Earth System Model Collection specification☆13Feb 3, 2023Updated 3 years ago
- An IPython notebook analysis of the UWC Tampines commercial building dataset☆13Apr 25, 2019Updated 7 years ago
- Easy to use Python library of customized functions for cleaning and analyzing data.☆522Jun 18, 2026Updated last week
- Unmap data from a pseudocolor image, with or without knowing the colormap.☆18Apr 4, 2023Updated 3 years ago
- This repo contains a short version of a dask tutorial.☆12Dec 5, 2022Updated 3 years ago
- Python package to call processed EE objects via the REST API to local data☆36Jun 8, 2024Updated 2 years ago
- ☆43Sep 13, 2021Updated 4 years ago
- Course site for MACS 30250 (Spring 2020) - Perspectives on Computational Research in Economics☆18Jun 3, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Models and parameterizations for the turbulent ocean surface boundary layer in Julia☆25Dec 1, 2022Updated 3 years ago
- ☆45Dec 9, 2025Updated 6 months ago
- ☆21Sep 29, 2021Updated 4 years ago
- Yelmo ice-sheet model code base☆20May 17, 2026Updated last month
- Introduction to Conda for (Data) Scientists☆50Mar 2, 2023Updated 3 years ago
- Gibbs sampling inference to LDA☆19Apr 4, 2014Updated 12 years ago
- Portfolio of Allen Downey at Olin College☆21Dec 13, 2022Updated 3 years ago